Sample records for edman sequencing research

  1. N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis.

    PubMed

    Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P; Marians, Kenneth J; Erdjument-Bromage, Hediye

    2016-07-01

    In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods.

  2. Unraveling the sequence and structure of the protein osteocalcin from a 42 ka fossil horse

    NASA Astrophysics Data System (ADS)

    Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Andrews, Philip C.; Leykam, Joseph; Stafford, Thomas W.; Kelly, Robert L.; Walker, Danny N.; Buckley, Mike; Humpula, James

    2006-04-01

    We report the first complete amino acid sequence and evidence of secondary structure for osteocalcin from a temperate fossil. The osteocalcin derives from a 42 ka equid bone excavated from Juniper Cave, Wyoming. Results were determined by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-MS) and Edman sequencing with independent confirmation of the sequence in two laboratories. The ancient sequence was compared to that of three modern taxa: horse ( Equus caballus), zebra ( Equus grevyi), and donkey ( Equus asinus). Although there was no difference in sequence among modern taxa, MALDI-MS and Edman sequencing show that residues 48 and 49 of our modern horse are Thr, Ala rather than Pro, Val as previously reported (Carstanjen B., Wattiez, R., Armory, H., Lepage, O.M., Remy, B., 2002. Isolation and characterization of equine osteocalcin. Ann. Med. Vet.146(1), 31-38). MALDI-MS and Edman sequencing data indicate that the osteocalcin sequence of the 42 ka fossil is similar to that of modern horse. Previously inaccessible structural attributes for ancient osteocalcin were observed. Glu 39 rather than Gln 39 is consistent with deamidation, a process known to occur during fossilization and aging. Two post-translational modifications were documented: Hyp 9 and a disulfide bridge. The latter suggests at least partial retention of secondary structure. As has been done for ancient DNA research, we recommend standards for preparation and criteria for authenticating results of ancient protein sequencing.

  3. A proteomic analysis of leaf sheaths from rice.

    PubMed

    Shen, Shihua; Matsubae, Masami; Takao, Toshifumi; Tanaka, Naoki; Komatsu, Setsuko

    2002-10-01

    The proteins extracted from the leaf sheaths of rice seedlings were separated by 2-D PAGE, and analyzed by Edman sequencing and mass spectrometry, followed by database searching. Image analysis revealed 352 protein spots on 2-D PAGE after staining with Coomassie Brilliant Blue. The amino acid sequences of 44 of 84 proteins were determined; for 31 of these proteins, a clear function could be assigned, whereas for 12 proteins, no function could be assigned. Forty proteins did not yield amino acid sequence information, because they were N-terminally blocked, or the obtained sequences were too short and/or did not give unambiguous results. Fifty-nine proteins were analyzed by mass spectrometry; all of these proteins were identified by matching to the protein database. The amino acid sequences of 19 of 27 proteins analyzed by mass spectrometry were similar to the results of Edman sequencing. These results suggest that 2-D PAGE combined with Edman sequencing and mass spectrometry analysis can be effectively used to identify plant proteins.

  4. Venom characterization of the Amazonian scorpion Tityus metuendus.

    PubMed

    Batista, C V F; Martins, J G; Restano-Cassulini, R; Coronas, F I V; Zamudio, F Z; Procópio, R; Possani, L D

    2018-03-01

    The soluble venom from the scorpion Tityus metuendus was characterized by various methods. In vivo experiments with mice showed that it is lethal. Extended electrophysiological recordings using seven sub-types of human voltage gated sodium channels (hNav1.1 to 1.7) showed that it contains both α- and β-scorpion toxin types. Fingerprint analysis by mass spectrometry identified over 200 distinct molecular mass components. At least 60 sub-fractions were recovered from HPLC separation. Five purified peptides were sequenced by Edman degradation, and their complete primary structures were determined. Additionally, three other peptides have had their N-terminal amino acid sequences determined by Edman degradation and reported. Mass spectrometry analysis of tryptic digestion of the soluble venom permitted the identification of the amino acid sequence of 111 different peptides. Search for similarities of the sequences found indicated that they probably are: sodium and potassium channel toxins, metalloproteinases, hyaluronidases, endothelin and angiotensin-converting enzymes, bradykinin-potentiating peptide, hypothetical proteins, allergens, other enzymes, other proteins and peptides. Copyright © 2018 Elsevier Ltd. All rights reserved.

  5. Isoforms of a cuticular protein from larvae of the meal beetle, Tenebrio molitor, studied by mass spectrometry in combination with Edman degradation and two-dimensional polyacrylamide gel electrophoresis.

    PubMed Central

    Haebel, S.; Jensen, C.; Andersen, S. O.; Roepstorff, P.

    1995-01-01

    Simultaneous sequencing, using a combination of mass spectrometry and Edman degradation, of three approximately 15-kDa variants of a cuticular protein extracted from the meal beetle Tenebrio molitor larva is demonstrated. The information obtained by matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) time-course monitoring of enzymatic digests was found essential to identify the differences among the three variants and for alignment of the peptides in the sequence. To determine whether each individual insect larva contains all three protein variants, proteins extracted from single animals were separated by two-dimensional gel electrophoresis, electroeluted from the gel spots, and analyzed by MALDI MS. Molecular weights of the proteins present in each sample could be obtained, and mass spectrometric mapping of the peptides after digestion with trypsin gave additional information. The protein isoforms were found to be allelic variants. PMID:7795523

  6. Isoforms of a cuticular protein from larvae of the meal beetle, Tenebrio molitor, studied by mass spectrometry in combination with Edman degradation and two-dimensional polyacrylamide gel electrophoresis.

    PubMed

    Haebel, S; Jensen, C; Andersen, S O; Roepstorff, P

    1995-03-01

    Simultaneous sequencing, using a combination of mass spectrometry and Edman degradation, of three approximately 15-kDa variants of a cuticular protein extracted from the meal beetle Tenebrio molitor larva is demonstrated. The information obtained by matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) time-course monitoring of enzymatic digests was found essential to identify the differences among the three variants and for alignment of the peptides in the sequence. To determine whether each individual insect larva contains all three protein variants, proteins extracted from single animals were separated by two-dimensional gel electrophoresis, electroeluted from the gel spots, and analyzed by MALDI MS. Molecular weights of the proteins present in each sample could be obtained, and mass spectrometric mapping of the peptides after digestion with trypsin gave additional information. The protein isoforms were found to be allelic variants.

  7. Complete covalent structure of statherin, a tyrosine-rich acidic peptide which inhibits calcium phosphate precipitation from human parotid saliva.

    PubMed

    Schlesinger, D H; Hay, D I

    1977-03-10

    The complete amino acid sequence of human salivary statherin, a peptide which strongly inhibits precipitation from supersaturated calcium phosphate solutions, and therefore stabilizes supersaturated saliva, has been determined. The NH2-terminal half of this Mr=5380 (43 amino acids) polypeptide was determined by automated Edman degradations (liquid phase) on native statherin. The peptide was digested separately with trypsin, chymotrypsin, and Staphylococcus aureus protease, and the resulting peptides were purified by gel filtration. Manual Edman degradations on purified peptide fragments yielded peptides that completed the amino acid sequence through the penultimate COOH-terminal residue. These analyses, together with carboxypeptidase digestion of native statherin and of peptide fragments of statherin, established the complete sequence of the molecule. The 2 serine residues (positions 2 and 3) in statherin were identified as phosphoserine. The amino acid sequence of human salivary statherin is striking in a number of ways. The NH2-terminal one-third is highly polar and includes three polar dipeptides: H2PO3-Ser-Ser-H2PO3-Arg-Arg-, and Glu-Glu-. The COOH-terminal two-thirds of the molecule is hydrophobic, containing several repeating dipeptides: four of -Gn-Pro-, three of -Tyr-Gln-, two of -Gly-Tyr-, two of-Gln-Tyr-, and two of the tetrapeptide sequence -Pro-Tyr-Gln-Pro-. Unusual cleavage sites in the statherin sequence obtained with chymotrypsin and S. aureus protease were also noted.

  8. Identification of Tumor Suppressor Genes by Genetic and Epigenetic Genome-Scanning

    DTIC Science & Technology

    2008-04-01

    SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT 18. NUMBER OF PAGES 19a. NAME OF RESPONSIBLE PERSON USAMRMC a. REPORT U b. ABSTRACT U...oncogene-related sequences in human neuroblastomas. Cell 35: 359-67; 1983. 3. Capon, D. J.; Seeburg, P. H.; McGrath, J. P.; Hayflick , J. S.; Edman

  9. Characterization, production, and purification of leucocin H, a two-peptide bacteriocin from Leuconostoc MF215B.

    PubMed

    Blom, H; Katla, T; Holck, A; Sletten, K; Axelsson, L; Holo, H

    1999-07-01

    Leuconostoc MF215B was found to produce a two-peptide bacteriocin referred to as leucocin H. The two peptides were termed leucocin Halpha and leucocin Hbeta. When acting together, they inhibit, among others, Listeria monocytogenes, Bacillus cereus, and Clostridium perfringens. Production of leucocin H in growth medium takes place at temperatures down to 6 degrees C and at pH below 7. The highest activity of leucocin H in growth medium was demonstrated in the late exponential growth phase. The bacteriocin was purified by precipitation with ammonium sulfate, ion-exchange (SP Sepharose) and reverse phase chromatography. Upon purification, specific activity increased 10(5)-fold, and the final specific activity was 2 x 10(7) BU/OD280. Amino acid composition analyses of leucocin Halpha and leucocin Hbeta indicated that both peptides consisted of around 40 amino acid residues. Their N-termini were blocked for Edman degradation, and the methionin residues of leucocin Hbeta did not respond to Cyanogen Bromide (CNBr) cleavage. Absorbance at 280 nm indicated the presence of tryptophan residues and tryptophan-fracturing opened for partial sequencing by Edman degradation. From leucocin Halpha, the sequence of 20 amino acids was obtained; from leucocin Hbeta the sequence of 28 amino acid residues was obtained. No sequence homology to other known bacteriocins could be demonstrated. It also appeared that the two peptides themselves shared little or no sequence homology. The presence of soy oil did not affect the activity of leucocin H in agar.

  10. Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

    NASA Astrophysics Data System (ADS)

    Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

    2000-02-01

    Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.

  11. Isolation, purification and functional characterization of alpha-BnIA from Conus bandanus venom.

    PubMed

    Nguyen, Bao; Le Caer, Jean-Pierre; Aráoz, Romulo; Thai, Robert; Lamthanh, Hung; Benoit, Evelyne; Molgó, Jordi

    2014-12-01

    We report the isolation and characterization by proteomic approach of a native conopeptide, named BnIA, from the crude venom of Conus bandanus, a molluscivorous cone snail species, collected in the South central coast of Vietnam. Its primary sequence was determined by matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometry using collision-induced dissociation and confirmed by Edman's degradation of the pure native fraction. BnIA was present in high amounts in the crude venom and the complete sequence of the 16 amino acid peptide was the following GCCSHPACSVNNPDIC*, with C-terminal amidation deduced from Edman's degradation and theoretical monoisotopic mass calculation. Sequence alignment revealed that its -C1C2X4C3X7C4- pattern belongs to the A-superfamily of conopeptides. The cysteine connectivity of BnIA was 1-3/2-4 as determined by partial-reduction technique, like other α4/7-conotoxins, reported previously on other Conus species. Additionally, we found that native α-BnIA shared the same sequence alignment as Mr1.1, from the closely related molluscivorous Conus marmoreus venom, in specimens collected in the same coastal region of Vietnam. Functional studies revealed that native α-BnIA inhibited acetylcholine-evoked currents reversibly in oocytes expressing the human α7 nicotinic acetylcholine receptors, and blocked nerve-evoked skeletal muscle contractions in isolated mouse neuromuscular preparations, but with ∼200-times less potency. Copyright © 2014 Elsevier Ltd. All rights reserved.

  12. Primary structure of Lep d I, the main Lepidoglyphus destructor allergen.

    PubMed

    Varela, J; Ventas, P; Carreira, J; Barbas, J A; Gimenez-Gallego, G; Polo, F

    1994-10-01

    The most relevant allergen of the storage mite Lepidoglyphus destructor (Lep d I) has been characterized. Lep d I is a monomer protein of 13273 Da. The primary structure of Lep d I was determined by N-terminal Edman degradation and partially confirmed by cDNA sequencing. Sequence polymorphism was observed at six positions, with non-conservative substitutions in three of them. No potential N-glycosylation site was revealed by peptide sequencing. The 125-residue sequence of Lep d I shows approximately 40% identity (including the six cysteines) with the overlapping regions of group II allergens from the genus Dermatophagoides, which, however, do not share common allergenic epitopes with Lep d I.

  13. Characterization and analysis of posttranslational modifications of the human large cytoplasmic ribosomal subunit proteins by mass spectrometry and Edman sequencing.

    PubMed

    Odintsova, Tatyana I; Müller, Eva-Christina; Ivanov, Anton V; Egorov, Tsezi A; Bienert, Ralf; Vladimirov, Serguei N; Kostka, Susanne; Otto, Albrecht; Wittmann-Liebold, Brigitte; Karpova, Galina G

    2003-04-01

    The 60S ribosomal proteins were isolated from ribosomes of human placenta and separated by reversed phase HPLC. The fractions obtained were subjected to trypsin and Glu-C digestion and analyzed by mass fingerprinting (MALDI-TOF), MS/MS (ESI), and Edman sequencing. Forty-six large subunit proteins were found, 22 of which showed masses in accordance with the SwissProt database (June 2002) masses (proteins L6, L7, L9, L13, L15, L17, L18, L21, L22, L24, L26, L27, L30, L32, L34, L35, L36, L37, L37A, L38, L39, L41). Eleven (proteins L7, L10A, L11, L12, L13A, L23, L23A, L27A, L28, L29, and P0) resulted in mass changes that are consistent with N-terminal loss of methionine, acetylation, internal methylation, or hydroxylation. A loss of methionine without acetylation was found for protein L8 and L17. For nine proteins (L3, L4, L5, L7A, L10, L14, L19, L31, and L40), the molecular masses could not be determined. Proteins P1 and protein L3-like were not identified by the methods applied.

  14. Cloning and characterization of 2S albumin, Car i 1, a major allergen in pecan.

    PubMed

    Sharma, Girdhari M; Irsigler, Andre; Dhanarajan, Pushparani; Ayuso, Rosalia; Bardina, Luda; Sampson, Hugh A; Roux, Kenneth H; Sathe, Shridhar K

    2011-04-27

    Although pecans are associated with IgE-mediated food allergies, the allergens responsible remain to be identified and characterized. The 2S albumin gene was amplified from the pecan cDNA library. Dot-blots were used to screen the recombinant protein with pecan allergic patients' serum. The affinity purified native protein was analyzed by Edman sequencing and mass spectrometry/mass spectrometry (MS/MS) analysis. Cross-reactivity with walnut was determined by inhibition enzyme-linked immunosorbent assay (ELISA). Sequential epitopes were determined by probing the overlapping peptides with three different patients' serum pool. The 3-dimensional homology model was generated, and the locations of the pecan epitopes were compared with those of known sequential epitopes on other allergenic tree nut homologues. Of 28 patients tested by dot-blot, 22 (79%) bound to 2S albumin, designated as Car i 1. Edman sequencing and the MS/MS sequencing of native 2S albumin confirmed the identity of recombinant (r) Car i 1. Both pecan and walnut protein extracts inhibited the IgE-binding to rCar i 1. Sequential epitope mapping indicated weak, moderate, and strong reactivity against 12, 7, and 5 peptides, respectively. Of the 11 peptides recognized by all serum pools, 5 peptides were strongly reactive and located in 3 discrete regions of the Car i 1 (amino acids 43-57, 67-78, and 106-120). Three-dimensional modeling revealed IgE-reactive epitopes to be solvent accessible and share significant homology with other tree nuts providing a possible basis for previously observed cross-reactivity.

  15. A cardioactive peptide from the southern armyworm, Spodoptera eridania.

    PubMed

    Furuya, K; Hackett, M; Cirelli, M A; Schegg, K M; Wang, H; Shabanowitz, J; Hunt, D F; Schooley, D A

    1999-01-01

    A cardioactive peptide was isolated from extracts of whole heads of the southern armyworm, Spodoptera eridania. This peptide has the sequence ENFAVGCTPGYQRTADGRCKPTF (Mr = 2516.8), determined from both Edman sequencing and tandem mass spectrometry in combination with off-line micropreparative capillary liquid chromatography. This peptide, termed Spoer-CAP23, has excitatory effects on a semi-isolated heart from larval Manduca sexta, causing an inotropic effect at low concentrations of peptide and chronotropic and inotropic effects at high doses. The threshold concentration for stimulatory effects of the synthetic peptide on the semi-isolated heart was about 1 nM, suggesting a physiological role as a neuropeptide.

  16. Protein Sequencing with Tandem Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Ziady, Assem G.; Kinter, Michael

    The recent introduction of electrospray ionization techniques that are suitable for peptides and whole proteins has allowed for the design of mass spectrometric protocols that provide accurate sequence information for proteins. The advantages gained by these approaches over traditional Edman Degradation sequencing include faster analysis and femtomole, sometimes attomole, sensitivity. The ability to efficiently identify proteins has allowed investigators to conduct studies on their differential expression or modification in response to various treatments or disease states. In this chapter, we discuss the use of electrospray tandem mass spectrometry, a technique whereby protein-derived peptides are subjected to fragmentation in the gas phase, revealing sequence information for the protein. This powerful technique has been instrumental for the study of proteins and markers associated with various disorders, including heart disease, cancer, and cystic fibrosis. We use the study of protein expression in cystic fibrosis as an example.

  17. Enterocin T, a novel class IIa bacteriocin produced by Enterococcus sp. 812.

    PubMed

    Chen, Yi-Sheng; Yu, Chi-Rong; Ji, Si-Hua; Liou, Min-Shiuan; Leong, Kun-Hon; Pan, Shwu-Fen; Wu, Hui-Chung; Lin, Yu-Hsuan; Yu, Bi; Yanagida, Fujitoshi

    2013-09-01

    Enterococcus sp. 812, isolated from fresh broccoli, was previously found to produce a bacteriocin active against a number of Gram-positive bacteria, including Listeria monocytogenes. Bacteriocin activity decreased slightly after autoclaving (121 °C for 15 min), but was inactivated by protease K. Mass spectrometry analysis revealed the bacteriocin mass to be approximately 4,521.34 Da. N-terminal amino acid sequencing yielded a partial sequence, NH2-ATYYGNGVYXDKKKXWVEWGQA, by Edman degradation, which contained the consensus class IIa bacteriocin motif YGNGV in the N-terminal region. The obtained partial sequence showed high homology with some enterococcal bacteriocins; however, no identical peptide or protein was found. This peptide was therefore considered to be a novel bacteriocin produced by Enterococcus sp. 812 and was termed enterocin T.

  18. Determination and reoxidation of the disulfide bridges of a squash-type trypsin inhibitor from Sechium edule seeds.

    PubMed

    Faça, Vitor M; Pereira, Sandra R; Laure, Hélen J; Greene, Lewis J

    2004-07-01

    The determination of the disulfide pairings of SETI-II, a trypsin inhibitor isolated from Sechium edule, is described herein. The inhibitor contains 31 amino acid residues per mol, 6 of which are cysteine. Forty-five nmol (160 microg) of SETI-II was hydrolyzed with 20 microg thermolysin for 48 hr at 45 degrees C, and peptides were separated by reverse phase high performance liquid chromatography (RP-HPLC). The major products were identified by amino acid composition, Edman degradation, and on the basis of the sequence of the inhibitor. The disulfide bridge pairings and (yields) are: Cys1-Cys4 (79%), Cys2-Cys5 (21%) and Cys3-Cys6 (43%). When the reduced inhibitor was reoxidized with glutathione reduced form (GSH)/glutathione oxidized form (GSSG) at pH 8.5 for 3 hr, full activity was recovered. These data show that disulfide bridge pairing and oxidation can be determined at nanomole levels and that sensitive and quantitative Edman degradation can eliminate the final time- and material-consuming step of disulfide determinations by eliminating the need to purify and cleave each peptide containing a disulfide bridge.

  19. Peptidomic approach identifies cruzioseptins, a new family of potent antimicrobial peptides in the splendid leaf frog, Cruziohyla calcarifer.

    PubMed

    Proaño-Bolaños, Carolina; Zhou, Mei; Wang, Lei; Coloma, Luis A; Chen, Tianbao; Shaw, Chris

    2016-09-02

    Phyllomedusine frogs are an extraordinary source of biologically active peptides. At least 8 families of antimicrobial peptides have been reported in this frog clade, the dermaseptins being the most diverse. By a peptidomic approach, integrating molecular cloning, Edman degradation sequencing and tandem mass spectrometry, a new family of antimicrobial peptides has been identified in Cruziohyla calcarifer. These 15 novel antimicrobial peptides of 20-32 residues in length are named cruzioseptins. They are characterized by having a unique shared N-terminal sequence GFLD- and the sequence motifs -VALGAVSK- or -GKAAL(N/G/S) (V/A)V- in the middle of the peptide. Cruzioseptins have a broad spectrum of antimicrobial activity and low haemolytic effect. The most potent cruzioseptin was CZS-1 that had a MIC of 3.77μM against the Gram positive bacterium, Staphylococcus aureus and the yeast Candida albicans. In contrast, CZS-1 was 3-fold less potent against the Gram negative bacterium, Escherichia coli (MIC 15.11μM). CZS-1 reached 100% haemolysis at 120.87μM. Skin secretions from unexplored species such as C. calcarifer continue to demonstrate the enormous molecular diversity hidden in the amphibian skin. Some of these novel peptides may provide lead structures for the development of a new class of antibiotics and antifungals of therapeutic use. Through the combination of molecular cloning, Edman degradation sequencing, tandem mass spectrometry and MALDI-TOF MS we have identified a new family of 15 antimicrobial peptides in the skin secretion of Cruziohyla calcarifer. The novel family is named "Cruzioseptins" and contains cationic amphipathic peptides of 20-32 residues. They have a broad range of antimicrobial activity that also includes effective antifungals with low haemolytic activity. Therefore, C. calcarifer has proven to be a rich source of novel peptides, which could become leading structures for the development of novel antibiotics and antifungals of clinical application. Copyright © 2016 Elsevier B.V. All rights reserved.

  20. Partial amino acid sequence of the branched chain amino acid aminotransferase (TmB) of E. coli JA199 pDU11

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Feild, M.J.; Armstrong, F.B.

    1987-05-01

    E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and (/sup 3/H)-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealedmore » limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region.« less

  1. [Hemoglobins, XXXII. Analysis of the primary structure of the monomeric hemoglobin CTT VIIA (erythrocruorin) or Chironomus thummi thummi, Diptera (author's transl)].

    PubMed

    Kleinschmidt, T; Braunitzer, G

    1980-01-01

    The dimeric hemoglobin CTT VIIA (erythrocruorin) was isolated from the hemolymph of the larva from Chironomus thummi thummi and purified by preparative polyacrylamide gel electrophoresis. Peptides obtained by limited tryptical digestion were sequenced by automatic Edman degradation. For the elucidation of the sequence in the C-terminal region of the chain, additional cleavages with proteinase of Staphylococcus aureus and chymotrypsin were necessary. CTT VIIA is compared with human beta-chains and other hemoglobins of Chironomus. The amino acid residues in the pocket are especially discussed. Most of them are invariant in all Chironomus hemoglobins, independent of the size of the heme pocket, which is normal in some components and enlarged in others.

  2. Brain cDNA clone for human cholinesterase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McTiernan, C.; Adkins, S.; Chatonnet, A.

    1987-10-01

    A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum.more » The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.« less

  3. Nucleotide sequence analysis of the gene encoding the Deinococcus radiodurans surface protein, derived amino acid sequence, and complementary protein chemical studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peters, J.; Peters, M.; Lottspeich, F.

    1987-11-01

    The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M/sub r/ estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%)more » of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.« less

  4. Two new bradykinin-related peptides from the venom of the social wasp Protopolybia exigua (Saussure).

    PubMed

    Mendes, Maria Anita; Palma, Mario Sergio

    2006-11-01

    Two bradykinin-related peptides (Protopolybiakinin-I and Protopolybiakinin-II) were isolated from the venom of the social wasp Protopolybia exigua by RP-HPLC, and sequenced by Edman degradation method. Peptide sequences of Protopolybiakinin-I and Protopolybiakinin-II were DKNKKPIRVGGRRPPGFTR-OH and DKNKKPIWMAGFPGFTPIR-OH, respectively. Synthetic peptides with identical sequences to the bradykinin-related peptides and their biological functions were characterized. Protopolybiakinin-I caused less potent constriction of the isolated rat ileum muscles than bradykinin (BK). In addition, it caused degranulation of mast cells which was seven times more potent than BK. This peptide causes algesic effects due to the direct activation of B(2)-receptors. Protopolybiakinin-II is not an agonist of rat ileum muscle and had no algesic effects. However, Protopolybiakinin-II was found to be 10 times more potent as a mast cell degranulator than BK. The amino acid sequence of Protopolybiakinin-I is the longest among the known wasp kinins.

  5. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kem, W.; Dunn, B.; Parten, B.

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residuesmore » (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.« less

  6. Proprotein convertases generate a highly functional heterodimeric form of thymic stromal lymphopoietin in humans.

    PubMed

    Poposki, Julie A; Klingler, Aiko I; Stevens, Whitney W; Peters, Anju T; Hulse, Kathryn E; Grammer, Leslie C; Schleimer, Robert P; Welch, Kevin C; Smith, Stephanie S; Sidle, Douglas M; Conley, David B; Tan, Bruce K; Kern, Robert C; Kato, Atsushi

    2017-05-01

    Thymic stromal lymphopoietin (TSLP) is known to be elevated and truncated in nasal polyps (NPs) of patients with chronic rhinosinusitis and might play a significant role in type 2 inflammation in this disease. However, neither the structure nor the role of the truncated products of TSLP has been studied. We sought to investigate the mechanisms of truncation of TSLP in NPs and the function of the truncated products. We incubated recombinant human TSLP with NP extracts, and determined the protein sequence of the truncated forms of TSLP using Edman protein sequencing and matrix-assisted laser desorption/ionization-time of flight mass spectrometry. We investigated the functional activity of truncated TSLP using a PBMC-based bioassay. Edman sequencing and mass spectrometry results indicated that NP extracts generated 2 major truncated products, TSLP (residues 29-124) and TSLP (131-159). Interestingly, these 2 products remained linked with disulfide bonds and presented as a dimerized form, TSLP (29-124 + 131-159). We identified that members of the proprotein convertase were rate-limiting enzymes in the truncation of TSLP between residues 130 and 131 and generated a heterodimeric unstable metabolite TSLP (29-130 + 131-159). Carboxypeptidase N immediately digested 6 amino acids from the C terminus of the longer subunit of TSLP to generate a stable dimerized form, TSLP (29-124 + 131-159), in NPs. These truncations were homeostatic but primate-specific events. A metabolite TSLP (29-130 + 131-159) strongly activated myeloid dendritic cells and group 2 innate lymphoid cells compared with mature TSLP. Posttranslational modifications control the functional activity of TSLP in humans and overproduction of TSLP may be a key trigger for the amplification of type 2 inflammation in diseases. Copyright © 2016 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.

  7. Shotgun protein sequencing: assembly of peptide tandem mass spectra from mixtures of modified proteins.

    PubMed

    Bandeira, Nuno; Clauser, Karl R; Pevzner, Pavel A

    2007-07-01

    Despite significant advances in the identification of known proteins, the analysis of unknown proteins by MS/MS still remains a challenging open problem. Although Klaus Biemann recognized the potential of MS/MS for sequencing of unknown proteins in the 1980s, low throughput Edman degradation followed by cloning still remains the main method to sequence unknown proteins. The automated interpretation of MS/MS spectra has been limited by a focus on individual spectra and has not capitalized on the information contained in spectra of overlapping peptides. Indeed the powerful shotgun DNA sequencing strategies have not been extended to automated protein sequencing. We demonstrate, for the first time, the feasibility of automated shotgun protein sequencing of protein mixtures by utilizing MS/MS spectra of overlapping and possibly modified peptides generated via multiple proteases of different specificities. We validate this approach by generating highly accurate de novo reconstructions of multiple regions of various proteins in western diamondback rattlesnake venom. We further argue that shotgun protein sequencing has the potential to overcome the limitations of current protein sequencing approaches and thus catalyze the otherwise impractical applications of proteomics methodologies in studies of unknown proteins.

  8. Isolation and Characterization of Acetylated Derivative of Recombinant Insulin Lispro Produced in Escherichia coli.

    PubMed

    Szewczak, Joanna; Bierczyńska-Krzysik, Anna; Piejko, Marcin; Mak, Paweł; Stadnik, Dorota

    2015-07-01

    Insulin lispro is a rapid-acting insulin analogue produced by recombinant DNA technology. As a biosynthetic drug, the protein undergoes strict monitoring aiming for detection and characterization of impurities. The goal of this study was to isolate and identify a derivative of insulin lispro formed during biosynthesis. For this purpose, ion exchange chromatography in combination with endoproteinase Glu-C digestion, MALDI-TOF/TOF mass spectrometry and Edman sequencing were employed. Ion exchange chromatography analysis of related proteins in development batches of recombinant insulin lispro revealed the existence of unknown derivative in excess of the assumed limit. Its molecular mass was 42 Da higher than the theoretical mass of Lys(B31) insulin lispro--one of the expected process-related intermediates. Endoproteinase Glu-C cleavage enabled indication of the modified peptide. Tandem mass spectrometry (MS/MS) allowed to explore the location and type of the modification. The 42 amu shift was present in the mass of y-type ions, while b-type ions were in agreement with theoretical values. It suggested that the modification is present on B31 lysine. Further inquiry revealed the presence of two diagnostic ions for lysine acetylation at m/z 143.1 and 126.1. In addition, the peptide was isolated and sequenced by Edman degradation. Standards of phenylthiohydantoin derivatives of N-ε-acetyl-L-lysine and N-ε-trimethyl-L-lysine, not available commercially, were synthesized in the laboratory. The retention time of the modified residue confirmed its identity as N-ε-acetyl-L-lysine. The derivative of insulin lispro formed during biosynthesis of the drug was identified to be N-ε-acetyl-L-lysine (B31) insulin lispro.

  9. Enterocin 96, a Novel Class II Bacteriocin Produced by Enterococcus faecalis WHE 96, Isolated from Munster Cheese▿

    PubMed Central

    Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd

    2009-01-01

    Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains. PMID:19411428

  10. Enterocin 96, a novel class II bacteriocin produced by Enterococcus faecalis WHE 96, isolated from Munster cheese.

    PubMed

    Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd

    2009-07-01

    Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains.

  11. An improved procedure, involving mass spectrometry, for N-terminal amino acid sequence determination of proteins which are N alpha-blocked.

    PubMed Central

    Rose, K; Kocher, H P; Blumberg, B M; Kolakofsky, D

    1984-01-01

    A modification to a previously described procedure [Gray & del Valle (1970) Biochemistry 9, 2134-2137; Rose, Simona & Offord (1983) Biochem. J. 215, 261-272] for mass-spectral identification of the N-terminal regions of proteins is shown to be useful in cases where the N-terminus is blocked. Three proteins were studied: vesicular-stomatitis-virus N protein, Sendai-virus NP protein, and a rabbit immunoglobulin lambda-light chain. These proteins, found to be blocked at the N-terminus with either the acetyl group or a pyroglutamic acid residue, had all failed to yield to attempted Edman degradation, in one case even after attempted enzymic removal of the pyroglutamic acid residue. The N-terminal regions of all three proteins were sequenced by using the new procedure. PMID:6421284

  12. Isolation and purification of two antioxidant peptides from alcalase hydrolysate of Arca subcrenata.

    PubMed

    Li, Ting-Fei; Ye, Bin; Song, Li-Yan; Yu, Rong-Min

    2014-07-01

    To investigate the constituents with antioxidant activities from alcalase hydrolysate of Arca subcrenata. The consecutive chromatographic methods were employed,including ion-exchange chromatography, gel filtration chromatography, and reverse phase high-performance liquid chromatography (RP-HPLC). The amino acid sequences of the purified antioxidant peptides were determined by automated Edman degradation. Under the guidance of the assay of scavenging free radicals, two peptides with antioxidant activities, termed as A-Bg1 and A-Bh, were isolated and purified from the alcalase hydrolysate of Arca subcrenata. Constituents from the hydrolysate of Arca subcrenata might be a new potential resource of antioxidants.

  13. UNIT 10.7 Electroblotting from Polyacrylamide Gels

    PubMed Central

    Goldman, Aaron; Speicher, David W.

    2015-01-01

    Transferring proteins from polyacrylamide gels onto retentive membranes is now primarily used for immunoblotting. A second application that was quite common up to about a decade ago was electroblotting of proteins for N-terminal and internal sequencing using Edman chemistry. This unit contains procedures for electroblotting proteins from polyacrylamide gels onto a variety of membranes, including polyvinylidene difluoride (PVDF) and nitrocellulose. In addition to the commonly used tank or wet transfer system, protocols are provided for electroblotting using semidry and dry systems. This unit also describes procedures for eluting proteins from membranes using detergents or acidic extraction with organic solvents for specialized applications. PMID:26521711

  14. Electroblotting from Polyacrylamide Gels.

    PubMed

    Goldman, Aaron; Ursitti, Jeanine A; Mozdzanowski, Jacek; Speicher, David W

    2015-11-02

    Transferring proteins from polyacrylamide gels onto retentive membranes is now primarily used for immunoblotting. A second application that was quite common up to about a decade ago was electroblotting of proteins for N-terminal and internal sequencing using Edman chemistry. This unit contains procedures for electroblotting proteins from polyacrylamide gels onto a variety of membranes, including polyvinylidene difluoride (PVDF) and nitrocellulose. In addition to the commonly used tank or wet transfer system, protocols are provided for electroblotting using semidry and dry systems. This unit also describes procedures for eluting proteins from membranes using detergents or acidic extraction with organic solvents for specialized applications. Copyright © 2015 John Wiley & Sons, Inc.

  15. Low molecular weight squash trypsin inhibitors from Sechium edule seeds.

    PubMed

    Laure, Hélen J; Faça, Vítor M; Izumi, Clarice; Padovan, Júlio C; Greene, Lewis J

    2006-02-01

    Nine chromatographic components containing trypsin inhibitor activity were isolated from Sechium edule seeds by acetone fractionation, gel filtration, affinity chromatography and RP-HPLC in an overall yield of 46% of activity and 0.05% of protein. The components obtained with highest yield of total activity and highest specific activity were sequenced by Edman degradation and their molecular masses determined by mass spectrometry. The inhibitors contained 31, 32 and 27 residues per molecule and their sequences were: SETI-IIa, EDRKCPKILMRCKRDSDCLAKCTCQESGYCG; SETI-IIb, EEDRKCPKILMRCKRDSDCLAKCTCQESGYCG and SETI-V, CPRILMKCKLDTDCFPTCTCRPSGFCG. SETI-IIa and SETI-IIb, which differed by an amino-terminal E in the IIb form, were not separable under the conditions employed. The sequences are consistent with consensus sequences obtained from 37 other inhibitors: CPriI1meCk_DSDCla_C_C_G_CG, where capital letters are invariant amino acid residues and lower case letters are the most preserved in this position. SETI-II and SETI-V form complexes with trypsin with a 1:1 stoichiometry and have dissociation constants of 5.4x10(-11)M and 1.1x10(-9)M, respectively.

  16. Biomonitoring of carcinogenic substances: enzymatic digestion of globin for detecting alkylated amino acids

    NASA Astrophysics Data System (ADS)

    Bader, Michael; Rauscher, Dankwart; Geibel, Kurt; Angerer, Juergen

    1993-03-01

    We report the application of proteases for the total hydrolysis of globin with subsequent determination of amino acids. Optimization of the proteolysis was made with respect to enzyme concentration, time of incubation and type of protease. Ethylene oxide modified globin was used to compare the results of the analysis of the N-terminal amino acid valine after enzymatic cleavage to those obtained from the widely used modified Edman procedure. It is shown that the cleavage is of good reproducibility and yields more alkylated amino acid than the Edman procedure.

  17. The removal of pyroglutamic acid from monoclonal antibodies without denaturation of the protein chains.

    PubMed

    Werner, William E; Wu, Sylvia; Mulkerrin, Michael

    2005-07-01

    Typically, the removal of pyroglutamate from the protein chains of immunoglobulins with the enzyme pyroglutamate aminopeptidase requires the use of chaotropic and reducing agents, quite often with limited success. This article describes a series of optimization experiments using elevated temperatures and detergents to denature and stabilize the heavy chains of immunoglobulins such that the pyroglutamate at the amino terminal was accessible to enzymatic removal using the thermostable protease isolated from Pyrococcus furiosus. The detergent polysorbate 20 (Tween 20) was used successfully to facilitate the removal of pyroglutamate residues. A one-step digestion was developed using elevated temperatures and polysorbate 20, rather than chaotropic and reducing agents, with sample cleanup and preparation for Edman sequencing performed using a commercial cartridge containing the PVDF membrane. All of the immunoglobulins digested with this method yielded heavy chain sequence, but the extent of deblocking was immunglobulin dependent (typically>50%).

  18. Frontoxins, three-finger toxins from Micrurus frontalis venom, decrease miniature endplate potential amplitude at frog neuromuscular junction.

    PubMed

    Moreira, K G; Prates, M V; Andrade, F A C; Silva, L P; Beirão, P S L; Kushmerick, C; Naves, L A; Bloch, C

    2010-08-01

    Neurotoxicity is a major symptom of envenomation caused by Brazilian coral snake Micrurus frontalis. Due to the small amount of material that can be collected, no neurotoxin has been fully sequenced from this venom. In this work we report six new three-finger like toxins isolated from the venom of the coral snake M. frontalis which we named Frontoxin (FTx) I-VI. Toxins were purified using multiple steps of RP-HPLC. Molecular masses were determined by MALDI-TOF and ESI ion-trap mass spectrometry. The complete amino acid sequence of FTx II, III, IV and V were determined by sequencing of overlapping proteolytic fragments by Edman degradation and by de novo sequencing. The amino acid sequences of FTx I, II, III and VI predict 4 conserved disulphide bonds and structural similarity to previously reported short-chain alpha-neurotoxins. FTx IV and V each contained 10 conserved cysteines and share high similarity with long-chain alpha-neurotoxins. At the frog neuromuscular junction FTx II, III and IV reduced miniature endplate potential amplitudes in a time-and concentration-dependent manner suggesting Frontoxins block nicotinic acetylcholine receptors. Copyright 2010 Elsevier Ltd. All rights reserved.

  19. Isolation, characterization, and primary structure of rubredoxin from the photosynthetic bacterium, Heliobacillus mobilis

    NASA Technical Reports Server (NTRS)

    Lee, W. Y.; Brune, D. C.; LoBrutto, R.; Blankenship, R. E.

    1995-01-01

    Rubredoxin is a small nonheme iron protein that serves as an electron carrier in bacterial systems. Rubredoxin has now been isolated and characterized from the strictly anaerobic phototroph, Heliobacillus mobilis. THe molecular mass (5671.3 Da from the amino acid sequence) was confirmed and partial formylation of the N-terminal methionyl residue was established by matrix-assisted laser desorption mass spectroscopy. The complete 52-amino-acid sequence was determined by a combination of N-terminal sequencing by Edman degradation and C-terminal sequencing by a novel method using carboxypeptidase treatment in conjunction with amino acid analysis and laser desorption time of flight mass spectrometry. The molar absorption coefficient of Hc. mobilis rubredoxin at 490 nm is 6.9 mM-1 cm-1 and the midpoint redox potential at pH 8.0 is -46 mV. The EPR spectrum of the oxidized form shows resonances at g = 9.66 and 4.30 due to a high-spin ferric iron. The amino acid sequence is homologous to those of rubredoxins from other species, in particular, the gram-positive bacteria, and the phototrophic green sulfur bacteria, and the evolutionary implications of this are discussed.

  20. De novo protein sequencing by combining top-down and bottom-up tandem mass spectra.

    PubMed

    Liu, Xiaowen; Dekker, Lennard J M; Wu, Si; Vanduijn, Martijn M; Luider, Theo M; Tolić, Nikola; Kou, Qiang; Dvorkin, Mikhail; Alexandrova, Sonya; Vyatkina, Kira; Paša-Tolić, Ljiljana; Pevzner, Pavel A

    2014-07-03

    There are two approaches for de novo protein sequencing: Edman degradation and mass spectrometry (MS). Existing MS-based methods characterize a novel protein by assembling tandem mass spectra of overlapping peptides generated from multiple proteolytic digestions of the protein. Because each tandem mass spectrum covers only a short peptide of the target protein, the key to high coverage protein sequencing is to find spectral pairs from overlapping peptides in order to assemble tandem mass spectra to long ones. However, overlapping regions of peptides may be too short to be confidently identified. High-resolution mass spectrometers have become accessible to many laboratories. These mass spectrometers are capable of analyzing molecules of large mass values, boosting the development of top-down MS. Top-down tandem mass spectra cover whole proteins. However, top-down tandem mass spectra, even combined, rarely provide full ion fragmentation coverage of a protein. We propose an algorithm, TBNovo, for de novo protein sequencing by combining top-down and bottom-up MS. In TBNovo, a top-down tandem mass spectrum is utilized as a scaffold, and bottom-up tandem mass spectra are aligned to the scaffold to increase sequence coverage. Experiments on data sets of two proteins showed that TBNovo achieved high sequence coverage and high sequence accuracy.

  1. The primary structure of the hemoglobin of spectacled bear (Tremarctos ornatus, Carnivora).

    PubMed

    Hofmann, O; Braunitzer, G

    1987-08-01

    The complete primary structure of the alpha- and beta-chains of the hemoglobin of Spectacled Bear (Tremarctos ornatus) is presented. Following cleavage of the heme-protein link and chain separation by RP-HPLC, their amino-acid sequences were determined by Edman degradation in liquid- and gas-phase sequenators. The hemoglobin of Spectacled Bear displays only five amino-acid exchanges to that of Polar Bear (Ursus maritimus, Ursinae) and Asiatic Black Bear (Ursus tibetanus, Ursinae) whereas 8 and 12 replacements, respectively, to Giant Panda (Ailuropoda melanoleuca) and Lesser Panda (Ailurus fulgens) can be found. This clearly demonstrates that the Spectacled Bear, the most aberrant bear of the Ursidae, is somewhat intermediate between Pandas and Ursinae.

  2. Isolation and determination of the primary structure of a lectin protein from the serum of the American alligator (Alligator mississippiensis).

    PubMed

    Darville, Lancia N F; Merchant, Mark E; Maccha, Venkata; Siddavarapu, Vivekananda Reddy; Hasan, Azeem; Murray, Kermit K

    2012-02-01

    Mass spectrometry in conjunction with de novo sequencing was used to determine the amino acid sequence of a 35kDa lectin protein isolated from the serum of the American alligator that exhibits binding to mannose. The protein N-terminal sequence was determined using Edman degradation and enzymatic digestion with different proteases was used to generate peptide fragments for analysis by liquid chromatography tandem mass spectrometry (LC MS/MS). Separate analysis of the protein digests with multiple enzymes enhanced the protein sequence coverage. De novo sequencing was accomplished using MASCOT Distiller and PEAKS software and the sequences were searched against the NCBI database using MASCOT and BLAST to identify homologous peptides. MS analysis of the intact protein indicated that it is present primarily as monomer and dimer in vitro. The isolated 35kDa protein was ~98% sequenced and found to have 313 amino acids and nine cysteine residues and was identified as an alligator lectin. The alligator lectin sequence was aligned with other lectin sequences using DIALIGN and ClustalW software and was found to exhibit 58% and 59% similarity to both human and mouse intelectin-1. The alligator lectin exhibited strong binding affinities toward mannan and mannose as compared to other tested carbohydrates. Copyright © 2011 Elsevier Inc. All rights reserved.

  3. Membrane-associated precursor to poliovirus VPg identified by immunoprecipitation with antibodies directed against a synthetic heptapeptide

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Semelr, B.L.; Anderson, C.W.; Hanecak, R.

    1982-02-01

    A synthetic heptapeptide corresponding to the C-terminal sequence of the poliovirus genome protein (VPg) has been linked to bovine serum albumin and used to raise antibodies in rabbits. These antibodies precipitate not only VPg but also at least two more virus-specific polypeptides. The smaller polypeptide, denoted P3-9 (12,000 daltons), has been mapped by Edman degradation and by fragmentation with cyanogen bromide and determined to be the N-terminal cleavage product of polypeptide P3-1b, a precursor to the RNA polymerase. P3-9 contains the sequence of the basic protein VPg (22 amino acids) at its C terminus. As predicted by the known RNAmore » sequence of poliovirus, P3-9 also contains a hydrophobic region of 22 amino acids preceding VPg, an observation suggesting that P3-9 may be membrane-associated. This was confirmed by fractionation of infected cells in the presence or absence of detergent. We speculate that P3-9 may be the donor of VPg to RNA chains in the membrane-bound RNA replication complex.« less

  4. Organization of the hao gene cluster of Nitrosomonas europaea: genes for two tetraheme c cytochromes.

    PubMed

    Bergmann, D J; Arciero, D M; Hooper, A B

    1994-06-01

    The organization of genes for three proteins involved in ammonia oxidation in Nitrosomonas europaea has been investigated. The amino acid sequence of the N-terminal region and four heme-containing peptides produced by proteolysis of the tetraheme cytochrome c554 of N. europaea were determined by Edman degradation. The gene (cycA) encoding this cytochrome is present in three copies per genome (H. McTavish, F. LaQuier, D. Arciero, M. Logan, G. Mundfrom, J.A. Fuchs, and A. B. Hooper, J. Bacteriol. 175:2445-2447, 1993). Three clones, representing at least two copies of cycA, were isolated and sequenced by the dideoxy-chain termination procedure. In both copies, the sequences of 211 amino acids derived from the gene sequence are identical and include all amino acids predicted by the proteolytic peptides. In two copies, the cycA open reading frame (ORF) is followed closely (three bases in one copy) by a second ORF predicted to encode a 28-kDa tetraheme c cytochrome not previously characterized but similar to the nirT gene product of Pseudomonas stutzeri. In one copy of the cycA gene cluster, the second ORF is absent.

  5. Enterocin TW21, a novel bacteriocin from dochi-isolated Enterococcus faecium D081821.

    PubMed

    Chang, S-Y; Chen, Y-S; Pan, S-F; Lee, Y-S; Chang, C-H; Chang, C-H; Yu, B; Wu, H-C

    2013-09-01

    Purification and characterization of a novel bacteriocin produced by strain Enterococcus faecium D081821. Enterococcus faecium D081821, isolated from the traditional Taiwanese fermented food dochi (fermented black beans), was previously found to produce a bacteriocin against Listeria monocytogenes and some Gram-positive bacteria. This bacteriocin, termed enterocin TW21, was purified from culture supernatant by ammonium sulfate precipitation, Sep-Pak C18 cartridge, ion-exchange and gel filtration chromatography. Mass spectrometry analysis showed the mass of the peptide to be approximately 5300·6 Da. The N-terminal amino acid sequencing yielded a partial sequence NH2 -ATYYGNGVYxNTQK by Edman degradation, and it contains the consensus class IIa bacteriocin motif YGNGV in the N-terminal region. The open reading frame (ORF) encoding the bacteriocin was identified from the draft genome sequence of Enterococcus faecium D081821, and sequence analysis of this peptide indicated that enterocin TW21 is a novel bacteriocin. Enterococcus faecium D081821 produced a bacteriocin named enterocin TW21, the molecular weight and amino acid sequence both revealed it to be a novel bacteriocin. A new member of class IIa bacteriocin was identified. This bacteriocin shows great inhibitory ability against L. monocytogenes and could be applied as a natural food preservative. © 2013 The Society for Applied Microbiology.

  6. The tick plasma lectin, Dorin M, is a fibrinogen-related molecule.

    PubMed

    Rego, Ryan O M; Kovár, Vojtĕch; Kopácek, Petr; Weise, Christoph; Man, Petr; Sauman, Ivo; Grubhoffer, Libor

    2006-04-01

    A lectin, named Dorin M, previously isolated and characterized from the hemolymph plasma of the soft tick, Ornithodoros moubata, was cloned and sequenced. The immunofluorescence using confocal microscopy revealed that Dorin M is produced in the tick hemocytes. A tryptic cleavage of Dorin M was performed and the resulting peptide fragments were sequenced by Edman degradation and/or mass spectrometry. Two of three internal peptide sequences displayed a significant similarity to the family of fibrinogen-related molecules. Degenerate primers were designed and used for PCR with hemocyte cDNA as a template. The sequence of the whole Dorin M cDNA was completed by the method of RACE. The tissue-specific expression investigated by RT-PCR revealed that Dorin M, in addition to hemocytes, is significantly expressed in salivary glands. The derived amino-acid sequence clearly shows that Dorin M has a fibrinogen-like domain, and exhibited the most significant similarity with tachylectins 5A and 5B from a horseshoe crab, Tachypleus tridentatus. In addition, other protein and binding characteristics suggest that Dorin M is closely related to tachylectins-5. Since these lectins have been reported to function as non-self recognizing molecules, we believe that Dorin M may play a similar role in an innate immunity of the tick and, possibly, also in pathogen transmission by this vector.

  7. Isolation and characterization of a new bacteriocin, termed enterocin M, produced by environmental isolate Enterococcus faecium AL41.

    PubMed

    Mareková, Mária; Lauková, Andrea; Skaugen, Morten; Nes, Ingolf

    2007-08-01

    The new bacteriocin, termed enterocin M, produced by Enterococcus faecium AL 41 showed a wide spectrum of inhibitory activity against the indicator organisms from different sources. It was purified by (NH4)2SO4 precipitation, cation-exchange chromatography and reverse phase chromatography (FPLC). The purified peptide was sequenced by N-terminal amino acid Edman degradation and a mass spectrometry analysis was performed. By combining the data obtained from amino acid sequence (39 N-terminal amino acid residues was determined) and the molecular weight (determined to be 4628 Da) it was concluded that the purified enterocin M is a new bacteriocin, which is very similar to enterocin P. However, its molecular weight is different from enterocin P (4701.25). Of the first 39 N-terminal residues of enterocin M, valine was found in position 20 and a lysine in position 35, while enterocin P has tryptophane residues in these positions.

  8. The amino acid sequences of carboxypeptidases I and II from Aspergillus niger and their stability in the presence of divalent cations.

    PubMed

    Svendsen, I; Dal Degan, F

    1998-09-08

    The amino acid sequences of serine carboxypeptidase I (CPD-I) and II (CPD-II), respectively, from Aspergillus niger have been determined by conventional Edman degradation of the reduced and vinylpyridinated enzymes and peptides hereof generated by cleavage with cyanogen bromide, iodobenzoic acid, glutamic acid cleaving enzyme, AspN-endoproteinase and EndoLysC proteinase. CPD-I consists of a single peptide chain of 471 amino acid residues, three disulfide bridges and nine N-glycosylated asparaginyl residues, while CPD-II consists of a single peptide chain of 481 amino acid residues, has three disulfide bridges, one free cysteinyl residue and nine glycosylated asparaginyl residues. The enzymes are closely related to carboxypeptidase S3 from Penicillium janthinellum. Both Ca2+ and Mg2+ stabilize CPD-I as well as CPD-II, at basic pH values, Ca2+ being most effective, while the divalent ions have no effect on the activity of the two enzymes.

  9. Isolation and identification of a cardioactive peptide from Tenebrio molitor and Spodoptera eridania.

    PubMed

    Furuya, K; Liao, S; Reynolds, S E; Ota, R B; Hackett, M; Schooley, D A

    1993-12-01

    We isolated several cardioactive peptides from extracts of whole heads of the mealworm, Tenebrio molitor, and the southern armyworm, Spodoptera eridania, using a semi-isolated heart of Manduca sexta for bioassay. We have now isolated from each species the peptide with the strongest effect on rate of contraction of the heart. The peptides were identified using micro Edman sequencing and mass spectrometric methods. This cardioactive peptide has the same primary structure from both species: Pro-Phe-Cys-Asn-Ala-Phe-Thr-Gly-Cys-NH2, a cyclic nonapeptide which is identical to crustacean cardioactive peptide (CCAP) originally isolated from the shore crab, Carcinus maenas, and subsequently isolated from Locusta migratoria and Manduca sexta. This is additional evidence that CCAP has widespread occurrence in arthropoda.

  10. Proteomic analysis of the venom from the fish eating coral snake Micrurus surinamensis: novel toxins, their function and phylogeny.

    PubMed

    Olamendi-Portugal, Timoteo; Batista, Cesar V F; Restano-Cassulini, Rita; Pando, Victoria; Villa-Hernandez, Oscar; Zavaleta-Martínez-Vargas, Alfonso; Salas-Arruz, Maria C; Rodríguez de la Vega, Ricardo C; Becerril, Baltazar; Possani, Lourival D

    2008-05-01

    The protein composition of the soluble venom from the South American fish-eating coral snake Micrurus surinamensis surinamensis, here abbreviated M. surinamensis, was separated by RP-HPLC and 2-DE, and their components were analyzed by automatic Edman degradation, MALDI-TOF and ESI-MS/MS. Approximately 100 different molecules were identified. Sixty-two components possess molecular masses between 6 and 8 kDa, are basically charged molecules, among which are cytotoxins and neurotoxins lethal to fish (Brachidanios rerio). Six new toxins (abbreviated Ms1-Ms5 and Ms11) were fully sequenced. Amino acid sequences similar to the enzymes phospholipase A2 and amino acid oxidase were identified. Over 20 additional peptides were identified by sequencing minor components of the HPLC separation and from 2-DE gels. A functional assessment of the physiological activity of the six toxins was also performed by patch clamp using muscular nicotinic acetylcholine receptor assays. Variable degrees of blockade were observed, most of them reversible. The structural and functional data obtained were used for phylogenetic analysis, providing information on some evolutionary aspects of the venom components of this snake. This contribution increases by a factor of two the total number of alpha-neurotoxins sequenced from the Micrurus genus in currently available literature.

  11. Isolation and Purification of Water Soluble Proteins from Ginger Root (Zingiber officinale) by Two Dimensional Liquid Chromatography

    PubMed Central

    Sandovall, A.O.; Andrews, K.; Wahab, A.; Choudhary, M.I.; Ahmed, A.

    2014-01-01

    The RI-INBRE Centralized Core Facility was established in 2003 and participates annually in Undergraduate Summer Research Program. It provides students hands on research experience in key technologies in biomedical sciences. We present here the isolation and purification of water soluble proteins from ginger, a rhizome of the plant, Zingiber officinale. It is an important ingredient of species used in traditional South Asian cuisines. In Indian, Pakistani and Chinese folk medicine, ginger is used for gastro-intestinal disorders, nausea, vomiting, inflammatory diseases, muscle and joint pain. Limited studies have been reported on the bioactive proteins from ginger extract. The water soluble proteins were extracted from ginger root and successfully purified to homogeneity by using two-dimensional liquid chromatography (FPLC/RP-HPLC) approach. The ginger root was washed with distilled water; skin removed and then emulsified using an electric blender. Sample was stirred for four days at 4°C with and without protease inhibitor. Purification of a 42kDa protein was achieved by employing gel filtration, ion-exchange and reversed phase HPLC. The homogeneity of the protein was confirmed by SDS-PAGE gel electrophoresis and MALDI-TOF mass spectrometry. Future work will be conducted on the protein characterization using mass spectrometry and Edman protein sequencing. Supported by grant 5P20GM103430 from the National Institute of General Medical Sciences, NIH, USA.

  12. Identification of a major continuous epitope of human alpha crystallin

    NASA Technical Reports Server (NTRS)

    Takemoto, L.; Emmons, T.; Spooner, B. S. (Principal Investigator)

    1992-01-01

    Human lens proteins were digested with trypsin or V8 protease, and the resulting peptides resolved on a C18 reverse phase column. Fractions from this column were probed with polyclonal antiserum made against the whole alpha crystallin molecule. Peptides in the seropositive fraction were purified to homogeneity, then characterized by mass spectral analysis and partial Edman degradation. The tryptic and V8 digests contained only one seropositive peptide that was derived from the C-terminal region of the alpha-A molecule. To determine the exact boundaries of the epitope, various size analogues of this region were synthesized and probed with anti-alpha serum. Together, these studies demonstrate that the major continuous epitope of the alpha-A chain includes the sequence KPTSAPS, corresponding to residues 166-172 of the human alpha-A crystallin chain.

  13. Systemic AA amyloidosis in the red fox (Vulpes vulpes).

    PubMed

    Rising, Anna; Cederlund, Ella; Palmberg, Carina; Uhlhorn, Henrik; Gaunitz, Stefan; Nordling, Kerstin; Ågren, Erik; Ihse, Elisabet; Westermark, Gunilla T; Tjernberg, Lars; Jörnvall, Hans; Johansson, Jan; Westermark, Per

    2017-11-01

    Amyloid A (AA) amyloidosis occurs spontaneously in many mammals and birds, but the prevalence varies considerably among different species, and even among subgroups of the same species. The Blue fox and the Gray fox seem to be resistant to the development of AA amyloidosis, while Island foxes have a high prevalence of the disease. Herein, we report on the identification of AA amyloidosis in the Red fox (Vulpes vulpes). Edman degradation and tandem MS analysis of proteolyzed amyloid protein revealed that the amyloid partly was composed of full-length SAA. Its amino acid sequence was determined and found to consist of 111 amino acid residues. Based on inter-species sequence comparisons we found four residue exchanges (Ser31, Lys63, Leu71, Lys72) between the Red and Blue fox SAAs. Lys63 seems unique to the Red fox SAA. We found no obvious explanation to how these exchanges might correlate with the reported differences in SAA amyloidogenicity. Furthermore, in contrast to fibrils from many other mammalian species, the isolated amyloid fibrils from Red fox did not seed AA amyloidosis in a mouse model. © 2017 The Protein Society.

  14. A Novel Factor Xa-Inhibiting Peptide from Centipedes Venom.

    PubMed

    Kong, Yi; Shao, Yu; Chen, Hao; Ming, Xin; Wang, Jin-Bin; Li, Zhi-Yu; Wei, Ji-Fu

    2013-01-01

    Centipedes have been used as traditional medicine for thousands of years in China. Centipede venoms consist of many biochemical peptides and proteins. Factor Xa (FXa) is a serine endopeptidase that plays the key role in blood coagulation, and has been used as a new target for anti-thrombotic drug development. A novel FXa inhibitor, a natural peptide with the sequence of Thr-Asn-Gly-Tyr-Thr (TNGYT), was isolated from the venom of Scolopendra subspinipes mutilans using a combination of size-exclusion and reverse-phase chromatography. The molecular weight of the TNGYT peptide was 554.3 Da measured by electrospray ionization mass spectrometry. The amino acid sequence of TNGYT was determined by Edman degradation. TNGYT inhibited the activity of FXa in a dose-dependent manner with an IC 50 value of 41.14 mg/ml. It prolonged the partial thromboplastin time and prothrombin time in both in vitro and ex vivo assays. It also significantly prolonged whole blood clotting time and bleeding time in mice. This is the first report that an FXa inhibiting peptide was isolated from centipedes venom.

  15. The primary structure of rat liver ribosomal protein L37. Homology with yeast and bacterial ribosomal proteins.

    PubMed

    Lin, A; McNally, J; Wool, I G

    1983-09-10

    The covalent structure of the rat liver 60 S ribosomal subunit protein L37 was determined. Twenty-four tryptic peptides were purified and the sequence of each was established; they accounted for all 111 residues of L37. The sequence of the first 30 residues of L37, obtained previously by automated Edman degradation of the intact protein, provided the alignment of the first 9 tryptic peptides. Three peptides (CN1, CN2, and CN3) were produced by cleavage of protein L37 with cyanogen bromide. The sequence of CN1 (65 residues) was established from the sequence of secondary peptides resulting from cleavage with trypsin and chymotrypsin. The sequence of CN1 in turn served to order tryptic peptides 1 through 14. The sequence of CN2 (15 residues) was determined entirely by a micromanual procedure and allowed the alignment of tryptic peptides 14 through 18. The sequence of the NH2-terminal 28 amino acids of CN3 (31 residues) was determined; in addition the complete sequences of the secondary tryptic and chymotryptic peptides were done. The sequence of CN3 provided the order of tryptic peptides 18 through 24. Thus the sequence of the three cyanogen bromide peptides also accounted for the 111 residues of protein L37. The carboxyl-terminal amino acids were identified after carboxypeptidase A treatment. There is a disulfide bridge between half-cystinyl residues at positions 40 and 69. Rat liver ribosomal protein L37 is homologous with yeast YP55 and with Escherichia coli L34. Moreover, there is a segment of 17 residues in rat L37 that occurs, albeit with modifications, in yeast YP55 and in E. coli S4, L20, and L34.

  16. Mass fingerprinting of the venom and transcriptome of venom gland of scorpion Centruroides tecomanus.

    PubMed

    Valdez-Velázquez, Laura L; Quintero-Hernández, Verónica; Romero-Gutiérrez, Maria Teresa; Coronas, Fredy I V; Possani, Lourival D

    2013-01-01

    Centruroides tecomanus is a Mexican scorpion endemic of the State of Colima, that causes human fatalities. This communication describes a proteome analysis obtained from milked venom and a transcriptome analysis from a cDNA library constructed from two pairs of venom glands of this scorpion. High perfomance liquid chromatography separation of soluble venom produced 80 fractions, from which at least 104 individual components were identified by mass spectrometry analysis, showing to contain molecular masses from 259 to 44,392 Da. Most of these components are within the expected molecular masses for Na(+)- and K(+)-channel specific toxic peptides, supporting the clinical findings of intoxication, when humans are stung by this scorpion. From the cDNA library 162 clones were randomly chosen, from which 130 sequences of good quality were identified and were clustered in 28 contigs containing, each, two or more expressed sequence tags (EST) and 49 singlets with only one EST. Deduced amino acid sequence analysis from 53% of the total ESTs showed that 81% (24 sequences) are similar to known toxic peptides that affect Na(+)-channel activity, and 19% (7 unique sequences) are similar to K(+)-channel especific toxins. Out of the 31 sequences, at least 8 peptides were confirmed by direct Edman degradation, using components isolated directly from the venom. The remaining 19%, 4%, 4%, 15% and 5% of the ESTs correspond respectively to proteins involved in cellular processes, antimicrobial peptides, venom components, proteins without defined function and sequences without similarity in databases. Among the cloned genes are those similar to metalloproteinases.

  17. Leptoglycin: a new Glycine/Leucine-rich antimicrobial peptide isolated from the skin secretion of the South American frog Leptodactylus pentadactylus (Leptodactylidae).

    PubMed

    Sousa, Juliana C; Berto, Raquel F; Gois, Elicélia A; Fontenele-Cardi, Nauíla C; Honório, José E R; Konno, Katsuhiro; Richardson, Michael; Rocha, Marcos F G; Camargo, Antônio A C M; Pimenta, Daniel C; Cardi, Bruno A; Carvalho, Krishnamurti M

    2009-07-01

    Antimicrobial peptides are components of innate immunity that is the first-line defense against invading pathogens for a wide range of organisms. Here, we describe the isolation, biological characterization and amino acid sequencing of a novel neutral Glycine/Leucine-rich antimicrobial peptide from skin secretion of Leptodactylus pentadactylus named leptoglycin. The amino acid sequence of the peptide purified by RP-HPLC (C(18) column) was deduced by mass spectrometric de novo sequencing and confirmed by Edman degradation: GLLGGLLGPLLGGGGGGGGGLL. Leptoglycin was able to inhibit the growth of Gram-negative bacteria Pseudomonas aeruginosa, Escherichia coli and Citrobacter freundii with minimal inhibitory concentrations (MICs) of 8 microM, 50 microM, and 75 microM respectively, but it did not show antimicrobial activity against Gram-positive bacteria (Staphylococcus aureus, Micrococcus luteus and Enterococcus faecalis), yeasts (Candida albicans and Candida tropicalis) and dermatophytes fungi (Microsporum canis and Trichophyton rubrum). No hemolytic activity was observed at the 2-200 microM range concentration. The amino acid sequence of leptoglycin with high level of glycine (59.1%) and leucine (36.4%) containing an unusual central proline suggests the existence of a new class of Gly/Leu-rich antimicrobial peptides. Taken together, these results suggest that this natural antimicrobial peptide could be a tool to develop new antibiotics.

  18. Purification, characterization and molecular cloning of chymotrypsin inhibitor peptides from the venom of Burmese Daboia russelii siamensis.

    PubMed

    Guo, Chun-Teng; McClean, Stephen; Shaw, Chris; Rao, Ping-Fan; Ye, Ming-Yu; Bjourson, Anthony J

    2013-05-01

    One novel Kunitz BPTI-like peptide designated as BBPTI-1, with chymotrypsin inhibitory activity was identified from the venom of Burmese Daboia russelii siamensis. It was purified by three steps of chromatography including gel filtration, cation exchange and reversed phase. A partial N-terminal sequence of BBPTI-1, HDRPKFCYLPADPGECLAHMRSF was obtained by automated Edman degradation and a Ki value of 4.77nM determined. Cloning of BBPTI-1 including the open reading frame and 3' untranslated region was achieved from cDNA libraries derived from lyophilized venom using a 3' RACE strategy. In addition a cDNA sequence, designated as BBPTI-5, was also obtained. Alignment of cDNA sequences showed that BBPTI-5 exhibited an identical sequence to BBPTI-1 cDNA except for an eight nucleotide deletion in the open reading frame. Gene variations that represented deletions in the BBPTI-5 cDNA resulted in a novel protease inhibitor analog. Amino acid sequence alignment revealed that deduced peptides derived from cloning of their respective precursor cDNAs from libraries showed high similarity and homology with other Kunitz BPTI proteinase inhibitors. BBPTI-1 and BBPTI-5 consist of 60 and 66 amino acid residues respectively, including six conserved cysteine residues. As these peptides have been reported to have influence on the processes of coagulation, fibrinolysis and inflammation, their potential application in biomedical contexts warrants further investigation. Copyright © 2013 Elsevier Inc. All rights reserved.

  19. Investigation of the protein osteocalcin of Camelops hesternus: Sequence, structure and phylogenetic implications

    NASA Astrophysics Data System (ADS)

    Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.

    2007-12-01

    Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct Camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel ( Camelus bactrianus); dromedary camel ( Camelus dromedarius) and guanaco ( Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.

  20. Detection of Proteins on Blot Membranes

    PubMed Central

    Goldman, Aaron; Harper, Sandra; Speicher, David W.

    2017-01-01

    Staining of blot membranes enables the visualization of bound proteins. Proteins are usually transferred to blot membranes by electroblotting, by direct spotting of protein solutions, or by contact blots. Staining allows the efficiency of transfer to the membrane to be monitored. This unit describes protocols for staining proteins after electroblotting from polyacrylamide gels to blot membranes such as polyvinylidene difluoride (PVDF), nitrocellulose, or nylon membranes. The same methods can be used if proteins are directly spotted, either manually or using robotics. Protocols are included for seven general protein stains (amido black, Coomassie blue, Ponceau S, colloidal gold, colloidal silver, India ink, and MemCode) and three fluorescent protein stains (fluorescamine, IAEDANS, and SYPRO Ruby). Also included is an in-depth discussion of the different blot membrane types and the compatibility of different protein stains with downstream applications, such as immunoblotting or N-terminal Edman sequencing. PMID:27801518

  1. Detection of Proteins on Blot Membranes.

    PubMed

    Goldman, Aaron; Harper, Sandra; Speicher, David W

    2016-11-01

    Staining of blot membranes enables the visualization of bound proteins. Proteins are usually transferred to blot membranes by electroblotting, by direct spotting of protein solutions, or by contact blots. Staining allows the efficiency of transfer to the membrane to be monitored. This unit describes protocols for staining proteins after electroblotting from polyacrylamide gels to blot membranes such as polyvinylidene difluoride (PVDF), nitrocellulose, or nylon membranes. The same methods can be used if proteins are directly spotted, either manually or using robotics. Protocols are included for seven general protein stains (amido black, Coomassie blue, Ponceau S, colloidal gold, colloidal silver, India ink, and MemCode) and three fluorescent protein stains (fluorescamine, IAEDANS, and SYPRO Ruby). Also included is an in-depth discussion of the different blot membrane types and the compatibility of different protein stains with downstream applications, such as immunoblotting or N-terminal Edman sequencing. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.

  2. Frog secretions and hunting magic in the upper Amazon: identification of a peptide that interacts with an adenosine receptor.

    PubMed Central

    Daly, J W; Caceres, J; Moni, R W; Gusovsky, F; Moos, M; Seamon, K B; Milton, K; Myers, C W

    1992-01-01

    A frog used for "hunting magic" by several groups of Panoan-speaking Indians in the borderline between Brazil and Peru is identified as Phyllomedusa bicolor. This frog's skin secretion, which the Indians introduce into the body through fresh burns, is rich in peptides. These include vasoactive peptides, opioid peptides, and a peptide that we have named adenoregulin, with the sequence GLWSKIKEVGKEAAKAAAKAAGKAALGAVSEAV as determined from mass spectrometry and Edman degradation. The natural peptide may contain a D amino acid residue, since it is not identical in chromatographic properties to the synthetic peptide. Adenoregulin enhances binding of agonists to A1 adenosine receptors; it is accompanied in the skin secretion by peptides that inhibit binding. The vasoactive peptide sauvagine, the opioid peptides, and adenoregulin and related peptides affect behavior in mice and presumably contribute to the behavioral sequelae observed in humans. Images PMID:1438301

  3. Frog secretions and hunting magic in the upper Amazon: identification of a peptide that interacts with an adenosine receptor.

    PubMed

    Daly, J W; Caceres, J; Moni, R W; Gusovsky, F; Moos, M; Seamon, K B; Milton, K; Myers, C W

    1992-11-15

    A frog used for "hunting magic" by several groups of Panoan-speaking Indians in the borderline between Brazil and Peru is identified as Phyllomedusa bicolor. This frog's skin secretion, which the Indians introduce into the body through fresh burns, is rich in peptides. These include vasoactive peptides, opioid peptides, and a peptide that we have named adenoregulin, with the sequence GLWSKIKEVGKEAAKAAAKAAGKAALGAVSEAV as determined from mass spectrometry and Edman degradation. The natural peptide may contain a D amino acid residue, since it is not identical in chromatographic properties to the synthetic peptide. Adenoregulin enhances binding of agonists to A1 adenosine receptors; it is accompanied in the skin secretion by peptides that inhibit binding. The vasoactive peptide sauvagine, the opioid peptides, and adenoregulin and related peptides affect behavior in mice and presumably contribute to the behavioral sequelae observed in humans.

  4. Antifungal mechanism of a novel antifungal protein from pumpkin rinds against various fungal pathogens.

    PubMed

    Park, Seong-Cheol; Kim, Jin-Young; Lee, Jong-Kook; Hwang, Indeok; Cheong, Hyeonsook; Nah, Jae-Woon; Hahm, Kyung-Soo; Park, Yoonkyung

    2009-10-14

    A novel antifungal protein (Pr-2) was identified from pumpkin rinds using water-soluble extraction, ultrafiltration, cation exchange chromatography, and reverse-phase high-performance liquid chromatography. Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry indicated that the protein had a molecular mass of 14865.57 Da. Automated Edman degradation showed that the N-terminal sequence of Pr-2 was QGIGVGDNDGKRGKR-. The Pr-2 protein strongly inhibited in vitro growth of Botrytis cinerea, Colletotrichum coccodes, Fusarium solani, Fusarium oxysporum, and Trichoderma harzianum at 10-20 microM. The results of confocal laser scanning microscopy and SYTOX Green uptake demonstrated that its effective region was the membrane of the fungal cell surface. In addition, this protein was found to be noncytotoxic and heat-stable. Taken together, the results of this study indicate that Pr-2 is a good candidate for use as a natural antifungal agent.

  5. Streptococcal phosphoenolpyruvate-sugar phosphotransferase system: amino acid sequence and site of ATP-dependent phosphorylation of HPr

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Deutscher, J.; Pevec, B.; Beyreuther, K.

    1986-10-21

    The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolyptic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system,more » HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S faecalis has now been determined. (/sup 32/P)P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.« less

  6. An EThcD-Based Method for Discrimination of Leucine and Isoleucine Residues in Tryptic Peptides

    NASA Astrophysics Data System (ADS)

    Zhokhov, Sergey S.; Kovalyov, Sergey V.; Samgina, Tatiana Yu.; Lebedev, Albert T.

    2017-08-01

    An EThcD-based approach for the reliable discrimination of isomeric leucine and isoleucine residues in peptide de novo sequencing procedure has been proposed. A multistage fragmentation of peptide ions was performed with Orbitrap Elite mass spectrometer in electrospray ionization mode. At the first stage, z-ions were produced by ETD or ETcaD fragmentation of doubly or triply charged peptide precursor ions. These primary ions were further fragmented by HCD with broad-band ion isolation, and the resulting w-ions showed different mass for leucine and isoleucine residues. The procedure did not require manual isolation of specific z-ions prior to HCD stage. Forty-three tryptic peptides (3 to 27 residues) obtained by trypsinolysis of human serum albumin (HSA) and gp188 protein were analyzed. To demonstrate a proper solution for radical site migration problem, three non-tryptic peptides were also analyzed. A total of 93 leucine and isoleucine residues were considered and 83 of them were correctly identified. The developed approach can be a reasonable substitution for additional Edman degradation procedure, which is still used in peptide sequencing for leucine and isoleucine discrimination.

  7. Definition of the HLA-A29 peptide ligand motif allows prediction of potential T-cell epitopes from the retinal soluble antigen, a candidate autoantigen in birdshot retinopathy.

    PubMed Central

    Boisgerault, F; Khalil, I; Tieng, V; Connan, F; Tabary, T; Cohen, J H; Choppin, J; Charron, D; Toubert, A

    1996-01-01

    The peptide-binding motif of HLA-A29, the predisposing allele for birdshot retinopathy, was determined after acid-elution of endogenous peptides from purified HLA-A29 molecules. Individual and pooled HPLC fractions were sequenced by Edman degradation. Major anchor residues could be defined as glutamate at the second position of the peptide and as tyrosine at the carboxyl terminus. In vitro binding of polyglycine synthetic peptides to purified HLA-A29 molecules also revealed the need for an auxiliary anchor residue at the third position, preferably phenylalanine. By using this motif, we synthesized six peptides from the retinal soluble antigen, a candidate autoantigen in autoimmune uveoretinitis. Their in vitro binding was tested on HLA-A29 and also on HLA-B44 and HLA-B61, two alleles sharing close peptide-binding motifs. Two peptides derived from the carboxyl-terminal sequence of the human retinal soluble antigen bound efficiently to HLA-A29. This study could contribute to the prediction of T-cell epitopes from retinal autoantigens implicated in birdshot retinopathy. PMID:8622959

  8. [Purification, characterization and partial primary structure analysis of rutin-degrading enzyme in tartary buckwheat seeds].

    PubMed

    Zhang, Yuwei; Li, Jie; Yuan, Yong; Gu, Jijuan; Chen, Peng

    2017-05-25

    Rutin-degrading enzymes (RDE) can degrade rutin into poorly water soluble compound, quercetin, and cause the bitter taste in tartary buckwheat. In the present study RDE from Yu 6-21 tartary buckwheat seeds was purified by ammonium sulphate precipitation, followed by hydrophobic interaction chromatography on Phenyl Sepharose CL-4B, ion exchange chromatography on CM-Cellulose and gel filtration chromatography on Sephadex G-150. Purified RDE showed single band with molecular weight of 66 kDa on SDS-PAGE. The optimum pH and temperature of RDE were 5.0 and 50 ℃ respectively. The Km was 0.27 mmol/L, and the Vmax was 39.68 U/mg. The RDE activity could be inhibited by Cu²⁺, Zn²⁺, Mn²⁺ and EDTA, and showed tolerance to 50% methanol (V/V). The N terminal sequence (TVSRSSFPDGFLFGL) was obtained by Edman degradation method and 15 internal peptide sequences were determined by MALDI-TOF-MS (matrix-assisted laser desorption ionization time of flight mass spectrometry). These results established the foundations for identification of the candidate gene of RDE via transcriptome data and further studying RDE biological function.

  9. Structural and biological characterization of three novel mastoparan peptides from the venom of the neotropical social wasp Protopolybia exigua (Saussure).

    PubMed

    Mendes, Maria Anita; de Souza, Bibiana Monson; Palma, Mario Sergio

    2005-01-01

    The venom of the Neotropical social wasp Protopolybia exigua(Saussure) was fractionated by RP-HPLC resulting in the elution of 20 fractions. The homogeneity of the preparations were checked out by using ESI-MS analysis and the fractions 15, 17 and 19 (eluted at the most hydrophobic conditions) were enough pure to be sequenced by Edman degradation chemistry, resulting in the following sequences: Protopolybia MPI I-N-W-L-K-L-G-K-K-V-S-A-I-L-NH2 Protopolybia-MP II I-N-W-K-A-I-I-E-A-A-K-Q-A-L-NH2 Protopolybia-MP III I-N-W-L-K-L-G-K-A-V-I-D-A-L-NH2 All the peptides were manually synthesized on-solid phase and functionally characterized. Protopolybia-MP I is a hemolytic mastoparan, probably acting on mast cells by assembling in plasma membrane, resulting in pore formation; meanwhile, the peptides Protopolybia-MP II and -MP III were characterized as a non-hemolytic mast cell degranulator toxins, which apparently act by virtue of their binding to G-protein receptor, activating the mast cell degranulation.

  10. A novel cysteine-rich antifungal peptide ToAMP4 from Taraxacum officinale Wigg. flowers.

    PubMed

    Astafieva, A A; Rogozhin, Eugene A; Andreev, Yaroslav A; Odintsova, T I; Kozlov, S A; Grishin, Eugene V; Egorov, Tsezi A

    2013-09-01

    A novel peptide named ToAMP4 was isolated from Taraxacum officinale Wigg. flowers by a combination of acetic acid extraction and different types of chromatography: affinity, size-exclusion, and RP-HPLC. The amino acid sequence of ToAMP4 was determined by automated Edman degradation. The peptide is basic, consists of 41 amino acids, and incorporates three disulphide bonds. Due to the unusual cysteine spacing pattern, ToAMP4 does not belong to any known plant AMP family, but classifies together with two other antimicrobial peptides ToAMP1 and ToAMP2 previously isolated from the dandelion flowers. To study the biological activity of ToAMP4, it was successfully produced in a prokaryotic expression system as a fusion protein with thioredoxin. The recombinant peptide was shown to be identical to the native ToAMP4 by chromatographic behavior, molecular mass, and N-terminal amino acid sequence. The peptide displays broad-spectrum antifungal activity against important phytopathogens. Two ToAMP4-mediated inhibition strategies depending on the fungus were demonstrated. The results obtained add to our knowledge on the structural and functional diversity of AMPs in plants. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  11. Membrane fractions active in poliovirus RNA replication contain VPg precursor polypeptides

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Takegami, T.; Semler, B.L.; Anderson, C.W.

    1983-01-01

    The poliovirus specific polypeptide P3-9 is of special interest for studies of viral RNA replication because it contains a hydrophobic region and, separated by only seven amino acids from that region, the amino acid sequence of the genome-linked protein VPg. Membraneous complexes of poliovirus-infected HeLa cells that contain poliovirus RNA replicating proteins have been analyzed for the presence of P3-9 by immunoprecipitation. Incubation of a membrane fraction rich in P3-9 with proteinase leaves the C-terminal 69 amino acids of P3-9 intact, an observation suggesting that this portion is protected by its association with the cellular membrane. These studies have alsomore » revealed two hitherto undescribed viral polypeptides consisting of amino acid sequences of the P2 andf P3 regions of the polyprotein. Sequence analysis by stepwise Edman degradation show that these proteins are 3b/9 (M/sub r/77,000) and X/9 (M/sub r/50,000). 3b/9 and X/9 are membrane bound and are turned over rapidly and may be direct precursors to proteins P2-X and P3-9 of the RNA replication complex. P2-X, a polypeptide void of hydrophobic amino acid sequences but also found associated with membranes, is rapidly degraded when the membraneous complex is treated with trypsin. It is speculated that P2-X is associated with membranes by its affinity to the N-terminus of P3-9.« less

  12. A new cofactor in prokaryotic enzyme: Tryptophan tryptophylquinone as the redox prosthetic group in methylamine dehydrogenase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McIntire, W.S.; Wemmer, D.E.; Chistoserdov, A.

    Methylamine dehydrogenase (MADH), an {alpha}{sub 2}{beta}{sub 2} enzyme from numerous methylotrophic soil bacteria, contains a novel quinonoid redox prosthetic group that is covalently bound to its small {beta} subunit through two amino acyl residues. A comparison of the amino acid sequence deduced from the gene sequence of the small subunit for the enzyme from Methylobacterium extorquens AM1 with the published amino acid sequence obtained by Edman degradation method, allowed the identification of the amino acyl constituents of the cofactor as two tryptophyl residues. This information was crucial for interpreting {sup 1}H and {sup 13}C nuclear magnetic resonance, and mass spectralmore » data collected for the semicarbazide- and carboxymethyl-derivatized bis(tripeptidyl)-cofactor of MADH from bacterium W3A1. The cofactor is composed of two cross-linked tryptophyl residues. Although there are many possible isomers, only one is consistent with all the data: The first tryptophyl residue in the peptide sequence exists as an indole-6,7-dione, and is attached at its 4 position to the 2 position of the second, otherwise unmodified, indole side group. Contrary to earlier reports, the cofactor of MADH is not 2,7,9-tricarboxypyrroloquinoline quinone (PQQ), a derivative thereof, of pro-PQQ. This appears to be the only example of two cross-linked, modified amino acyl residues having a functional role in the active site of an enzyme, in the absence of other cofactors or metal ions.« less

  13. Echinoderm phosphorylated matrix proteins UTMP16 and UTMP19 have different functions in sea urchin tooth mineralization.

    PubMed

    Alvares, Keith; Dixit, Saryu N; Lux, Elizabeth; Veis, Arthur

    2009-09-18

    Studies of mineralization of embryonic spicules and of the sea urchin genome have identified several putative mineralization-related proteins. These predicted proteins have not been isolated or confirmed in mature mineralized tissues. Mature Lytechinus variegatus teeth were demineralized with 0.6 N HCl after prior removal of non-mineralized constituents with 4.0 M guanidinium HCl. The HCl-extracted proteins were fractionated on ceramic hydroxyapatite and separated into bound and unbound pools. Gel electrophoresis compared the protein distributions. The differentially present bands were purified and digested with trypsin, and the tryptic peptides were separated by high pressure liquid chromatography. NH2-terminal sequences were determined by Edman degradation and compared with the genomic sequence bank data. Two of the putative mineralization-related proteins were found. Their complete amino acid sequences were cloned from our L. variegatus cDNA library. Apatite-binding UTMP16 was found to be present in two isoforms; both isoforms had a signal sequence, a Ser-Asp-rich extracellular matrix domain, and a transmembrane and cytosolic insertion sequence. UTMP19, although rich in Glu and Thr did not bind to apatite. It had neither signal peptide nor transmembrane domain but did have typical nuclear localization and nuclear exit signal sequences. Both proteins were phosphorylated and good substrates for phosphatase. Immunolocalization studies with anti-UTMP16 show it to concentrate at the syncytial membranes in contact with the mineral. On the basis of our TOF-SIMS analyses of magnesium ion and Asp mapping of the mineral phase composition, we speculate that UTMP16 may be important in establishing the high magnesium columns that fuse the calcite plates together to enhance the mechanical strength of the mineralized tooth.

  14. Two new 4-Cys conotoxins (framework 14) of the vermivorous snail Conus austini from the Gulf of Mexico with activity in the central nervous system of mice

    PubMed Central

    Zugasti-Cruz, Alejandro; Falcón, Andrés; Heimer de la Cotera, Edgar P.; Olivera, Baldomero M.; Aguilar, Manuel B.

    2008-01-01

    As part of continuing studies of the venom components present in Conus austini (syn.: Conus cancellatus), a vermivorous cone snail collected in the western Gulf of Mexico, Mexico, two major peptides, as14a and as14b, were purified and characterized. Their amino acid sequences were determined by automatic Edman sequencing after reduction and alkylation. Their molecular masses, established by matrix-assisted laser desorption ionization time-of-flight mass spectrometry, confirmed the chemical analyses and indicated that as14a and as14b have free C-termini. Each peptide contains four Cys residues arranged in a pattern (C-C-C-C, framework 14). The primary structure of as14a is GGVGRCIYNCMNSGGGLNFIQCKTMCY (experimental monoisotopic mass 2,883.92 Da; calculated monoisotopic mass 2,884.20 Da), whereas that of as14b is RWDVDQCIYYCLNGVVGYSYTECQTMCT (experimental monoisotopic mass 3,308.63 Da; calculated monoisotopic mass 3,308.34 Da). Both purified peptides elicited scratching and grooming activity in mice, and as14b also caused body and rear limb extension and tail curling immediately upon injection. The high sequence similarity of peptide as14a with peptide vil14a from the vermivorous C. villepinii suggests that the former might block K+ channels. PMID:18206266

  15. PhcrTx2, a New Crab-Paralyzing Peptide Toxin from the Sea Anemone Phymanthus crucifer

    PubMed Central

    Garateix, Anoland; Salceda, Emilio; Zaharenko, André Junqueira; Pons, Tirso; Santos, Yúlica; Arreguín, Roberto; Ständker, Ludger; Forssmann, Wolf-Georg; Tytgat, Jan; Vega, Rosario

    2018-01-01

    Sea anemones produce proteinaceous toxins for predation and defense, including peptide toxins that act on a large variety of ion channels of pharmacological and biomedical interest. Phymanthus crucifer is commonly found in the Caribbean Sea; however, the chemical structure and biological activity of its toxins remain unknown, with the exception of PhcrTx1, an acid-sensing ion channel (ASIC) inhibitor. Therefore, in the present work, we focused on the isolation and characterization of new P. crucifer toxins by chromatographic fractionation, followed by a toxicity screening on crabs, an evaluation of ion channels, and sequence analysis. Five groups of toxic chromatographic fractions were found, and a new paralyzing toxin was purified and named PhcrTx2. The toxin inhibited glutamate-gated currents in snail neurons (maximum inhibition of 35%, IC50 4.7 µM), and displayed little or no influence on voltage-sensitive sodium/potassium channels in snail and rat dorsal root ganglion (DRG) neurons, nor on a variety of cloned voltage-gated ion channels. The toxin sequence was fully elucidated by Edman degradation. PhcrTx2 is a new β-defensin-fold peptide that shares a sequence similarity to type 3 potassium channels toxins. However, its low activity on the evaluated ion channels suggests that its molecular target remains unknown. PhcrTx2 is the first known paralyzing toxin in the family Phymanthidae. PMID:29414882

  16. Glycosaminoglycan Chain of Dentin Sialoprotein Proteoglycan

    PubMed Central

    Zhu, Q.; Sun, Y.; Prasad, M.; Wang, X.; Yamoah, A.K.; Li, Y.; Feng, J.; Qin, C.

    2010-01-01

    Dentin sialophosphoprotein (DSPP) is processed into dentin sialoprotein (DSP) and dentin phosphoprotein. A molecular variant of rat DSP, referred to as “HMW-DSP”, has been speculated to be a proteoglycan form of DSP. To determine if HMW-DSP is the proteoglycan form of DSP and to identify the glycosaminoglycan side-chain attachment site(s), we further characterized HMW-DSP. Chondroitinase ABC treatment reduced the migration rate for portions of rat HMW-DSP to the level of DSP. Disaccharide analysis showed that rat HMW-DSP contains glycosaminoglycan chains made of chondroitin-4-sulfate and has an average of 31-32 disaccharides/mol. These observations confirmed that HMW-DSP is the proteoglycan form of DSP (renamed “DSP-PG”). Edman degradation and mass spectrometric analyses of tryptic peptides from rat DSP-PG, along with substitution analyses of candidate Ser residues in mouse DSPP, confirmed that 2 glycosaminoglycan chains are attached to Ser241 and Ser253 in the rat, or Ser242 and Ser254 in the mouse DSPP sequence. PMID:20400719

  17. Glycosaminoglycan chain of dentin sialoprotein proteoglycan.

    PubMed

    Zhu, Q; Sun, Y; Prasad, M; Wang, X; Yamoah, A K; Li, Y; Feng, J; Qin, C

    2010-08-01

    Dentin sialophosphoprotein (DSPP) is processed into dentin sialoprotein (DSP) and dentin phosphoprotein. A molecular variant of rat DSP, referred to as "HMW-DSP", has been speculated to be a proteoglycan form of DSP. To determine if HMW-DSP is the proteoglycan form of DSP and to identify the glycosaminoglycan side-chain attachment site(s), we further characterized HMW-DSP. Chondroitinase ABC treatment reduced the migration rate for portions of rat HMW-DSP to the level of DSP. Disaccharide analysis showed that rat HMW-DSP contains glycosaminoglycan chains made of chondroitin-4-sulfate and has an average of 31-32 disaccharides/mol. These observations confirmed that HMW-DSP is the proteoglycan form of DSP (renamed "DSP-PG"). Edman degradation and mass spectrometric analyses of tryptic peptides from rat DSP-PG, along with substitution analyses of candidate Ser residues in mouse DSPP, confirmed that 2 glycosaminoglycan chains are attached to Ser(241) and Ser(253) in the rat, or Ser(242) and Ser(254) in the mouse DSPP sequence.

  18. Structural characterization of osmoregulator peptides from the brain of the leech Theromyzon tessulatum: IPEPYVWD and IPEPYVWD-amide.

    PubMed

    Salzet, M; Vandenbulcke, F; Verger-Bocquet, M

    1996-12-31

    Neurons immunoreactive to an antiserum (a-OT) directed specifically against the C-terminal part (prolyl-leucyl-glycinamide) of vertebrate oxytocin (OT) were detected in the brain of the leech Theromyzon tessulatum. With high pressure gel permeation chromatography followed by reversed-phase HPLC on brain extracts, evidence was given of the presence of three peptides (P1, P2, P3) immunoreactive to a-OT. Results of injection experiments in T. tessulatum and of titrations of each peptide at the different physiological stages of the animals which showed a peak in peptide P1 amount at stage 3B, indicated that P1 is the active OT-like peptide. Using three steps of reversed-phase HPLC, Edman degradation and electrospray mass spectrometry, two sequences for P1 (IPEPYVWD and IPEPYVWD-amide) were found. These peptides differ from peptides to the oxytocin/vasopressin family and are unique in the animal kingdom. Confirmation of their action on the hydric balance and their distribution in the CNS were presented.

  19. Lacticin LC14, a new bacteriocin produced by Lactococcus lactis BMG6.14: isolation, purification and partial characterization.

    PubMed

    Lasta, Samar; Ouzari, Hadda; Andreotti, Nicolas; Fajloun, Ziad; Mansuelle, Pascal; Boudabous, Abdellatif; Sampieri, Francois; Sabatier, Jean Marc

    2012-08-01

    A new bacteriocin, lacticin LC14, produced by Lactococcus lactis BMG6.14, was isolated and characterized. It was purified to homogeneity from overnight broth culture by ammonium sulfate precipitation, Sep-Pak chromatography, and two steps of reversed-phase HPLC. Lacticin LC14 showed bactericidal-type antimicrobial activity against several lactic acid bacteria and pathogenic strains including Listeria monocytogenes. It was inactivated by proteinase K and pronase E, but was resistant to papain, lysozyme, lipase and catalase. Lacticin LC14 was heat resistant, stable over a wide range of pH (2-10) and after treatment by solvents and detergents. Its N-terminal end was found unreactive towards Edman sequencing. Based on MALDI-TOF mass spectrometry, its molecular mass was 3333.7 Da. LC14 amino acid composition revealed a high proportion of hydrophobic residues, but no modified ones. LC14 may be able to challenge other well known other bacteriocins in probiotic and therapeutic applications.

  20. [The primary structure of the alpha-amylase inhibitor Hoe 467A from Streptomyces tendae 4158. A new class of inhibitors].

    PubMed

    Aschauer, H; Vértesy, L; Nesemann, G; Braunitzer, G

    1983-10-01

    The native or modified alpha-amylase inhibitor Hoe 467A - isolated from the culture medium of Streptomyces tendae 4158 - and overlapping peptides were degraded by the automatic Edman technique. The oxidized or aminoethylated or oxidized and maleoylated inhibitor was digested with trypsin and the native inhibitor with pepsin. Further digestion with Staphylococcus aureus proteinase was also carried out. After peptic digestion two cystin peptides were isolated, which allowed the establishment of the disulfide bonds. The alpha-amylase inhibitor is a polypeptid consisting of 74 amino-acid residues with a molecular mass of 7958 Da. The inhibitor is composed of all naturally occurring amino acids except methionine and phenylalanine and shows no sequence homology to known inhibitors. The clinical and pharmacological importance in respect to the inhibitors ability for inactivation of human salivary and pancreatic alpha-amylase is discussed. Especially the proteinase resistance of the inhibitor enables a clinical application in human (e.g. Diabetes mellitus) per os.

  1. A Low Molecular Weight Protein from the Sea Anemone Anemonia viridis with an Anti-Angiogenic Activity

    PubMed Central

    Loret, Erwann P.; Luis, José; Nuccio, Christopher; Villard, Claude; Mansuelle, Pascal; Lebrun, Régine; Villard, Pierre Henri

    2018-01-01

    Sea anemones are a remarkable source of active principles due to a decentralized venom system. New blood vessel growth or angiogenesis is a very promising target against cancer, but the few available antiangiogenic compounds have limited efficacy. In this study, a protein fraction, purified from tentacles of Anemonia viridis, was able to limit endothelial cells proliferation and angiogenesis at low concentration (14 nM). Protein sequences were determined with Edman degradation and mass spectrometry in source decay and revealed homologies with Blood Depressing Substance (BDS) sea anemones. The presence of a two-turn alpha helix observed with circular dichroism and a trypsin activity inhibition suggested that the active principle could be a Kunitz-type inhibitor, which may interact with an integrin due to an Arginine Glycin Aspartate (RGD) motif. Molecular modeling showed that this RGD motif was well exposed to solvent. This active principle could improve antiangiogenic therapy from existing antiangiogenic compounds binding on the Vascular Endothelial Growth Factor (VEGF). PMID:29671760

  2. Purification and characterization of a strong fibrinolytic enzyme (nattokinase) in the vegetable cheese natto, a popular soybean fermented food in Japan.

    PubMed

    Fujita, M; Nomura, K; Hong, K; Ito, Y; Asada, A; Nishimuro, S

    1993-12-30

    A strong fibrinolytic enzyme (nattokinase) was purified from the vegetable cheese natto. Nattokinase was extracted from natto with saline and isolated by sequential use of hydrophobic chromatography on Butyl-Toyopearl, ion-exchange chromatography on CM-Toyopearl, and gel-filtration on Sephadex G-50. The isolated protein gave a single sharp band on SDS-PAGE either before or after reduction. The sequence, as determined by automated Edman degradation of the uncleaved molecule and its enzymatically derived peptide, consisted of a total 275 amino acid residues (M.W = 27,728) and exhibited a high homology with the subtilisins. The purified nattokinase digested not only fibrin but also several synthetic substrates. Among the synthetic substrates, the most sensitive substrate was Suc-Ala-Ala-Pro-Phe-pNA for subtilisin. PMSF inhibited both the fibrinolytic activity and the amidolytic activity. The results indicate that nattokinase is a subtilisin-like serine protease.

  3. Characterization of the novel antifungal protein PgAFP and the encoding gene of Penicillium chrysogenum.

    PubMed

    Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A

    2010-04-01

    The strain RP42C from Penicillium chrysogenum produces a small protein PgAFP that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6 494Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70bp was amplified by PCR from pgafp gene. 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70bp segment. The complete pgafp sequence of 404bp was obtained using primers designed from 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279bp coding region interrupted by two introns of 63 and 62bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature 58 amino acids PgAFP. The deduced amino acid sequence of the mature protein shares 79% identity to the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  4. Characterization of the Organic Component of Low-Molecular-Weight Chromium-Binding Substance and Its Binding of Chromium123

    PubMed Central

    Chen, Yuan; Watson, Heather M.; Gao, Junjie; Sinha, Sarmistha Halder; Cassady, Carolyn J.; Vincent, John B.

    2011-01-01

    Chromium was proposed to be an essential element over 50 y ago and was shown to have therapeutic potential in treating the symptoms of type 2 diabetes; however, its mechanism of action at a molecular level is unknown. One chromium-binding biomolecule, low-molecular weight chromium-binding substance (LMWCr or chromodulin), has been found to be biologically active in in vitro assays and proposed as a potential candidate for the in vivo biologically active form of chromium. Characterization of the organic component of LMWCr has proven difficult. Treating bovine LMWCr with trifluoroacetic acid followed by purification on a graphite powder micro-column generates a heptapeptide fragment of LMWCr. The peptide sequence of the fragment was analyzed by MS and tandem MS (MS/MS and MS/MS/MS) using collision-induced dissociation and post-source decay. Two candidate sequences, pEEEEGDD and pEEEGEDD (where pE is pyroglutamate), were identified from the MS/MS experiments; additional tandem MS suggests the sequence is pEEEEGDD. The N-terminal glutamate residues explain the inability to sequence LMWCr by the Edman method. Langmuir isotherms and Hill plots were used to analyze the binding constants of chromic ions to synthetic peptides similar in composition to apoLMWCr. The sequence pEEEEGDD was found to bind 4 chromic ions per peptide with nearly identical cooperativity and binding constants to those of apoLMWCr. This work should lead to further studies elucidating or eliminating a potential role for LMWCr in treating the symptoms of type 2 diabetes and other conditions resulting from improper carbohydrate and lipid metabolism. PMID:21593351

  5. Biochemical and molecular characterization of the venom from the Cuban scorpion Rhopalurus junceus.

    PubMed

    García-Gómez, B I; Coronas, F I V; Restano-Cassulini, R; Rodríguez, R R; Possani, L D

    2011-07-01

    This communication describes the first general biochemical, molecular and functional characterization of the venom from the Cuban blue scorpion Rhopalurus junceus, which is often used as a natural product for anti-cancer therapy in Cuba. The soluble venom of this arachnid is not toxic to mice, injected intraperitoneally at doses up to 200 μg/20 g body weight, but it is deadly to insects at doses of 10 μg per animal. The venom causes typical alpha and beta-effects on Na+ channels, when assayed using patch-clamp techniques in neuroblastoma cells in vitro. It also affects K+ currents conducted by ERG (ether-a-go-go related gene) channels. The soluble venom was shown to display phospholipase, hyaluronidase and anti-microbial activities. High performance liquid chromatography of the soluble venom can separate at least 50 components, among which are peptides lethal to crickets. Four such peptides were isolated to homogeneity and their molecular masses and N-terminal amino acid sequence were determined. The major component (RjAa12f) was fully sequenced by Edman degradation. It contains 64 amino acid residues and four disulfide bridges, similar to other known scorpion toxins. A cDNA library prepared from the venomous glands of one scorpion allowed cloning 18 genes that code for peptides of the venom, including RjA12f and eleven other closely related genes. Sequence analyses and phylogenetic reconstruction of the amino acid sequences deduced from the cloned genes showed that this scorpion contains sodium channel like toxin sequences clearly segregated into two monophyletic clusters. Considering the complex set of effects on Na+ currents verified here, this venom certainly warrant further investigation. Copyright © 2011 Elsevier Ltd. All rights reserved.

  6. Carolina View, Vol II, 1986.

    ERIC Educational Resources Information Center

    Davis, Char W., Ed.; Small, LaVeta T., Ed.

    1986-01-01

    Diverse issues in higher education are addressed in 19 articles. Titles and authors are as follows: "2001: Formulation of a Vision" (Kenneth L. Schwab); Trustees' Roles and Student Issues" (Davis Powers); "What Do Students and Faculty Talk about When They Eat Meals Together?" (John H. Schuh, Neal Edman); "Student…

  7. Purification and complete amino acid sequence of a new type of sweet protein taste-modifying activity, curculin.

    PubMed

    Yamashita, H; Theerasilp, S; Aiuchi, T; Nakaya, K; Nakamura, Y; Kurihara, Y

    1990-09-15

    A new taste-modifying protein named curculin was extracted with 0.5 M NaCl from the fruits of Curculigo latifolia and purified by ammonium sulfate fractionation, CM-Sepharose ion-exchange chromatography, and gel filtration. Purified curculin thus obtained gave a single band having a Mr of 12,000 on sodium dodecyl sulfate-polyacrylamide gel electrophoresis in the presence of 8 M urea. The molecular weight determined by low-angle laser light scattering was 27,800. These results suggest that native curculin is a dimer of a 12,000-Da polypeptide. The complete amino acid sequence of curculin was determined by automatic Edman degradation. Curculin consists of 114 residues. Curculin itself elicits a sweet taste. After curculin, water elicits a sweet taste, and sour substances induce a stronger sense of sweetness. No protein with both sweet-tasting and taste-modifying activities has ever been found. There are five sets of tripeptides common to miraculin (a taste-modifying protein), six sets of tripeptides common to thaumatin (a sweet protein), and two sets of tripeptides common to monellin (a sweet protein). Anti-miraculin serum was not immunologically reactive with curculin. The mechanism of the taste-modifying action of curculin is discussed.

  8. Comparative proteomic analysis of male and female venoms from the Cuban scorpion Rhopalurus junceus.

    PubMed

    Rodríguez-Ravelo, Rodolfo; Batista, Cesar V F; Coronas, Fredy I V; Zamudio, Fernando Z; Hernández-Orihuela, Lorena; Espinosa-López, Georgina; Ruiz-Urquiola, Ariel; Possani, Lourival D

    2015-12-01

    A complete mass spectrometry analysis of venom components from male and female scorpions of the species Rhophalurus junceus of Cuba is reported. In the order of 200 individual molecular masses were identified in both venoms, from which 63 are identical in male and females genders. It means that a significant difference of venom components exists between individuals of different sexes, but the most abundant components are present in both sexes. The relative abundance of identical components is different among the genders. Three well defined groups of different peptides were separated and identified. The first group corresponds to peptides with molecular masses of 1000-2000 Da; the second to peptides with 3500-4500 Da molecular weight, and the third with 6500-8000 Da molecular weights. A total of 86 peptides rich in disulfide bridges were found in the venoms, 27 with three disulfide bridges and 59 with four disulfide bridges. LC-MS/MS analysis allowed the identification and amino acid sequence determination of 31 novel peptides in male venom. Two new putative K(+)-channel peptides were sequences by Edman degradation. They contain 37 amino acid residues, packed by three disulfide bridges and were assigned the systematic numbers: α-KTx 1.18 and α-KTx 2.15. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. Purification and Molecular Characterization of the Novel Highly Potent Bacteriocin TSU4 Produced by Lactobacillus animalis TSU4.

    PubMed

    Sahoo, Tapasa Kumar; Jena, Prasant Kumar; Patel, Amiya Kumar; Seshadri, Sriram

    2015-09-01

    Bacterial infections causing fish diseases and spoilage during fish food processing and storage are major concerns in aquaculture. Use of bacteriocins has recently been considered as an effective strategy for prevention of bacterial infections. A novel bacteriocin produced by Catla catla gut isolates, Lactobacillus animalis TSU4, designated as bacteriocin TSU4 was purified to homogeneity by a three-step protocol. The molecular mass of bacteriocin TSU4 was 4117 Da determined by Q-TOF LC/MS analysis. Its isoelectric point was ~9. Secondary conformation obtained by circular dichroism spectroscopy showed molecular conformation with significant proportions of the structure in α-helix (23.7 %) and β-sheets (17.1 %). N-terminal sequencing was carried out by the Edman degradation method; partial sequence identified was NH2-SMSGFSKPHD. Bacteriocin TSU4 exhibited a wide range of antimicrobial activity, pH and thermal stability. It showed a bacteriocidal mode of action against the indicator strain Aeromonas hydrophila MTCC 646. Bacteriocin TSU4 is the first reported bacteriocin produced by fish isolate Lactobacillus animalis. The characterization of bacteriocin TSU4 suggested that it is a novel bacteriocin with potential value against infections of bacteria such as A. hydrophila MTCC 646 and Pseudomonas aeruginosa MTCC 1688 and application to prevent spoilage during food preservation.

  10. Structural characterization of peptides derived from prosomatostatins I and II isolated from the pancreatic islets of two species of teleostean fish: the daddy sculpin and the flounder.

    PubMed

    Conlon, J M; Davis, M S; Falkmer, S; Thim, L

    1987-11-02

    The primary structures of three peptides from extracts from the pancreatic islets of the daddy sculpin (Cottus scorpius) and three analogous peptides from the islets of the flounder (Platichthys flesus), two species of teleostean fish, have been determined by automated Edman degradation. The structures of the flounder peptides were confirmed by fast-atom bombardment mass spectrometry. The peptides show strong homology to residues (49-60), (63-96) and (98-125) of the predicted sequence of preprosomatostatin II from the anglerfish (Lophius americanus). The amino acid sequences of the peptides suggest that, in the sculpin, prosomatostatin II is cleaved at a dibasic amino acid residue processing site (corresponding to Lys61-Arg62 in anglerfish preprosomatostatin II). The resulting fragments are further cleaved at monobasic residue processing sites (corresponding to Arg48 and Arg97 in anglerfish preprosomatostatin II). In the flounder the same dibasic residue processing site is utilised but cleavage at different monobasic sites takes place (corresponding to Arg50 and Arg97 in anglerfish preprosomatostatin II). A peptide identical to mammalian somatostatin-14 was also isolated from the islets of both species and is presumed to represent a cleavage product of prosomatostatin I.

  11. R(-)-4-(3-Isothiocyanatopyrrolidin-1-yl)-7-(N,N-dimethylaminosulfonyl)-2,1,3-benzoxadiazole, a fluorescent chiral tagging reagent: sensitive resolution of chiral amines and amino acids by reversed-phase liquid chromatography.

    PubMed

    Toyo'oka, T; Jin, D; Tomoi, N; Oe, T; Hiranuma, H

    2001-02-01

    The usefulness of R(-)-4-(3-isothiocyanatopyrrolidin-1-yl)-7-(N,N-dimethylaminosulfonyl)-2,1,3-benzoxadiazole [R(-)-DBD-PyNCS], a fluorescent chiral tagging reagent, for the determination of racemic amines and amino acids, was studied. The reagent reacted with beta-blockers selected as representative secondary amines to produce corresponding fluorescent diastereomers (excitation at 460 nm and emission at 550 nm). The yields of the derivatization reaction were dependent on the stereostructure arround the NH group in beta-blockers. The resulting diastereomers were completely separated with single chromatographic run using linear gradient elutions by reversed-phase chromatography. R(-)-DBD-PyNCS was also applied to the determination of DL-amino acid, considered to be one of the primary amines, in human urine and foodstuffs. DL-amino acids tested equally reacted with the reagent, and the thiocarbamoyl derivatives were separated with an ODS column. The epimerization during the derivatization reaction was negligible judging from the resolution of opposite diastereomers on the chromatogram. The occurence of D-amino acids (D-Ala, D-Ser, D-Asp and/or D-Glu) was identified in the samples tested. The structures and the purities were elucidated with on-line HPLC-MS. The chiral reagent possessing an isothiocyanate group (-NCS) in the structure seems to be applicable to continuous sequential analysis of peptides containing D-amino acids. The thiocarbamoyl derivatives obtained from the reaction with DL-amino acids were converted to thiohydantoins via thiazolinones in acidic medium. The thiohydantoins produced from acidic, basic, neutral, hydroxyl and aromatic amino acids were completely separated with isocratic elutions using acidic mobile phase containing 0.1% TFA. The separations were sufficient for the identification of DL-amino acid in peptide sequences. Although the epimerization during the conversion reaction to thiohydantoins was not avoidable, the descrimination of D- and L-configuration was demonstrated with some commercially available peptides such as beta-lipotropin and [D-Ala2]-deltorphin II. The Edman degaradation method using R(-)-DBD-PyNCS was also adopted to autoanlaysis by gas-phase sequencer. The separation and the detection (UV 254 nm) conditions of the derivatives were used without any change from those for the Edman degradation method using PITC as the tagging reagent. The three DL-amino acid residues (Tyr, Ala and Gly) in [L-Ala2]-leucine-enkephalin and [D-Ala2]-leucine-enkephalin were perfectly identidied with the autoanalysis.

  12. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fox, J.W.; Elzinga, M.; Tu, A.T.

    The primary structure of myotoxin a, a myotoxin protein from the venom of the North American rattlesnake Crotalus viridis viridis, was determined and the position of the disulfide bonds assigned. The toxin was isolated, carboxymethylated, and cleaved by cyanogen bromide, and the resultant peptides were isolated. The cyanogen bromide peptides were subjected to amino acid sequence analysis. In order to assign the positions of the three disulfide bonds, the native toxin was cleaved sequentially with cyanogen bromide and trypsin. A two peptide unit connected by one disulfide bond was isolated and characterized, and a three-peptide unit connected by two disulfidemore » bonds was isolated. One peptide in the three-peptide unit was identified as Cys-Cys-Lys. In order to establish the linkages between the peptides and Cys-Cys-Lys, one cycle of Edman degradation was carried out such that the Cys-Cys bond was cleaved. Upon isolation and analysis of the cleavage products, the disulfide bonds connecting the three peptides were determined. The positions of the disulfide bridges of myotoxin a were determined to be totally different from those of neurotoxins isolated from snake venoms. The sequence of myotoxin a was compared with the sequences of other snake venom toxins using the computer program RELATE to determine whether myotoxin a is similar to any other types of toxins. From the computer analysis, myotoxin a did not show any close relationship to other toxins except crotamine from the South American rattlesnake Crotalus durissus terrificus.« less

  13. Production, purification, sequencing and activity spectra of mutacins D-123.1 and F-59.1

    PubMed Central

    2011-01-01

    Background The increase in bacterial resistance to antibiotics impels the development of new anti-bacterial substances. Mutacins (bacteriocins) are small antibacterial peptides produced by Streptococcus mutans showing activity against bacterial pathogens. The objective of the study was to produce and characterise additional mutacins in order to find new useful antibacterial substances. Results Mutacin F-59.1 was produced in liquid media by S. mutans 59.1 while production of mutacin D-123.1 by S. mutans 123.1 was obtained in semi-solid media. Mutacins were purified by hydrophobic chromatography. The amino acid sequences of the mutacins were obtained by Edman degradation and their molecular mass was determined by mass spectrometry. Mutacin F-59.1 consists of 25 amino acids, containing the YGNGV consensus sequence of pediocin-like bacteriocins with a molecular mass calculated at 2719 Da. Mutacin D-123.1 has an identical molecular mass (2364 Da) with the same first 9 amino acids as mutacin I. Mutacins D-123.1 and F-59.1 have wide activity spectra inhibiting human and food-borne pathogens. The lantibiotic mutacin D-123.1 possesses a broader activity spectrum than mutacin F-59.1 against the bacterial strains tested. Conclusion Mutacin F-59.1 is the first pediocin-like bacteriocin identified and characterised that is produced by Streptococcus mutans. Mutacin D-123.1 appears to be identical to mutacin I previously identified in different strains of S. mutans. PMID:21477375

  14. Production, purification, sequencing and activity spectra of mutacins D-123.1 and F-59.1.

    PubMed

    Nicolas, Guillaume G; LaPointe, Gisèle; Lavoie, Marc C

    2011-04-10

    The increase in bacterial resistance to antibiotics impels the development of new anti-bacterial substances. Mutacins (bacteriocins) are small antibacterial peptides produced by Streptococcus mutans showing activity against bacterial pathogens. The objective of the study was to produce and characterise additional mutacins in order to find new useful antibacterial substances. Mutacin F-59.1 was produced in liquid media by S. mutans 59.1 while production of mutacin D-123.1 by S. mutans 123.1 was obtained in semi-solid media. Mutacins were purified by hydrophobic chromatography. The amino acid sequences of the mutacins were obtained by Edman degradation and their molecular mass was determined by mass spectrometry. Mutacin F-59.1 consists of 25 amino acids, containing the YGNGV consensus sequence of pediocin-like bacteriocins with a molecular mass calculated at 2719 Da. Mutacin D-123.1 has an identical molecular mass (2364 Da) with the same first 9 amino acids as mutacin I. Mutacins D-123.1 and F-59.1 have wide activity spectra inhibiting human and food-borne pathogens. The lantibiotic mutacin D-123.1 possesses a broader activity spectrum than mutacin F-59.1 against the bacterial strains tested. Mutacin F-59.1 is the first pediocin-like bacteriocin identified and characterised that is produced by Streptococcus mutans. Mutacin D-123.1 appears to be identical to mutacin I previously identified in different strains of S. mutans.

  15. Novel proline-hydroxyproline glycopeptides from the dandelion (Taraxacum officinale Wigg.) flowers: de novo sequencing and biological activity.

    PubMed

    Astafieva, Alexandra A; Enyenihi, Atim A; Rogozhin, Eugene A; Kozlov, Sergey A; Grishin, Eugene V; Odintsova, Tatyana I; Zubarev, Roman A; Egorov, Tsezi A

    2015-09-01

    Two novel homologous peptides named ToHyp1 and ToHyp2 that show no similarity to any known proteins were isolated from Taraxacum officinale Wigg. flowers by multidimensional liquid chromatography. Amino acid and mass spectrometry analyses demonstrated that the peptides have unusual structure: they are cysteine-free, proline-hydroxyproline-rich and post-translationally glycosylated by pentoses, with 5 carbohydrates in ToHyp2 and 10 in ToHyp1. The ToHyp2 peptide with a monoisotopic molecular mass of 4350.3Da was completely sequenced by a combination of Edman degradation and de novo sequencing via top down multistage collision induced dissociation (CID) and higher energy dissociation (HCD) tandem mass spectrometry (MS(n)). ToHyp2 consists of 35 amino acids, contains eighteen proline residues, of which 8 prolines are hydroxylated. The peptide displays antifungal activity and inhibits growth of Gram-positive and Gram-negative bacteria. We further showed that carbohydrate moieties have no significant impact on the peptide structure, but are important for antifungal activity although not absolutely necessary. The deglycosylated ToHyp2 peptide was less active against the susceptible fungus Bipolaris sorokiniana than the native peptide. Unique structural features of the ToHyp2 peptide place it into a new family of plant defense peptides. The discovery of ToHyp peptides in T. officinale flowers expands the repertoire of molecules of plant origin with practical applications. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  16. Identification of the regulatory autophosphorylation site of autophosphorylation-dependent protein kinase (auto-kinase). Evidence that auto-kinase belongs to a member of the p21-activated kinase family.

    PubMed

    Yu, J S; Chen, W J; Ni, M H; Chan, W H; Yang, S D

    1998-08-15

    Autophosphorylation-dependent protein kinase (auto-kinase) was identified from pig brain and liver on the basis of its unique autophosphorylation/activation property [Yang, Fong, Yu and Liu (1987) J. Biol. Chem. 262, 7034-7040; Yang, Chang and Soderling (1987) J. Biol. Chem. 262, 9421-9427]. Its substrate consensus sequence motif was determined as being -R-X-(X)-S*/T*-X3-S/T-. To characterize auto-kinase further, we partly sequenced the kinase purified from pig liver. The N-terminal sequence (VDGGAKTSDKQKKKAXMTDE) and two internal peptide sequences (EKLRTIV and LQNPEK/ILTP/FI) of auto-kinase were obtained. These sequences identify auto-kinase as a C-terminal catalytic fragment of p21-activated protein kinase 2 (PAK2 or gamma-PAK) lacking its N-terminal regulatory region. Auto-kinase can be recognized by an antibody raised against the C-terminal peptide of human PAK2 by immunoblotting. Furthermore the autophosphorylation site sequence of auto-kinase was successfully predicted on the basis of its substrate consensus sequence motif and the known PAK2 sequence, and was further demonstrated to be RST(P)MVGTPYWMAPEVVTR by phosphoamino acid analysis, manual Edman degradation and phosphopeptide mapping via the help of phosphorylation site analysis of a synthetic peptide corresponding to the sequence of PAK2 from residues 396 to 418. During the activation process, auto-kinase autophosphorylates mainly on a single threonine residue Thr402 (according to the sequence numbering of human PAK2). In addition, a phospho-specific antibody against a synthetic phosphopeptide containing this identified sequence was generated and shown to be able to differentially recognize the activated auto-kinase autophosphorylated at Thr402 but not the non-phosphorylated/inactive auto-kinase. Immunoblot analysis with this phospho-specific antibody further revealed that the change in phosphorylation level of Thr402 of auto-kinase was well correlated with the activity change of the kinase during both autophosphorylation/activation and protein phosphatase-mediated dephosphorylation/inactivation processes. Taken together, our results identify Thr402 as the regulatory autophosphorylation site of auto-kinase, which is a C-terminal catalytic fragment of PAK2.

  17. Identification of the regulatory autophosphorylation site of autophosphorylation-dependent protein kinase (auto-kinase). Evidence that auto-kinase belongs to a member of the p21-activated kinase family.

    PubMed Central

    Yu, J S; Chen, W J; Ni, M H; Chan, W H; Yang, S D

    1998-01-01

    Autophosphorylation-dependent protein kinase (auto-kinase) was identified from pig brain and liver on the basis of its unique autophosphorylation/activation property [Yang, Fong, Yu and Liu (1987) J. Biol. Chem. 262, 7034-7040; Yang, Chang and Soderling (1987) J. Biol. Chem. 262, 9421-9427]. Its substrate consensus sequence motif was determined as being -R-X-(X)-S*/T*-X3-S/T-. To characterize auto-kinase further, we partly sequenced the kinase purified from pig liver. The N-terminal sequence (VDGGAKTSDKQKKKAXMTDE) and two internal peptide sequences (EKLRTIV and LQNPEK/ILTP/FI) of auto-kinase were obtained. These sequences identify auto-kinase as a C-terminal catalytic fragment of p21-activated protein kinase 2 (PAK2 or gamma-PAK) lacking its N-terminal regulatory region. Auto-kinase can be recognized by an antibody raised against the C-terminal peptide of human PAK2 by immunoblotting. Furthermore the autophosphorylation site sequence of auto-kinase was successfully predicted on the basis of its substrate consensus sequence motif and the known PAK2 sequence, and was further demonstrated to be RST(P)MVGTPYWMAPEVVTR by phosphoamino acid analysis, manual Edman degradation and phosphopeptide mapping via the help of phosphorylation site analysis of a synthetic peptide corresponding to the sequence of PAK2 from residues 396 to 418. During the activation process, auto-kinase autophosphorylates mainly on a single threonine residue Thr402 (according to the sequence numbering of human PAK2). In addition, a phospho-specific antibody against a synthetic phosphopeptide containing this identified sequence was generated and shown to be able to differentially recognize the activated auto-kinase autophosphorylated at Thr402 but not the non-phosphorylated/inactive auto-kinase. Immunoblot analysis with this phospho-specific antibody further revealed that the change in phosphorylation level of Thr402 of auto-kinase was well correlated with the activity change of the kinase during both autophosphorylation/activation and protein phosphatase-mediated dephosphorylation/inactivation processes. Taken together, our results identify Thr402 as the regulatory autophosphorylation site of auto-kinase, which is a C-terminal catalytic fragment of PAK2. PMID:9693111

  18. Biosynthesis of riboflavin: an unusual riboflavin synthase of Methanobacterium thermoautotrophicum.

    PubMed Central

    Eberhardt, S; Korn, S; Lottspeich, F; Bacher, A

    1997-01-01

    Riboflavin synthase was purified by a factor of about 1,500 from cell extract of Methanobacterium thermoautotrophicum. The enzyme had a specific activity of about 2,700 nmol mg(-1) h(-1) at 65 degrees C, which is relatively low compared to those of riboflavin synthases of eubacteria and yeast. Amino acid sequences obtained after proteolytic cleavage had no similarity with known riboflavin synthases. The gene coding for riboflavin synthase (designated ribC) was subsequently cloned by marker rescue with a ribC mutant of Escherichia coli. The ribC gene of M. thermoautotrophicum specifies a protein of 153 amino acid residues. The predicted amino acid sequence agrees with the information gleaned from Edman degradation of the isolated protein and shows 67% identity with the sequence predicted for the unannotated reading frame MJ1184 of Methanococcus jannaschii. The ribC gene is adjacent to a cluster of four genes with similarity to the genes cbiMNQO of Salmonella typhimurium, which form part of the cob operon (this operon contains most of the genes involved in the biosynthesis of vitamin B12). The amino acid sequence predicted by the ribC gene of M. thermoautotrophicum shows no similarity whatsoever to the sequences of riboflavin synthases of eubacteria and yeast. Most notably, the M. thermoautotrophicum protein does not show the internal sequence homology characteristic of eubacterial and yeast riboflavin synthases. The protein of M. thermoautotrophicum can be expressed efficiently in a recombinant E. coli strain. The specific activity of the purified, recombinant protein is 1,900 nmol mg(-1) h(-1) at 65 degrees C. In contrast to riboflavin synthases from eubacteria and fungi, the methanobacterial enzyme has an absolute requirement for magnesium ions. The 5' phosphate of 6,7-dimethyl-8-ribityllumazine does not act as a substrate. The findings suggest that riboflavin synthase has evolved independently in eubacteria and methanobacteria. PMID:9139911

  19. Comparative analysis of the XopD T3S effector family in plant pathogenic bacteria

    PubMed Central

    Kim, Jung-Gun; Taylor, Kyle W.; Mudgett, Mary Beth

    2011-01-01

    SUMMARY XopD is a type III effector protein that is required for Xanthomonas campestris pathovar vesicatoria (Xcv) growth in tomato. It is a modular protein consisting of an N-terminal DNA-binding domain, two EAR transcriptional repressor motifs, and a C-terminal SUMO protease. In tomato, XopD functions as a transcriptional repressor, resulting in the suppression of defense responses at late stages of infection. A survey of available genome sequences for phytopathogenic bacteria revealed that XopD homologs are limited to species within three Genera of Proteobacteria – Xanthomonas, Acidovorax, and Pseudomonas. While the EAR motif(s) and SUMO protease domain are conserved in all the XopD-like proteins, variation exists in the length and sequence identity of the N-terminal domains. Comparative analysis of the DNA sequences surrounding xopD and xopD-like genes led to revised annotation of the xopD gene. Edman degradation sequence analysis and functional complementation studies confirmed that the xopD gene from Xcv encodes a 760 amino acid protein with a longer N-terminal domain than previously predicted. None of the XopD-like proteins studied complemented Xcv ΔxopD mutant phenotypes in tomato leaves suggesting that the N-terminus of XopD defines functional specificity. Xcv ΔxopD strains expressing chimeric fusion proteins containing the N-terminus of XopD fused to the EAR motif(s) and SUMO protease domain of the XopD-like protein from Xanthomonas campestris pathovar campestris strain B100 were fully virulent in tomato demonstrating that the N-terminus of XopD controls specificity in tomato. PMID:21726373

  20. Isolation and identification of an extracellular subtilisin-like serine protease secreted by the bat pathogen Pseudogymnoascus destructans.

    PubMed

    Pannkuk, Evan L; Risch, Thomas S; Savary, Brett J

    2015-01-01

    White nose syndrome (WNS) is a cutaneous fungal disease of bats. WNS is responsible for unprecedented mortalities in North American cave bat populations. There have been few descriptions of enzyme activities that may function in WNS host/pathogen interactions, while no study has isolated and described secreted proteases. To address the hypothesis that Pseudogymnoascus destructans secretes extracellular proteases that function in wing necrosis during WNS infection, the object of this study was to culture P. destructans on various media, then isolate and structurally identify those proteases accumulated stably in the culture medium. We found a single dominant protease activity on minimal nutrient broth enriched with protein substrates, which was strongly inhibited by phenylmethylsulfonyl fluoride. This P. destructans serine protease (PdSP1) was isolated by preparative isoelectric focusing and concanavalin A lectin affinity chromatography. PdSP1 showed a molecular weight 27,900 (estimated by SDS-PAGE), broad pH optimum 6-8, and temperature optimum 60°C. Structural characterization of PdSP1 by MALDI-TOF MS, Orbitrap MS/MS, and Edman amino-terminal peptide sequencing matched it directly to a hypothetical protein accession from the sequenced P. destructans genome that is further identified as a MEROPS family S8A subtilisin-like serine peptidase. Two additional isoforms, PdSP2 and PdSP3, were identified in the P. destructans genome with 90% and 53% homology, respectively. P. destructans S8A serine proteases showed closer sequence conservation to P. pannorum and plant pathogenic fungi than to human pathogenic dermatophytes. Peptide-specific polyclonal antibodies developed from the PdSP1 sequence detected the protein in western blots. These subtilisin-like serine proteases are candidates for further functional studies in WNS host-pathogen interaction.

  1. Isolation and Identification of an Extracellular Subtilisin-Like Serine Protease Secreted by the Bat Pathogen Pseudogymnoascus destructans

    PubMed Central

    Pannkuk, Evan L.; Risch, Thomas S.; Savary, Brett J.

    2015-01-01

    White nose syndrome (WNS) is a cutaneous fungal disease of bats. WNS is responsible for unprecedented mortalities in North American cave bat populations. There have been few descriptions of enzyme activities that may function in WNS host/pathogen interactions, while no study has isolated and described secreted proteases. To address the hypothesis that Pseudogymnoascus destructans secretes extracellular proteases that function in wing necrosis during WNS infection, the object of this study was to culture P. destructans on various media, then isolate and structurally identify those proteases accumulated stably in the culture medium. We found a single dominant protease activity on minimal nutrient broth enriched with protein substrates, which was strongly inhibited by phenylmethylsulfonyl fluoride. This P. destructans serine protease (PdSP1) was isolated by preparative isoelectric focusing and concanavalin A lectin affinity chromatography. PdSP1 showed a molecular weight 27,900 (estimated by SDS-PAGE), broad pH optimum 6-8, and temperature optimum 60°C. Structural characterization of PdSP1 by MALDI-TOF MS, Orbitrap MS/MS, and Edman amino-terminal peptide sequencing matched it directly to a hypothetical protein accession from the sequenced P. destructans genome that is further identified as a MEROPS family S8A subtilisin-like serine peptidase. Two additional isoforms, PdSP2 and PdSP3, were identified in the P. destructans genome with 90% and 53% homology, respectively. P. destructans S8A serine proteases showed closer sequence conservation to P. pannorum and plant pathogenic fungi than to human pathogenic dermatophytes. Peptide-specific polyclonal antibodies developed from the PdSP1 sequence detected the protein in western blots. These subtilisin-like serine proteases are candidates for further functional studies in WNS host-pathogen interaction. PMID:25785714

  2. The Acheta domesticus Densovirus, Isolated from the European House Cricket, Has Evolved an Expression Strategy Unique among Parvoviruses▿†

    PubMed Central

    Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter

    2011-01-01

    The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445

  3. Identification of a Novel Small Cysteine-Rich Protein in the Fraction from the Biocontrol Fusarium oxysporum Strain CS-20 that Mitigates Fusarium Wilt Symptoms and Triggers Defense Responses in Tomato

    PubMed Central

    Shcherbakova, Larisa A.; Odintsova, Tatyana I.; Stakheev, Alexander A.; Fravel, Deborah R.; Zavriev, Sergey K.

    2016-01-01

    The biocontrol effect of the non-pathogenic Fusarium oxysporum strain CS-20 against the tomato wilt pathogen F. oxysporum f. sp. lycopersici (FOL) has been previously reported to be primarily plant-mediated. This study shows that CS-20 produces proteins, which elicit defense responses in tomato plants. Three protein-containing fractions were isolated from CS-20 biomass using size exclusion chromatography. Exposure of seedling roots to one of these fractions prior to inoculation with pathogenic FOL strains significantly reduced wilt severity. This fraction initiated an ion exchange response in cultured tomato cells resulting in a reversible alteration of extracellular pH; increased tomato chitinase activity, and induced systemic resistance by enhancing PR-1 expression in tomato leaves. Two other protein fractions were inactive in seedling protection. The main polypeptide (designated CS20EP), which was specifically present in the defense-inducing fraction and was not detected in inactive protein fractions, was identified. The nucleotide sequence encoding this protein was determined, and its complete amino acid sequence was deduced from direct Edman degradation (25 N-terminal amino acid residues) and DNA sequencing. The CS20EP was found to be a small basic cysteine-rich protein with a pI of 9.87 and 23.43% of hydrophobic amino acid residues. BLAST search in the NCBI database showed that the protein is new; however, it displays 48% sequence similarity with a hypothetical protein FGSG_10784 from F. graminearum strain PH-1. The contribution of CS20EP to elicitation of tomato defense responses resulting in wilt mitigating is discussed. PMID:26779237

  4. Characterization of a periplasmic S1-like nuclease coded by the Mesorhizobium loti symbiosis island

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pimkin, Maxim; Miller, C. Glenn; Blakesley, Lauryn

    DNA sequences encoding hypothetical proteins homologous to S1 nuclease from Aspergillus oryzae are found in many organisms including fungi, plants, pathogenic bacteria, and eukaryotic parasites. One of these is the M1 nuclease of Mesorhizobium loti which we demonstrate herein to be an enzymatically active, soluble, and stable S1 homolog that lacks the extensive mannosyl-glycosylation found in eukaryotic S1 nuclease homologs. We have expressed the cloned M1 protein in M. loti and purified recombinant native M1 to near homogeneity and have also isolated a homogeneous M1 carboxy-terminal hexahistidine tag fusion protein. Mass spectrometry and N-terminal Edman degradation sequencing confirmed the proteinmore » identity. The enzymatic properties of the purified M1 nuclease are similar to those of S1. At acidic pH M1 is 25 times more active on single-stranded DNA than on double-stranded DNA and 3 times more active on single-stranded DNA than on single-stranded RNA. At neutral pH the RNase activity of M1 exceeds the DNase activity. M1 nicks supercoiled RF-I plasmid DNA and rapidly cuts the phosphodiester bond across from the nick in the resultant relaxed RF-II plasmid DNA. Therefore, M1 represents an active bacterial S1 homolog in spite of great sequence divergence. The biochemical characterization of M1 nuclease supports our sequence alignment that reveals the minimal 21 amino acid residues that are necessarily conserved for the structure and functions of this enzyme family. The ability of M1 to degrade RNA at neutral pH implies previously unappreciated roles of these nucleases in biological systems.« less

  5. An asparagine residue at the N-terminus affects the maturation process of low molecular weight glutenin subunits of wheat endosperm

    PubMed Central

    2014-01-01

    Background Wheat glutenin polymers are made up of two main subunit types, the high- (HMW-GS) and low- (LMW-GS) molecular weight subunits. These latter are represented by heterogeneous proteins. The most common, based on the first amino acid of the mature sequence, are known as LMW-m and LMW-s types. The mature sequences differ as a consequence of three extra amino acids (MET-) at the N-terminus of LMW-m types. The nucleotide sequences of their encoding genes are, however, nearly identical, so that the relationship between gene and protein sequences is difficult to ascertain. It has been hypothesized that the presence of an asparagine residue in position 23 of the complete coding sequence for the LMW-s type might account for the observed three-residue shortened sequence, as a consequence of cleavage at the asparagine by an asparaginyl endopeptidase. Results We performed site-directed mutagenesis of a LMW-s gene to replace asparagine at position 23 with threonine and thus convert it to a candidate LMW-m type gene. Similarly, a candidate LMW-m type gene was mutated at position 23 to replace threonine with asparagine. Next, we produced transgenic durum wheat (cultivar Svevo) lines by introducing the mutated versions of the LMW-m and LMW-s genes, along with the wild type counterpart of the LMW-m gene. Proteomic comparisons between the transgenic and null segregant plants enabled identification of transgenic proteins by mass spectrometry analyses and Edman N-terminal sequencing. Conclusions Our results show that the formation of LMW-s type relies on the presence of an asparagine residue close to the N-terminus generated by signal peptide cleavage, and that LMW-GS can be quantitatively processed most likely by vacuolar asparaginyl endoproteases, suggesting that those accumulated in the vacuole are not sequestered into stable aggregates that would hinder the action of proteolytic enzymes. Rather, whatever is the mechanism of glutenin polymer transport to the vacuole, the proteins remain available for proteolytic processing, and can be converted to the mature form by the removal of a short N-terminal sequence. PMID:24629124

  6. Molecular Cloning and Characterization of Novel Morus alba Germin-Like Protein Gene Which Encodes for a Silkworm Gut Digestion-Resistant Antimicrobial Protein

    PubMed Central

    Patnaik, Bharat Bhusan; Kim, Dong Hyun; Oh, Seung Han; Song, Yong-Su; Chanh, Nguyen Dang Minh; Kim, Jong Sun; Jung, Woo-jin; Saha, Atul Kumar; Bindroo, Bharat Bhushan; Han, Yeon Soo

    2012-01-01

    Background Silkworm fecal matter is considered one of the richest sources of antimicrobial and antiviral protein (substances) and such economically feasible and eco-friendly proteins acting as secondary metabolites from the insect system can be explored for their practical utility in conferring broad spectrum disease resistance against pathogenic microbial specimens. Methodology/Principal Findings Silkworm fecal matter extracts prepared in 0.02 M phosphate buffer saline (pH 7.4), at a temperature of 60°C was subjected to 40% saturated ammonium sulphate precipitation and purified by gel-filtration chromatography (GFC). SDS-PAGE under denaturing conditions showed a single band at about 21.5 kDa. The peak fraction, thus obtained by GFC wastested for homogeneityusing C18reverse-phase high performance liquid chromatography (HPLC). The activity of the purified protein was tested against selected Gram +/− bacteria and phytopathogenic Fusarium species with concentration-dependent inhibitionrelationship. The purified bioactive protein was subjected to matrix-assisted laser desorption and ionization-time of flight mass spectrometry (MALDI-TOF-MS) and N-terminal sequencing by Edman degradation towards its identification. The N-terminal first 18 amino acid sequence following the predicted signal peptide showed homology to plant germin-like proteins (Glp). In order to characterize the full-length gene sequence in detail, the partial cDNA was cloned and sequenced using degenerate primers, followed by 5′- and 3′-rapid amplification of cDNA ends (RACE-PCR). The full-length cDNA sequence composed of 630 bp encoding 209 amino acids and corresponded to germin-like proteins (Glps) involved in plant development and defense. Conclusions/Significance The study reports, characterization of novel Glpbelonging to subfamily 3 from M. alba by the purification of mature active protein from silkworm fecal matter. The N-terminal amino acid sequence of the purified protein was found similar to the deduced amino acid sequence (without the transit peptide sequence) of the full length cDNA from M. alba. PMID:23284650

  7. DOE Office of Scientific and Technical Information (OSTI.GOV)

    James Graham, Robert Leslie; Graham, Ciaren; McClean, Stephen

    A novel undecapeptide has been isolated and structurally characterized from the venoms of three species of New World pit vipers from the subfamily, Crotalinae. These include the Mexican moccasin (Agkistrodon bilineatus), the prairie rattlesnake (Crotalus viridis viridis), and the South American bushmaster (Lachesis muta). The peptide was purified from all three venoms using a combination of gel permeation chromatography and reverse-phase HPLC. Automated Edman degradation sequencing and MALDI-TOF mass spectrometry established its peptide primary structure as: Thr-Pro-Pro-Ala-Gly-Pro-Asp-Val-Gly-Pro-Arg-OH, with a non-protonated molecular mass of 1063.18 Da. A synthetic replicate of the peptide was found to be an antagonist of bradykinin actionmore » at the rat vascular B2 receptor. This is the first bradykinin inhibitory peptide isolated from snake venom. Database searching revealed the peptide to be highly structurally related (10/11 residues) with a domain residing between the bradykinin-potentiating peptide and C-type natriuretic peptide domains of a recently cloned precursor from tropical rattlesnake (Crotalus durissus terrificus) venom gland. BIP thus represents a novel biological entity from snake venom.« less

  8. Antimicrobial proline-rich peptides from the hemolymph of marine snail Rapana venosa.

    PubMed

    Dolashka, Pavlina; Moshtanska, Vesela; Borisova, Valika; Dolashki, Aleksander; Stevanovic, Stefan; Dimanov, Tzvetan; Voelter, Wolfgang

    2011-07-01

    Hemolymph of Rapana venosa snails is a complex mixture of biochemically and pharmacologically active components such as peptides and proteins. Antimicrobial peptides are gaining attention as antimicrobial alternatives to chemical food preservatives and commonly used antibiotics. Therefore, for the first time we have explored the isolation, identification and characterisation of 11 novel antimicrobial peptides produced by the hemolymph of molluscs. The isolated peptides from the hemolymph applying ultrafiltration and reverse-phase high-performance liquid chromatography (RP-HPLC) have molecular weights between 3000 and 9500 Da, determined by mass spectrometric analysis. The N-terminal sequences of the peptides identified by Edman degradation matched no peptides in the MASCOT search database, indicating novel proline-rich peptides. UV spectra revealed that these substances possessed the characteristics of protein peptides with acidic isoelectric points. However, no Cotton effects were observed between 190 and 280 nm by circular dichroism spectroscopy. Four of the pro-rich peptides also showed strong antimicrobial activities against tested microorganisms including Gram-positive and Gram-negative bacteria. Copyright © 2011 Elsevier Inc. All rights reserved.

  9. Rapid removal of acetimidoyl groups from proteins and peptides. Applications to primary structure determination.

    PubMed Central

    Dubois, G C; Robinson, E A; Inman, J K; Perham, R N; Appella, E

    1981-01-01

    Methylamine buffers can be used for the rapid quantitative removal of acetimidoyl groups from proteins and peptides modified by treatment with ethyl or methyl acetimidate. The half-life for displacement of acetimidoyl groups from fully amidinated proteins incubated in 3.44 M-methylamine/HCl buffer at pH 11.5 and 25 degrees C was approx. 26 min; this half life is 29 times less than that observed in ammonia/HCl buffer under the same conditions of pH and amine concentration. Incubation of acetimidated proteins with methylamine for 4 h resulted in greater than 95% removal of acetimidoyl groups. No deleterious effects on primary structure were detected by amino acid analysis or by automated Edman degradation. Reversible amidination of lysine residues, in conjunction with tryptic digestion, has been successfully applied to the determination of the amino acid sequence of an acetimidated mouse immunoglobulin heavy chain peptide. The regeneration of amino groups in amidinated proteins and peptides by methylaminolysis makes amidination a valuable alternative to citraconoylation and maleoylation in structural studies. PMID:6803762

  10. An insecticidal toxin from Nephila clavata spider venom.

    PubMed

    Jin, Lin; Fang, Mingqian; Chen, Mengrou; Zhou, Chunling; Ombati, Rose; Hakim, Md Abdul; Mo, Guoxiang; Lai, Ren; Yan, Xiuwen; Wang, Yumin; Yang, Shilong

    2017-07-01

    Spiders are the most successful insect predators given that they use their venom containing insecticidal peptides as biochemical weapons for preying. Due to the high specificity and potency of peptidic toxins, discoveries of insecticidal toxins from spider venom have provided an opportunity to obtain natural compounds for agricultural applications without affecting human health. In this study, a novel insecticidal toxin (μ-NPTX-Nc1a) was identified and characterized from the venom of Nephila clavata. Its primary sequence is GCNPDCTGIQCGWPRCPGGQNPVMDKCVSCCPFCPPKSAQG which was determined by automated Edman degradation, cDNA cloning, and MS/MS analysis. BLAST search indicated that Nc1a shows no similarity with known peptides or proteins, indicating that Nc1a belongs to a novel family of insecticidal peptide. Nc1a displayed inhibitory effects on Na V and K V channels in cockroach dorsal unpaired median neurons. The median lethal dose (LD50) of Nc1a on cockroach was 573 ng/g. Herein, a study that identifies a novel insecticidal toxin, which can be a potential candidate and/or template for the development of bioinsecticides, is presented.

  11. Multi-omic Mitoprotease Profiling Defines a Role for Oct1p in Coenzyme Q Production.

    PubMed

    Veling, Mike T; Reidenbach, Andrew G; Freiberger, Elyse C; Kwiecien, Nicholas W; Hutchins, Paul D; Drahnak, Michael J; Jochem, Adam; Ulbrich, Arne; Rush, Matthew J P; Russell, Jason D; Coon, Joshua J; Pagliarini, David J

    2017-12-07

    Mitoproteases are becoming recognized as key regulators of diverse mitochondrial functions, although their direct substrates are often difficult to discern. Through multi-omic profiling of diverse Saccharomyces cerevisiae mitoprotease deletion strains, we predicted numerous associations between mitoproteases and distinct mitochondrial processes. These include a strong association between the mitochondrial matrix octapeptidase Oct1p and coenzyme Q (CoQ) biosynthesis-a pathway essential for mitochondrial respiration. Through Edman sequencing and in vitro and in vivo biochemistry, we demonstrated that Oct1p directly processes the N terminus of the CoQ-related methyltransferase, Coq5p, which markedly improves its stability. A single mutation to the Oct1p recognition motif in Coq5p disrupted its processing in vivo, leading to CoQ deficiency and respiratory incompetence. This work defines the Oct1p processing of Coq5p as an essential post-translational event for proper CoQ production. Additionally, our data visualization tool enables efficient exploration of mitoprotease profiles that can serve as the basis for future mechanistic investigations. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Purification and characteristics of a novel bacteriocin produced by Enterococcus faecalis L11 isolated from Chinese traditional fermented cucumber.

    PubMed

    Gao, Yurong; Li, Benling; Li, Dapeng; Zhang, Liyuan

    2016-05-01

    To purify and characterize a novel bacteriocin with broad inhibitory spectrum produced by an isolate of Enterococcus faecalis from Chinese fermented cucumber. E. faecalis L11 produced a bacteriocin with antimicrobial activity against both Escherichia coli and Staphylococcus aureus. The amino acid sequence of the purified bacteriocin, enterocin L11, was assayed by Edman degradation method. It differs from other class II bacteriocins and exhibited a broad antimicrobial activity against not only Gram-positive bacteria, including Bacillus subtilis, S. aureus, Listeria monocytogenes, Sarcina flava, Lactobacillus acidophilus, L. plantarum, L. delbrueckii subsp. delbrueckii, L. delbrueckii subsp. bulgaricus and Streptococcus thermophilus, but also some Gram-negative bacteria including Salmonella typhimurium, E. coli and Shigella flexneri. Enterocin L11 retained 91 % of its activity after holding at 121 °C for 30 min. It was also resistant to acids and alkalis. Enterocin L11 is a novel broad-spectrum Class II bacteriocin produced by E. faecalis L11, and may have potential as a food biopreservative.

  13. PH-sauvagine from the skin secretion of Phyllomedusa hypochondrialis: A novel CRF-like peptide with smooth muscle contraction activity.

    PubMed

    Zhou, Yu; Shaw, Chris; Chen, Tianbao

    2015-09-15

    Amphibian skin, and particularly that of south/Central American phyllomedusine frogs, is supposed to be "a huge factory and store house of a variety of active peptides". The 40 amino acid amphibian CRF-like peptide, sauvagine, is a prototype member of a unique family of these Phyllomedusa skin peptides. In this study, we describe for the first time the structure of a mature novel peptide from the skin secretion of the South American orange-legged leaf frog, Phyllomedusa hypochondrialis, which belongs to the amphibian CRF/sauvagine family. Partial amino acid sequence from the N-terminal was obtained by automated Edman degradation with the following structure: pGlu-GPPISIDLNMELLRNMIEI-. The biosynthetic precursor of this novel sauvagine peptide, consisted of 85 amino acid residues and was deduced from cDNA library constructed from the same skin secretion. Compared with the standard sauvagine from the frog, Phyllomedusa sauvagei, this novel peptide was found to exert similar contraction effects on isolated guinea-pig colon and rat urinary bladder smooth muscle preparations. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. Crab digestive phospholipase: a new invertebrate member.

    PubMed

    Cherif, Slim; Ben Bacha, Abir; Ben Ali, Yassine; Horchani, Habib; Rekik, Wiem; Gargouri, Youssef

    2010-01-01

    Crab digestive phospholipase (CDPL) was purified from the hepatopancreas of Carcinus mediterraneus crabs. Homogeneous enzyme was obtained after two chromatography steps: anion exchange and size exclusion HPLC column. Homogeneous CDPL has a molecular mass of 14 kDa as determined by SDS/PAGE analysis. Unlike known digestive phospholipases like porcine PLA(2) (PPPL), CDPL displayed its maximal activity at 50 degrees C and not at 37 degrees C. A specific activity of 40 U/mg for the purified CDPL was measured using PC as substrate under optimal conditions (pH 8 and 50 degrees C) in the presence of 8 mM sodium deoxycholate (NaDC) and 10 mM CaCl(2). In contrast to PPPL, purified CDPL was completely inactivated at 60 degrees C. The N-terminal sequence was determined by automatic Edman degradation. No similarity between 12 N-terminal amino acid residues of CDPL was found with those of known digestive phospholipases. CDPL appears to be a new member of invertebrate phospholipases, and it is potentially useful for treat phospholipid-rich industrial effluents, or to synthesize useful chemical compounds which can be used in the food industry.

  15. Evidence for the proteolytic processing of dentin matrix protein 1. Identification and characterization of processed fragments and cleavage sites.

    PubMed

    Qin, Chunlin; Brunn, Jan C; Cook, Richard G; Orkiszewski, Ralph S; Malone, James P; Veis, Arthur; Butler, William T

    2003-09-05

    Full-length cDNA coding for dentin matrix protein 1 (DMP1) has been cloned and sequenced, but the corresponding complete protein has not been isolated. In searching for naturally occurring DMP1, we recently discovered that the extracellular matrix of bone contains fragments originating from DMP1. Shortened forms of DMP1, termed 37K and 57K fragments, were treated with alkaline phosphatase and then digested with trypsin. The resultant peptides were purified by a two-dimensional method: size exclusion followed by reversed-phase high performance liquid chromatography. Purified peptides were sequenced by Edman degradation and mass spectrometry, and the sequences compared with the DMP1 sequence predicted from cDNA. Extensive sequencing of tryptic peptides revealed that the 37K fragments originated from the NH2-terminal region, and the 57K fragments were from the COOH-terminal part of DMP1. Phosphate analysis indicated that the 37K fragments contained 12 phosphates, and the 57K fragments had 41. From 37K fragments, two peptides lacked a COOH-terminal lysine or arginine; instead they ended at Phe173 and Ser180 and were thus COOH termini of 37K fragments. Two peptides were from the NH2 termini of 57K fragments, starting at Asp218 and Asp222. These findings indicated that DMP1 is proteolytically cleaved at four bonds, Phe173-Asp174, Ser180-Asp181, Ser217-Asp218, and Gln221-Asp222, forming eight fragments. The uniformity of cleavages at the NH2-terminal peptide bonds of aspartyl residues suggests that a single proteinase is involved. Based on its reported specificity, we hypothesize that these scissions are catalyzed by PHEX protein. We envision that the proteolytic processing of DMP1 plays a crucial role during osteogenesis and dentinogenesis.

  16. Characterization of on-target generated tryptic peptides from Giberella zeae conidia spore proteins by means of matrix-assisted laser desorption/ionization mass spectrometry.

    PubMed

    Dong, Hongjuan; Marchetti-Deschmann, Martina; Allmaier, Günter

    2014-01-01

    Traditionally characterization of microbial proteins is performed by a complex sequence of steps with the final step to be either Edman sequencing or mass spectrometry, which generally takes several weeks or months to be complete. In this work, we proposed a strategy for the characterization of tryptic peptides derived from Giberella zeae (anamorph: Fusarium graminearum) proteins in parallel to intact cell mass spectrometry (ICMS) in which no complicated and time-consuming steps were needed. Experimentally, after a simple washing treatment of the spores, the aliquots of the intact G. zeae macro conidia spores solution, were deposited two times onto one MALDI (matrix-assisted laser desorption ionization) mass spectrometry (MS) target (two spots). One spot was used for ICMS and the second spot was subject to a brief on-target digestion with bead-immobilized or non-immobilized trypsin. Subsequently, one spot was analyzed immediately by MALDI MS in the linear mode (ICMS) whereas the second spot containing the digested material was investigated by MALDI MS in the reflectron mode ("peptide mass fingerprint") followed by protonated peptide selection for MS/MS (post source decay (PSD) fragment ion) analysis. Based on the formed fragment ions of selected tryptic peptides a complete or partial amino acid sequence was generated by manual de novo sequencing. These sequence data were used for homology search for protein identification. Finally four different peptides of varying abundances have been identified successfully allowing the verification that our desorbed/ionized surface compounds were indeed derived from proteins. The presence of three different proteins could be found unambiguously. Interestingly, one of these proteins is belonging to the ribosomal superfamily which indicates that not only surface-associated proteins were digested. This strategy minimized the amount of time and labor required for obtaining deeper information on spore preparations within the nowadays widely used ICMS approach. Copyright © 2013 Elsevier Ltd. All rights reserved.

  17. Characterization of papain-like isoenzymes from latex of Asclepias curassavica by molecular biology validated by proteomic approach.

    PubMed

    Obregón, Walter D; Liggieri, Constanza S; Trejo, Sebastian A; Avilés, Francesc X; Vairo-Cavalli, Sandra E; Priolo, Nora S

    2009-01-01

    Latices from Asclepias spp are used in wound healing and the treatment of some digestive disorders. These pharmacological actions have been attributed to the presence of cysteine proteases in these milky latices. Asclepias curassavica (Asclepiadaceae), "scarlet milkweed" is a perennial subshrub native to South America. In the current paper we report a new approach directed at the selective biochemical and molecular characterization of asclepain cI (acI) and asclepain cII (acII), the enzymes responsible for the proteolytic activity of the scarlet milkweed latex. SDS-PAGE spots of both purified peptidases were digested with trypsin and Peptide Mass Fingerprints (PMFs) obtained showed no equivalent peptides. No identification was possible by MASCOT search due to the paucity of information concerning Asclepiadaceae latex cysteine proteinases available in databases. From total RNA extracted from latex samples, cDNA of both peptidases was obtained by RT-PCR using degenerate primers encoding Asclepiadaceae cysteine peptidase conserved domains. Theoretical PMFs of partial polypeptide sequences obtained by cloning (186 and 185 amino acids) were compared with empirical PMFs, confirming that the sequences of 186 and 185 amino acids correspond to acI and acII, respectively. N-terminal sequences of acI and acII, characterized by Edman sequencing, were overlapped with those coming from the cDNA to obtain the full-length sequence of both mature peptidases (212 and 211 residues respectively). Alignment and phylogenetic analysis confirmed that acI and acII belong to the subfamily C1A forming a new group of papain-like cysteine peptidases together with asclepain f from Asclepias fruticosa. We conclude that PMF could be adopted as an excellent tool to differentiate, in a fast and unequivocal way, peptidases with very similar physicochemical and functional properties, with advantages over other conventional methods (for instance enzyme kinetics) that are time consuming and afford less reliable results.

  18. Protein sequence analysis, cloning, and expression of flammutoxin, a pore-forming cytolysin from Flammulina velutipes. Maturation of dimeric precursor to monomeric active form by carboxyl-terminal truncation.

    PubMed

    Tomita, Toshio; Mizumachi, Yoshihiro; Chong, Kang; Ogawa, Kanako; Konishi, Norihide; Sugawara-Tomita, Noriko; Dohmae, Naoshi; Hashimoto, Yohichi; Takio, Koji

    2004-12-24

    Flammutoxin (FTX), a 31-kDa pore-forming cytolysin from Flammulina velutipes, is specifically expressed during the fruiting body formation. We cloned and expressed the cDNA encoding a 272-residue protein with an identical N-terminal sequence with that of FTX but failed to obtain hemolytically active protein. This, together with the presence of multiple FTX family proteins in the mushroom, prompted us to determine the complete primary structure of FTX by protein sequence analysis. The N-terminal 72 and C-terminal 107 residues were sequenced by Edman degradation of the fragments generated from the alkylated FTX by enzymatic digestions with Achromobacter protease I or Staphylococcus aureus V8 protease and by chemical cleavages with CNBr, hydroxylamine, or 1% formic acid. The central part of FTX was sequenced with a surface-adhesive 7-kDa fragment, which was generated by a tryptic digestion of FTX and recovered by rinsing the wall of a test tube with 6 M guanidine HCl. The 7-kDa peptide was cleaved with 12 M HCl, thermolysin, or S. aureus V8 protease to produce smaller peptides for sequence analysis. As a result, FTX consisted of 251 residues, and protein and nucleotide sequences were in accord except for the lack of the initial Met and the C-terminal 20 residues in protein. Recombinant FTX (rFTX) with or without the C-terminal 20 residues (rFTX271 or rFTX251, respectively) was prepared to study the maturation process of FTX. Like natural FTX, rFTX251 existed as a monomer in solution and assembled into an SDS-stable, ring-shaped pore complex on human erythrocytes, causing hemolysis. In contrast, rFTX271, existing as a dimer in solution, bound to the cells but failed to form pore complex. The dimeric rFTX271 was converted to hemolytically active monomers upon the cleavage between Lys(251) and Met(252) by trypsin.

  19. A chondroitin sulfate chain attached to the bone dentin matrix protein 1 NH2-terminal fragment.

    PubMed

    Qin, Chunlin; Huang, Bingzhen; Wygant, James N; McIntyre, Bradley W; McDonald, Charles H; Cook, Richard G; Butler, William T

    2006-03-24

    Dentin matrix protein 1 (DMP1) is an acidic noncollagenous protein shown by gene ablations to be critical for the proper mineralization of bone and dentin. In the extracellular matrix of these tissues DMP1 is present as fragments representing the NH2-terminal (37 kDa) and COOH-terminal (57 kDa) portions of the cDNA-deduced amino acid sequence. During our separation of bone noncollagenous proteins, we observed a high molecular weight, DMP1-related component (designated DMP1-PG). We purified DMP1-PG with a monoclonal anti-DMP1 antibody affinity column. Amino acid analysis and Edman degradation of tryptic peptides proved that the core protein for DMP1-PG is the 37-kDa fragment of DMP1. Chondroitinase treatments demonstrated that the slower migration rate of DMP1-PG is due to the presence of glycosaminoglycan. Quantitative disaccharide analysis indicated that the glycosaminoglycan is made predominantly of chondroitin 4-sulfate. Further analysis on tryptic peptides led us to conclude that a single glycosaminoglycan chain is linked to the core protein via Ser74, located in the Ser74-Gly75 dipeptide, an amino acid sequence specific for the attachment of glycosaminoglycans. Our findings show that in addition to its existence as a phosphoprotein, the NH2-terminal fragment from DMP1 occurs as a proteoglycan. Amino acid sequence alignment analysis showed that the Ser74-Gly75 dipeptide and its flanking regions are highly conserved among a wide range of species from caiman to the Homo sapiens, indicating that this glycosaminoglycan attachment domain has survived an extremely long period of evolution pressure, suggesting that the glycosaminoglycan may be critical for the basic biological functions of DMP1.

  20. Characterization of Am IT, an anti-insect β-toxin isolated from the venom of scorpion Androctonus mauretanicus.

    PubMed

    Oukkache, Naoual; ElJaoudi, Rachid; Chgoury, Fatima; Rocha, Marisa Teixeira; Sabatier, Jean-Marc

    2015-06-25

    In the present study, a 'novel' toxin, called Am IT from the venom of scorpion Androctonus mauretanicus is isolated and characterized. A detailed analysis of the action of Am IT on insect axonal sodium currents is reported. Am IT was purified through gel filtration followed by C18 reversed-phase HPLC. Toxicity of Am IT in vivo was assessed on male German cockroach (Blattella germanica) larvae and C57/BL6 mice. Cross-reactivity of Am IT with two β-toxins was evidenced using (125)I-iodinated toxin-based radioimmunoassays with synaptosomal preparations from rat brain. The complete amino acid sequence of Am IT was finally determined by Edman sequencing. Am IT was observed to compete with AaH IT4 purified from the venom of scorpion Androctonus australis in binding assays. It was recognized by an antibody raised against a β-type toxin, which indicated some structural similarity with β-toxins (or related toxin family). The 'novel' toxin exhibited dual activity since it competed with anti-mammal toxins in binding assays as well as showed contracting activity to insect. The toxin competed with radio-labeled β-toxin Css IV by binding to Na(+) channels of rat brain synaptosomes. Analysis of toxin amino acid sequences showed that Am IT shares high structural identity (92%) with AaH IT4. In conclusion, Am IT not only reveals an anti-insect compound properties secreted by 'Old World' scorpions, paralyzing insect larvae by binding to Na(+) channels on larvae's nerve-cell membranes, but also exerts toxic activity in mice, which is similar to anti-mammal toxins from 'New World' scorpions (North and South Americas). Therefore, Am IT appears to be structurally and functionally similar to AaH IT4.

  1. Skin secretion peptides: the molecular facet of the deimatic behavior of the four-eyed frog, Physalaemus nattereri (Anura, Leptodactylidae).

    PubMed

    Barbosa, Eder Alves; Iembo, Tatiane; Martins, Graciella Ribeiro; Silva, Luciano Paulino; Prates, Maura Vianna; Andrade, Alan Carvalho; Bloch, Carlos

    2015-11-15

    Amphibians can produce a large amount of bioactive peptides over the skin. In order to map the precise tissue localization of these compounds and evaluate their functions, mass spectrometry imaging (MSI) and gene expression studies were used to investigate a possible correlation between molecules involved in the antimicrobial defense mechanisms and anti-predatory behavior by Physalaemus nattereri. Total skin secretion of P. nattereri was analyzed by classical Protein Chemistry and proteomic techniques. Intact inguinal macroglands were dissected from the rest of the skin and both tissues were analyzed by MSI and real-time polymerase chain reaction (RT-PCR) experiments. Peptides were primarily identified by de novo sequencing, automatic Edman degradation and cDNA data. Fifteen bradykinin (BK)-related peptides and two antimicrobial peptides were sequenced and mapped by MSI on the inguinal macrogland and the rest of P. nattereri skin. RT-PCR results revealed that BK-related peptide levels of expression were about 30,000 times higher on the inguinal macroglands than on the any other region of the skin, whilst antimicrobial peptide ions appear to be evenly distributed in both investigated regions. The presence of antimicrobial peptides in all investigated tissue regions is in accordance with the defensive role against microorganisms thoroughly demonstrated in the literature, whereas BK-related molecules are largely found on the inguinal macroglands suggesting an intriguing link between their noxious activities against potential predators of P. nattereri and the frog's deimatic behavior. Copyright © 2015 John Wiley & Sons, Ltd.

  2. AcT-2: a novel myotropic and antimicrobial type 2 tryptophyllin from the skin secretion of the Central American red-eyed leaf frog, Agalychnis callidryas.

    PubMed

    Ge, Lilin; Lyu, Peng; Zhou, Mei; Zhang, Huiling; Wan, Yuantai; Li, Bin; Li, Renjie; Wang, Lei; Chen, Tianbao; Shaw, Chris

    2014-01-01

    Tryptophyllins are a diverse family of amphibian peptides originally found in extracts of phyllomedusine frog skin by chemical means. Their biological activities remain obscure. Here we describe the isolation and preliminary pharmacological characterization of a novel type 2 tryptophyllin, named AcT-2, from the skin secretion of the red-eyed leaf frog, Agalychnis callidryas. The peptide was initially identified during smooth muscle pharmacological screening of skin secretion HPLC fractions and the unique primary structure--GMRPPWF-NH2--was established by both Edman degradation and electrospray MS/MS fragmentation sequencing. A. cDNA encoding the biosynthetic precursor of AcT-2 was successfully cloned from a skin secretion-derived cDNA library by means of RACE PCR and this contained an open-reading frame consisting of 62 amino acid residues with a single AcT-2 encoding sequence located towards the C-terminus. A synthetic replicate of AcT-2 was found to relax arterial smooth muscle (EC50 = 5.1 nM) and to contract rat urinary bladder smooth muscle (EC50 = 9.3 μ M). The peptide could also inhibit the growth of the microorganisms, Staphylococcus aureus, (MIC = 256 mg/L) Escherichia coli (MIC = 512 mg/L), and Candida albicans (128 mg/L). AcT-2 is thus the first amphibian skin tryptophyllin found to possess both myotropic and antimicrobial activities.

  3. Perlinhibin, a Cysteine-, Histidine-, and Arginine-Rich Miniprotein from Abalone (Haliotis laevigata) Nacre, Inhibits In Vitro Calcium Carbonate Crystallization

    PubMed Central

    Mann, Karlheinz; Siedler, Frank; Treccani, Laura; Heinemann, Fabian; Fritz, Monika

    2007-01-01

    We have isolated a 4.785 Da protein from the nacreous layer of the sea snail Haliotis laevigata (greenlip abalone) shell after demineralization with acetic acid. The sequence of 41 amino acids was determined by Edman degradation supported by mass spectrometry. The most abundant amino acids were cysteine (19.5%), histidine (17%), and arginine (14.6%). The positively charged amino acids were almost counterbalanced by negatively charged ones resulting in a calculated isoelectric point of 7.86. Atomic-force microscopy studies of the interaction of the protein with calcite surfaces in supersaturated calcium carbonate solution or calcium chloride solution showed that the protein bound specifically to calcite steps, inhibiting further crystal growth at these sites in carbonate solution and preventing crystal dissolution when carbonate was substituted with chloride. Therefore this protein was named perlinhibin. X-ray diffraction investigation of the crystal after atomic-force microscopy growth experiments showed that the formation of aragonite was induced on the calcite substrate around holes caused by perlinhibin crystal-growth inhibition. The strong interaction of the protein with calcium carbonate was also shown by vapor diffusion crystallization. In the presence of the protein, the crystal surfaces were covered with holes due to protein binding and local inhibition of crystal growth. In addition to perlinhibin, we isolated and sequenced a perlinhibin-related protein, indicating that perlinhibin may be a member of a family of closely related proteins. PMID:17496038

  4. Structural characterization and comparative modeling of PD-Ls 1-3, type 1 ribosome-inactivating proteins from summer leaves of Phytolacca dioica L.

    PubMed

    Di Maro, Antimo; Chambery, Angela; Carafa, Vincenzo; Costantini, Susan; Colonna, Giovanni; Parente, Augusto

    2009-03-01

    The amino acid sequence and glycan structure of PD-L1, PD-L2 and PD-L3, type 1 ribosome-inactivating proteins isolated from Phytolacca dioica L. leaves, were determined using a combined approach based on peptide mapping, Edman degradation and ESI-Q-TOF MS in precursor ion discovery mode. The comparative analysis of the 261 amino acid residue sequences showed that PD-L1 and PD-L2 have identical primary structure, as it is the case of PD-L3 and PD-L4. Furthermore, the primary structure of PD-Ls 1-2 and PD-Ls 3-4 have 81.6% identity (85.1% similarity). The ESI-Q-TOF MS analysis confirmed that PD-Ls 1-3 were glycosylated at different sites. In particular, PD-L1 contained three glycidic chains with the well known paucidomannosidic structure (Man)(3) (GlcNAc)(2) (Fuc)(1) (Xyl)(1) linked to Asn10, Asn43 and Asn255. PD-L2 was glycosylated at Asn10 and Asn43, and PD-L3 was glycosylated only at Asn10. PD-L4 was confirmed to be not glycosylated. Despite an overall high structural similarity, the comparative modeling of PD-L1, PD-L2, PD-L3 and PD-L4 has shown potential influences of the glycidic chains on their adenine polynucleotide glycosylase activity on different substrates.

  5. Olfactory Proteins Mediating Chemical Communication in the Navel Orangeworm Moth, Amyelois transitella

    PubMed Central

    Leal, Walter S.; Ishida, Yuko; Pelletier, Julien; Xu, Wei; Rayo, Josep; Xu, Xianzhong; Ames, James B.

    2009-01-01

    Background The navel orangeworm, Amyelois transitella Walker (Lepidoptera: Pyralidae), is the most serious insect pest of almonds and pistachios in California for which environmentally friendly alternative methods of control — like pheromone-based approaches — are highly desirable. Some constituents of the sex pheromone are unstable and could be replaced with parapheromones, which may be designed on the basis of molecular interaction of pheromones and pheromone-detecting olfactory proteins. Methodology By analyzing extracts from olfactory and non-olfactory tissues, we identified putative olfactory proteins, obtained their N-terminal amino acid sequences by Edman degradation, and used degenerate primers to clone the corresponding cDNAs by SMART RACE. Additionally, we used degenerate primers based on conserved sequences of known proteins to fish out other candidate olfactory genes. We expressed the gene encoding a newly identified pheromone-binding protein, which was analyzed by circular dichroism, fluorescence, and nuclear magnetic resonance, and used in a binding assay to assess affinity to pheromone components. Conclusion We have cloned nine cDNAs encoding olfactory proteins from the navel orangeworm, including two pheromone-binding proteins, two general odorant-binding proteins, one chemosensory protein, one glutathione S-transferase, one antennal binding protein X, one sensory neuron membrane protein, and one odorant receptor. Of these, AtraPBP1 is highly enriched in male antennae. Fluorescence, CD and NMR studies suggest a dramatic pH-dependent conformational change, with high affinity to pheromone constituents at neutral pH and no binding at low pH. PMID:19789654

  6. Processing of an anglerfish somatostatin precursor to a hydroxylysine-containing somatostatin 28.

    PubMed Central

    Spiess, J; Noe, B D

    1985-01-01

    A novel 28-residue somatostatin (SS) has been isolated from anglerfish pancreatic islets and characterized by complete Edman degradation, peptide mapping, and amino acid analysis. The primary structure of this anglerfish SS-28 (aSS-28) containing hydroxylysine (Hyl) was established to be H-Ser-Val-Asp-Ser-Thr-Asn-Asn-Leu-Pro-Pro-Arg-Glu-Arg-Lys-Ala-Gly-Cys- Lys-Asn-Phe-Tyr-Trp-Hyl-Gly-Phe-Thr-Ser-Cys-OH. This sequence (with the exception of hydroxylysine-23, which is replaced by lysine) is identical to the sequence of the COOH-terminal 28 residues of prepro-SS II predicted on the basis of cDNA analysis [Hobart, P., Crawford, R., Shen, L., Pictet, R. & Rutter, W. J. (1980) Nature (London) 288, 137-141]. This is the first instance in which hydroxylysine (to date characteristically observed in collagen or collagen-like structures) has been found in a potential regulatory peptide. Chromatographic characterization of peptides, radiolabeled in islet culture, revealed that aSS-28 contained 10-12% of the radioactivity incorporated into the 8000- to 1000-dalton SS-like polypeptides, whereas 88-90% of this radioactivity was detected in anglerfish SS-14. It appears probable that aSS-28 represents the predominant primary cleavage product derived from prepro-SS II by cleavage at the COOH-terminal side of a single arginine. Based on knowledge of the collagen biosynthesis, it is speculated that hydroxylation may take place as an early post-translational event. Images PMID:2857489

  7. Methods and devices for protein assays

    DOEpatents

    Chhabra, Swapnil [San Jose, CA; Cintron, Jose M [Indianapolis, IN; Shediac, Renee [Oakland, CA

    2009-11-03

    Methods and devices for protein assays based on Edman degradation in microfluidic channels are disclosed herein. As disclosed, the cleaved amino acid residues may be immobilized in an array format and identified by detectable labels, such as antibodies, which specifically bind given amino acid residues. Alternatively, the antibodies are immobilized in an array format and the cleaved amino acids are labeled identified by being bound by the antibodies in the array.

  8. Ocellatin peptides from the skin secretion of the South American frog Leptodactylus labyrinthicus (Leptodactylidae): characterization, antimicrobial activities and membrane interactions.

    PubMed

    Gusmão, Karla A G; Dos Santos, Daniel M; Santos, Virgílio M; Cortés, María Esperanza; Reis, Pablo V M; Santos, Vera L; Piló-Veloso, Dorila; Verly, Rodrigo M; de Lima, Maria Elena; Resende, Jarbas M

    2017-01-01

    The availability of antimicrobial peptides from several different natural sources has opened an avenue for the discovery of new biologically active molecules. To the best of our knowledge, only two peptides isolated from the frog Leptodactylus labyrinthicus , namely pentadactylin and ocellatin-F1, have shown antimicrobial activities. Therefore, in order to explore the antimicrobial potential of this species, we have investigated the biological activities and membrane interactions of three peptides isolated from the anuran skin secretion. Three peptide primary structures were determined by automated Edman degradation. These sequences were prepared by solid-phase synthesis and submitted to activity assays against gram-positive and gram-negative bacteria and against two fungal strains. The hemolytic properties of the peptides were also investigated in assays with rabbit blood erythrocytes. The conformational preferences of the peptides and their membrane interactions have been investigated by circular dichroism spectroscopy and liposome dye release assays. The amino acid compositions of three ocellatins were determined and the sequences exhibit 100% homology for the first 22 residues (ocellatin-LB1 sequence). Ocellatin-LB2 carries an extra Asn residue and ocellatin-F1 extra Asn-Lys-Leu residues at C-terminus. Ocellatin-F1 presents a stronger antibiotic potential and a broader spectrum of activities compared to the other peptides. The membrane interactions and pore formation capacities of the peptides correlate directly with their antimicrobial activities, i.e., ocellatin-F1 > ocellatin-LB1 > ocellatin-LB2. All peptides acquire high helical contents in membrane environments. However, ocellatin-F1 shows in average stronger helical propensities. The obtained results indicate that the three extra amino acid residues at the ocellatin-F1 C-terminus play an important role in promoting stronger peptide-membrane interactions and antimicrobial properties. The extra Asn-23 residue present in ocellatin-LB2 sequence seems to decrease its antimicrobial potential and the strength of the peptide-membrane interactions.

  9. Antagonistic Activities of Novel Peptides from Bacillus amyloliquefaciens PT14 against Fusarium solani and Fusarium oxysporum.

    PubMed

    Kim, Young Gwon; Kang, Hee Kyoung; Kwon, Kee-Deok; Seo, Chang Ho; Lee, Hyang Burm; Park, Yoonkyung

    2015-12-09

    Bacillus species have recently drawn attention due to their potential use in the biological control of fungal diseases. This paper reports on the antifungal activity of novel peptides isolated from Bacillus amyloliquefaciens PT14. Reverse-phase high-performance liquid chromatography revealed that B. amyloliquefaciens PT14 produces five peptides (PT14-1, -2, -3, -4a, and -4b) that exhibit antifungal activity but are inactive against bacterial strains. In particular, PT14-3 and PT14-4a showed broad-spectrum antifungal activity against Fusarium solani and Fusarium oxysporum. The PT14-4a N-terminal amino acid sequence was identified through Edman degradation, and a BLAST homology analysis showed it not to be identical to any other protein or peptide. PT14-4a displayed strong fungicidal activity with minimal inhibitory concentrations of 3.12 mg/L (F. solani) and 6.25 mg/L (F. oxysporum), inducing severe morphological deformation in the conidia and hyphae. On the other hand, PT14-4a had no detectable hemolytic activity. This suggests PT14-4a has the potential to serve as an antifungal agent in clinical therapeutic and crop-protection applications.

  10. The substrate degradome of meprin metalloproteases reveals an unexpected proteolytic link between meprin β and ADAM10.

    PubMed

    Jefferson, Tamara; Auf dem Keller, Ulrich; Bellac, Caroline; Metz, Verena V; Broder, Claudia; Hedrich, Jana; Ohler, Anke; Maier, Wladislaw; Magdolen, Viktor; Sterchi, Erwin; Bond, Judith S; Jayakumar, Arumugam; Traupe, Heiko; Chalaris, Athena; Rose-John, Stefan; Pietrzik, Claus U; Postina, Rolf; Overall, Christopher M; Becker-Pauly, Christoph

    2013-01-01

    The in vivo roles of meprin metalloproteases in pathophysiological conditions remain elusive. Substrates define protease roles. Therefore, to identify natural substrates for human meprin α and β we employed TAILS (terminal amine isotopic labeling of substrates), a proteomics approach that enriches for N-terminal peptides of proteins and cleavage fragments. Of the 151 new extracellular substrates we identified, it was notable that ADAM10 (a disintegrin and metalloprotease domain-containing protein 10)-the constitutive α-secretase-is activated by meprin β through cleavage of the propeptide. To validate this cleavage event, we expressed recombinant proADAM10 and after preincubation with meprin β, this resulted in significantly elevated ADAM10 activity. Cellular expression in murine primary fibroblasts confirmed activation. Other novel substrates including extracellular matrix proteins, growth factors and inhibitors were validated by western analyses and enzyme activity assays with Edman sequencing confirming the exact cleavage sites identified by TAILS. Cleavages in vivo were confirmed by comparing wild-type and meprin(-/-) mice. Our finding of cystatin C, elafin and fetuin-A as substrates and natural inhibitors for meprins reveal new mechanisms in the regulation of protease activity important for understanding pathophysiological processes.

  11. Synthesis and screening of one-bead-one-compound cyclic peptide libraries.

    PubMed

    Qian, Ziqing; Upadhyaya, Punit; Pei, Dehua

    2015-01-01

    Cyclic peptides have been a rich source of biologically active molecules. Herein we present a method for the combinatorial synthesis and screening of large one-bead-one-compound (OBOC) libraries of cyclic peptides against biological targets such as proteins. Up to ten million different cyclic peptides are rapidly synthesized on TentaGel microbeads by the split-and-pool synthesis method and subjected to a multistage screening protocol which includes magnetic sorting, on-bead enzyme-linked and fluorescence-based assays, and in-solution binding analysis of cyclic peptides selectively released from single beads by fluorescence anisotropy. Finally, the most active hit(s) is identified by the partial Edman degradation-mass spectrometry (PED-MS) method. This method allows a single researcher to synthesize and screen up to ten million cyclic peptides and identify the most active ligand(s) in ~1 month, without the time-consuming and expensive hit resynthesis or the use of any special equipment.

  12. Personal Protective Measures Against Insects and Other Arthropods of Military Significance

    DTIC Science & Technology

    2009-10-01

    responsibility, although it is also an important adjunct to unit- level and higher echelon preventive medicine countermeasures. Military personnel must be aware...fabric treatment level of 0.52% weight by weight of active ingredient. (d) Several steps are essential in properly using this treatment method...Control as a Force Multiplier. Defense 90. pp. 26-35. Elridge, B.F. and Edman, J.D. 2004. Medical Entomology: A Textbook on Public Health and

  13. The primary structure of fatty-acid-binding protein from nurse shark liver. Structural and evolutionary relationship to the mammalian fatty-acid-binding protein family.

    PubMed

    Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M

    1992-02-01

    The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (61-53% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs. The presence in shark liver of an FABP which differs substantially in primary structure from mammalian liver FABP, while being closely related to the FABP expressed in mammalian heart muscle, peripheral nerve myelin and adipocytes, opens a further dimension regarding the question of the existence of structure-dependent and tissue-specific specialization of FABP function in lipid metabolism.

  14. Venom gland transcriptomic and venom proteomic analyses of the scorpion Megacormus gertschi Díaz-Najera, 1966 (Scorpiones: Euscorpiidae: Megacorminae).

    PubMed

    Santibáñez-López, Carlos E; Cid-Uribe, Jimena I; Zamudio, Fernando Z; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D

    2017-07-01

    The soluble venom from the Mexican scorpion Megacormus gertschi of the family Euscorpiidae was obtained and its biological effects were tested in several animal models. This venom is not toxic to mice at doses of 100 μg per 20 g of mouse weight, while being lethal to arthropods (insects and crustaceans), at doses of 20 μg (for crickets) and 100 μg (for shrimps) per animal. Samples of the venom were separated by high performance liquid chromatography and circa 80 distinct chromatographic fractions were obtained from which 67 components have had their molecular weights determined by mass spectrometry analysis. The N-terminal amino acid sequence of seven protein/peptides were obtained by Edman degradation and are reported. Among the high molecular weight components there are enzymes with experimentally-confirmed phospholipase activity. A pair of telsons from this scorpion species was dissected, from which total RNA was extracted and used for cDNA library construction. Massive sequencing by the Illumina protocol, followed by de novo assembly, resulted in a total of 110,528 transcripts. From those, we were able to annotate 182, which putatively code for peptides/proteins with sequence similarity to previously-reported venom components available from different protein databases. Transcripts seemingly coding for enzymes showed the richest diversity, with 52 sequences putatively coding for proteases, 20 for phospholipases, 8 for lipases and 5 for hyaluronidases. The number of different transcripts potentially coding for peptides with sequence similarity to those that affect ion channels was 19, for putative antimicrobial peptides 19, and for protease inhibitor-like peptides, 18. Transcripts seemingly coding for other venom components were identified and described. The LC/MS analysis of a trypsin-digested venom aliquot resulted in 23 matches with the translated transcriptome database, which validates the transcriptome. The proteomic and transcriptomic analyses reported here constitute the first approach to study the venom components from a scorpion species belonging to the family Euscorpiidae. The data certainly show that this venom is different from all the ones described thus far in the literature. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Identification of cysteine-644 as the covalent site of attachment of dexamethasone 21-mesylate to murine glucocorticoid receptors in WEHI-7 cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smith, L.I.; Bodwell, J.E.; Mendel, D.B.

    1988-05-17

    Dexamethasone 21-mesylate is a highly specific synthetic glucocorticoid derivative that binds covalently to glucocorticoid receptors via sulfhydryl groups. The authors have identified the amino acid that reacts with the dexamethasone 21-mesylate by using enzymatic digestion and microsequencing for radiolabel. Nonactivated glucocorticoid receptors obtained from labeling intact WEHI-7 mouse thymoma cells with (/sup 3/H)dexamethasone 21-mesylate were immunopurified and analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Trypsin digestion followed by reversed-phase high-performance liquid chromatography (reversed-phase HPLC) produced a single (/sup 3/H)dexamethasone 21-mesylate labeled peptide. Automated Edman degradation of this peptide revealed that the (/sup 3/H)dexamethasone 21-mesylate was located at position 5 frommore » the amino terminus. Dual-isotope labeling studies with (/sup 3/H)dexamethasone 21-mesylate and (/sup 35/S)methionine demonstrated that this peptide contained methionine. Staphylococcus aureus V8 protease digestion of (/sup 3/H)dexamethasone 21-mesylate labeled steroid-binding subunits generated a different radiolabeled peptide containing label at position 7 from the amino terminus. On the basis of the published amino acid sequence of the murine glucocorticoid receptor, their data clearly identify cysteine-644 as the single residue in the steroid-binding domain that covalently binds dexamethasone 21-mesylate. They have confirmed this finding by demonstrating that a synthetic peptide representing the amino acid sequence 640-650 of the murine glucocorticoid receptor behaves in an identical manner on reversed-phase HPLC as the trypsin-generated peptide from intact cells.« less

  16. When is Mass Spectrometry Combined with Affinity Approaches Essential? A Case Study of Tyrosine Nitration in Proteins

    NASA Astrophysics Data System (ADS)

    Petre, Brînduşa-Alina; Ulrich, Martina; Stumbaum, Mihaela; Bernevic, Bogdan; Moise, Adrian; Döring, Gerd; Przybylski, Michael

    2012-11-01

    Tyrosine nitration in proteins occurs under physiologic conditions and is increased at disease conditions associated with oxidative stress, such as inflammation and Alzheimer's disease. Identification and quantification of tyrosine-nitrations are crucial for understanding nitration mechanism(s) and their functional consequences. Mass spectrometry (MS) is best suited to identify nitration sites, but is hampered by low stabilities and modification levels and possible structural changes induced by nitration. In this insight, we discuss methods for identifying and quantifying nitration sites by proteolytic affinity extraction using nitrotyrosine (NT)-specific antibodies, in combination with electrospray-MS. The efficiency of this approach is illustrated by identification of specific nitration sites in two proteins in eosinophil granules from several biological samples, eosinophil-cationic protein (ECP) and eosinophil-derived neurotoxin (EDN). Affinity extraction combined with Edman sequencing enabled the quantification of nitration levels, which were found to be 8 % and 15 % for ECP and EDN, respectively. Structure modeling utilizing available crystal structures and affinity studies using synthetic NT-peptides suggest a tyrosine nitration sequence motif comprising positively charged residues in the vicinity of the NT- residue, located at specific surface- accessible sites of the protein structure. Affinities of Tyr-nitrated peptides from ECP and EDN to NT-antibodies, determined by online bioaffinity- MS, provided nanomolar KD values. In contrast, false-positive identifications of nitrations were obtained in proteins from cystic fibrosis patients upon using NT-specific antibodies, and were shown to be hydroxy-tyrosine modifications. These results demonstrate affinity- mass spectrometry approaches to be essential for unequivocal identification of biological tyrosine nitrations.

  17. Thionin-like peptides from Capsicum annuum fruits with high activity against human pathogenic bacteria and yeasts.

    PubMed

    Taveira, Gabriel B; Mathias, Luciana S; da Motta, Olney V; Machado, Olga L T; Rodrigues, Rosana; Carvalho, André O; Teixeira-Ferreira, André; Perales, Jonas; Vasconcelos, Ilka M; Gomes, Valdirene M

    2014-01-01

    Plants defend themselves against pathogens with production of antimicrobial peptides (AMPs). Herein we describe the discovery of a new antifungal and antibacterial peptide from fruits of Capsicum annuum that showed similarity to an already well characterized family of plant AMPs, thionins. Other fraction composed of two peptides, in which the major peptide also showed similarity to thionins. Among the obtained fractions, fraction 1, which is composed of a single peptide of 7 kDa, was sequenced by Edman method and its comparative sequence analysis in database (nr) showed similarity to thionin-like peptides. Tests against microorganisms, fraction 1 presented inhibitory activity to the cells of yeast Saccharomyces cerevisiae, Candida albicans, and Candida tropicalis and caused growth reduction to the bacteria species Escherichia coli and Pseudomonas aeruginosa. Fraction 3 caused inhibitory activity only for C. albicans and C. tropicalis. This fraction was composed of two peptides of ∼7 and 10 kDa, and the main protein band correspondent to the 7 kDa peptide, also showed similarity to thionins. This plasma membrane permeabilization assay demonstrates that the peptides present in the fractions 1 and 3 induced changes in the membranes of all yeast strains, leading to their permeabilization. Fraction 1 was capable of inhibiting acidification of the medium of glucose-induced S. cerevisiae cells 78% after an incubation time of 30 min, and opposite result was obtained for C. albicans. Experiments demonstrate that the fraction 1 and 3 were toxic and induced changes in the membranes of all yeast strains, leading to their permeabilization. Copyright © 2013 Wiley Periodicals, Inc.

  18. The adipokinetic hormone of the coleopteran suborder Adephaga: Structure, function, and comparison of distribution in other insects.

    PubMed

    Gäde, Gerd; Marco, Heather G

    2017-07-01

    The aim of the current study is to identify the adipokinetic hormone(s) (AKHs) of a basal suborder of the species-rich Coleoptera, the Adephaga, and possibly learn more about the ancestral AKH of beetles. Moreover, we wanted to compare the ancestral AKH with AKHs of more advanced beetles, of which a number are pest insects. This would allow us to assess whether AKH mimetics would be suitable as insecticides, that is, be harmful to the pest species but not to the beneficial species. Nine species of the Adephaga were investigated and all synthesize only one octapeptide in the corpus cardiacum, as revealed by Edman degradation sequencing techniques or by mass spectrometry. The amino acid sequence pGlu-Leu-Asn-Phe-Ser-Thr-Gly-Trp corresponds to Schgr-AKH-II that was first identified in the desert locust. It is assumed that Schgr-AKH-II-the peptide of a basal beetle clade-is the ancestral AKH for beetles. Some other beetle families, as well as some Hymenoptera (including honey bees) also contain this peptide, whereas most of the pest beetle species have different AKHs. This argues that those peptides and their receptors should be explored for developing mimetics with insecticidal properties. A scenario where Schgr-AKH-II (the only AKH of Adephaga) is used as basic molecular structure to derive almost all other known beetle AKHs via single step mutations is very likely, and supports the interpretation that Schgr-AKH-II is the ancestral AKH of Coleoptera. © 2017 Wiley Periodicals, Inc.

  19. Identification and immunologic characterization of an allergen, alliin lyase, from garlic (Allium sativum).

    PubMed

    Kao, Shao-Hsuan; Hsu, Ching-Hsian; Su, Song-Nan; Hor, Wei-Ting; Chang T, Wen-Hong; Chow, Lu-Ping

    2004-01-01

    Garlic (Allium sativum) is one of the most common relishes used in cooking worldwide. Very few garlic allergens have been reported, and garlic allergy has been rarely studied. The aim of the study was to identify allergenic proteins in garlic and to investigate their importance in allergies to other Allium species (leek, shallot, and onion). A crude extract of garlic proteins was separated by SDS-PAGE and 2-dimensional electrophoresis; immunoblotting was then performed with the use of individual and pooled sera from patients with garlic allergy, and the major IgE-binding proteins were analyzed by amino acid sequencing and mass spectrometry. The putative allergens were further purified by chromatography; the antigenicity, allergenicity, and IgE-binding cross-reactivity of the purified protein were then studied by immunoblotting, periodate oxidation, skin tests, and IgE-binding inhibition assays. A major allergen, alliin lyase, was identified by mass spectrometry and Edman sequencing and purified to homogeneity through the use of a simple 2-step chromatographic method. Skin tests showed that the purified protein elicited IgE-mediated hypersensitive responses in patients with garlic allergy. Periodate oxidation showed that carbohydrate groups were involved in the antigenicity, allergenicity, and cross-reactivity. Garlic alliin lyase showed strong cross-reactivity with alliin lyases from other Allium species, namely leek, shallot, and onion. Alliin lyase was found to be a major garlic allergen in a garlic-allergic group of patients in Taiwan. The wide distribution of alliin lyase in Allium suggests it may be a new cross-reactive allergen.

  20. Peroxisomal copper, zinc superoxide dismutase. Characterization of the isoenzyme from watermelon cotyledons.

    PubMed Central

    Bueno, P; Varela, J; Gimeénez-Gallego, G; del Río, L A

    1995-01-01

    The biochemical and immunochemical characterization of a superoxide dismutase (SOD, EC 1.15.1.1) from peroxisomal origin has been carried out. The enzyme is a Cu,Zn-containing SOD (CuZn-SOD) located in the matrix of peroxisomes from watermelon (Citrullus vulgaris Schrad.) cotyledons (L.M. Sandalio and L.A. del Río [1988] Plant Physiol 88: 1215-1218). The amino acid composition of the enzyme was determined. Analysis by reversed-phase high-performance liquid chromatography of the peroxisomal CuZn-SOD incubated with 6 M guanidine-HCl indicated that this enzyme contained a noncovalently bound chromophore group that was responsible for the absorbance peak of the native enzyme at 260 nm. The amino acid sequence of the peroxisomal CuZn-SOD was determined by Edman degradation. Comparison of its sequence with those reported for other plant SODs revealed homologies of about 70% with cytosolic CuZn-SODs and of 90% with chloroplastic CuZn-SODs. The peroxisomal SOD has a high thermal stability and resistance to inactivation by hydrogen peroxide. A polyclonal antibody was raised against peroxisomal CuZn-SOD, and by western blotting the antibody cross-reacted with plant CuZn-SODs but did not recognize either plant Mn-SOD or bacterial Fe-SOD. The antiSOD-immunoglobulin G showed a weak cross-reaction with bovine erythrocytes and liver CuZn-SODs, and also with cell-free extracts from trout liver. The possible function of this CuZn-SOD in the oxidative metabolism of peroxisomes is discussed. PMID:7630940

  1. Antibacterial activity in bovine lactoferrin-derived peptides.

    PubMed Central

    Hoek, K S; Milne, J M; Grieve, P A; Dionysius, D A; Smith, R

    1997-01-01

    Several peptides sharing high sequence homology with lactoferricin B (Lf-cin B) were generated from bovine lactoferrin (Lf) with recombinant chymosin. Two peptides were copurified, one identical to Lf-cin B and another differing from Lf-cin B by the inclusion of a C-terminal alanine (lactoferricin). Two other peptides were copurified from chymosin-hydrolyzed Lf, one differing from Lf-cin B by the inclusion of C-terminal alanyl-leucine and the other being a heterodimer linked by a disulfide bond. These peptides were isolated in a single step from chymosin-hydrolyzed Lf by membrane ion-exchange chromatography and were purified by reverse-phase high-pressure liquid chromatography (HPLC). They were characterized by N-terminal Edman sequencing, mass spectrometry, and antibacterial activity determination. Pure lactoferricin, prepared from pepsin-hydrolyzed Lf, was purified by standard chromatography techniques. This peptide was analyzed against a number of gram-positive and gram-negative bacteria before and after reduction of its disulfide bond or cleavage after its single methionine residue and was found to inhibit the growth of all the test bacteria at a concentration of 8 microM or less. Subfragments of lactoferricin were isolated from reduced and cleaved peptide by reverse-phase HPLC. Subfragment 1 (residues 1 to 10) was active against most of the test microorganisms at concentrations of 10 to 50 microM. Subfragment 2 (residues 11 to 26) was active against only a few microorganisms at concentrations up to 100 microM. These antibacterial studies indicate that the activity of lactoferricin is mainly, but not wholly, due to its N-terminal region. PMID:8980754

  2. Biochemical and Genetic Evidence that Enterococcus faecium L50 Produces Enterocins L50A and L50B, the sec-Dependent Enterocin P, and a Novel Bacteriocin Secreted without an N-Terminal Extension Termed Enterocin Q

    PubMed Central

    Cintas, Luis M.; Casaus, Pilar; Herranz, Carmen; Håvarstein, Leiv Sigve; Holo, Helge; Hernández, Pablo E.; Nes, Ingolf F.

    2000-01-01

    Enterococcus faecium L50 grown at 16 to 32°C produces enterocin L50 (EntL50), consisting of EntL50A and EntL50B, two unmodified non-pediocin-like peptides synthesized without an N-terminal leader sequence or signal peptide. However, the bacteriocin activity found in the cell-free culture supernatants following growth at higher temperatures (37 to 47°C) is not due to EntL50. A purification procedure including cation-exchange, hydrophobic interaction, and reverse-phase liquid chromatography has shown that the antimicrobial activity is due to two different bacteriocins. Amino acid sequences obtained by Edman degradation and DNA sequencing analyses revealed that one is identical to the sec-dependent pediocin-like enterocin P produced by E. faecium P13 (L. M. Cintas, P. Casaus, L. S. Håvarstein, P. E. Hernández, and I. F. Nes, Appl. Environ. Microbiol. 63:4321–4330, 1997) and the other is a novel unmodified non-pediocin-like bacteriocin termed enterocin Q (EntQ), with a molecular mass of 3,980. DNA sequencing analysis of a 963-bp region of E. faecium L50 containing the enterocin P structural gene (entP) and the putative immunity protein gene (entiP) reveals a genetic organization identical to that previously found in E. faecium P13. DNA sequencing analysis of a 1,448-bp region identified two consecutive but diverging open reading frames (ORFs) of which one, termed entQ, encodes a 34-amino-acid protein whose deduced amino acid sequence was identical to that obtained for EntQ by amino acid sequencing, showing that EntQ, similarly to EntL50A and EntL50B, is synthesized without an N-terminal leader sequence or signal peptide. The second ORF, termed orf2, was located immediately upstream of and in opposite orientation to entQ and encodes a putative immunity protein composed of 221 amino acids. Bacteriocin production by E. faecium L50 showed that EntP and EntQ are produced in the temperature range from 16 to 47°C and maximally detected at 47 and 37 to 47°C, respectively, while EntL50A and EntL50B are maximally synthesized at 16 to 25°C and are not detected at 37°C or above. PMID:11073927

  3. Angiotensin I-Converting Enzyme Inhibitor Derived from Cross-Linked Oyster Protein

    PubMed Central

    Xie, Cheng-Liang; Kim, Jin-Soo; Ha, Jong-Myung; Choung, Se-Young

    2014-01-01

    Following cross-linking by microbial transglutaminase, modified oyster proteins were hydrolyzed to improve inhibitory activity against angiotensin-converting enzyme (ACE) inhibitory activity with the use of a single protease, or a combination of six proteases. The oyster hydrolysate with the lowest 50% ACE inhibitory concentration (IC50) of 0.40 mg/mL was obtained by two-step hydrolysis of the cross-linked oyster protein using Protamex and Neutrase. Five ACE inhibitory peptides were purified from the oyster hydrolysate using a multistep chromatographic procedure comprised of ion-exchange, size exclusion, and reversed-phase liquid chromatography. Their sequences were identified as TAY, VK, KY, FYN, and YA, using automated Edman degradation and mass spectrometry. These peptides were synthesized, and their IC50 values were measured to be 16.7, 29.0, 51.5, 68.2, and 93.9 μM, respectively. Toxicity of the peptides on the HepG2 cell line was not detected. The oyster hydrolysate also significantly decreased the systolic blood pressure of spontaneously hypertensive rats (SHR). The antihypertensive effect of the oyster hydrolysate on SHR was rapid and long-lasting, compared to commercially obtained sardine hydrolysate. These results suggest that the oyster hydrolysate could be a source of effective nutraceuticals against hypertension. PMID:25140307

  4. Purification and characterization of an antibacterial and anti-inflammatory polypeptide from Arca subcrenata.

    PubMed

    Chen, Yuyan; Li, Chunlei; Zhu, Jianhua; Xie, Wangshi; Hu, Xianjing; Song, Liyan; Zi, Jiachen; Yu, Rongmin

    2017-03-01

    A polypeptide coded as PGC was isolated from Arca subcrenata muscle using ion exchange, Sephadex G-50 gel chromatography and RP-HPLC. PGC was identified to be a homogeneous compound by Native-PAGE and the purity was more than 98.9% measured by HPLC. The isoelectric point of PGC was determined to be 9.76 by IEF-PAGE. The molecular weight was determined to be 15,973.0Da by ESI-MS/MS. The conformational structure of PGC was characterized by UV-vis, FT-IR and CD spectroscopy. N terminal amino acid sequence of PGC was shown as PSVYDAAAQLTADVKKDLRDSWKVIGGDKKGNGVA by Edman degradation. The results demonstrated that there is a high degree of homology between PGC and the subunit from hemoglobin, and proposed that PGC is the depolymerized polypeptide of Hemoglobin I (HbI) from A. subcrenata. The evaluation of biological activities showed that the diameters of the inhibitory ring of PGC on Escherichia coli and Staphylococcus aureus were 14.5±0.44mm and 16.5±1.15mm, respectively. The IC 50 of inhibition rate for PGC on NO production was 9.60±0.71μg/mL. Therefore, PGC might be developed as one of potential antibacterial and anti-inflammatory agents. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. Asymmetric arginine dimethylation of heterogeneous nuclear ribonucleoprotein K by protein-arginine methyltransferase 1 inhibits its interaction with c-Src.

    PubMed

    Ostareck-Lederer, Antje; Ostareck, Dirk H; Rucknagel, Karl P; Schierhorn, Angelika; Moritz, Bodo; Huttelmaier, Stefan; Flach, Nadine; Handoko, Lusy; Wahle, Elmar

    2006-04-21

    Arginine methylation is a post-translational modification found in many RNA-binding proteins. Heterogeneous nuclear ribonucleoprotein K (hnRNP K) from HeLa cells was shown, by mass spectrometry and Edman degradation, to contain asymmetric N(G),N(G)-dimethylarginine at five positions in its amino acid sequence (Arg256, Arg258, Arg268, Arg296, and Arg299). Whereas these five residues were quantitatively modified, Arg303 was asymmetrically dimethylated in <33% of hnRNP K and Arg287 was monomethylated in <10% of the protein. All other arginine residues were unmethylated. Protein-arginine methyltransferase 1 was identified as the only enzyme methylating hnRNP K in vitro and in vivo. An hnRNP K variant in which the five quantitatively modified arginine residues had been substituted was not methylated. Methylation of arginine residues by protein-arginine methyltransferase 1 did not influence the RNA-binding activity, the translation inhibitory function, or the cellular localization of hnRNP K but reduced the interaction of hnRNP K with the tyrosine kinase c-Src. This led to an inhibition of c-Src activation and hnRNP K phosphorylation. These findings support the role of arginine methylation in the regulation of protein-protein interactions.

  6. Conformational and Functional Effects Induced by D- and L-Amino Acid Epimerization on a Single Gene Encoded Peptide from the Skin Secretion of Hypsiboas punctatus

    PubMed Central

    de Magalhães, Mariana T. Q.; Barbosa, Eder A.; Prates, Maura V.; Verly, Rodrigo M.; Munhoz, Victor Hugo O.; de Araújo, Ivan E.; Bloch, Carlos

    2013-01-01

    Skin secretion of Hypsiboas punctatus is the source of a complex mixture of bioactive compounds where peptides and small proteins prevail, similarly to many other amphibians. Among dozens of molecules isolated from H. punctatus in a proteomic based approach, we report here the structural and functional studies of a novel peptide named Phenylseptin (FFFDTLKNLAGKVIGALT-NH2) that was purified as two naturally occurring D- and L-Phes configurations. The amino acid epimerization and C-terminal amidation for both molecules were confirmed by a combination of techniques including reverse-phase UFLC, ion mobility mass spectrometry, high resolution MS/MS experiments, Edman degradation, cDNA sequencing and solid-phase peptide synthesis. RMSD analysis of the twenty lowest-energy 1H NMR structures of each peptide revealed a major 90° difference between the two backbones at the first four N-terminal residues and substantial orientation changes of their respective side chains. These structural divergences were considered to be the primary cause of the in vitro quantitative differences in antimicrobial activities between the two molecules. Finally, both molecules elicited equally aversive reactions in mice when delivered orally, an effect that depended entirely on peripheral gustatory pathways. PMID:23565145

  7. Structure and biological activities of eumenine mastoparan-AF (EMP-AF), a new mast cell degranulating peptide in the venom of the solitary wasp (Anterhynchium flavomarginatum micado).

    PubMed

    Konno, K; Hisada, M; Naoki, H; Itagaki, Y; Kawai, N; Miwa, A; Yasuhara, T; Morimoto, Y; Nakata, Y

    2000-11-01

    A new mast cell degranulating peptide, eumenine mastoparan-AF (EMP-AF), was isolated from the venom of the solitary wasp Anterhynchium flavomarginatum micado, the most common eumenine wasp found in Japan. The structure was analyzed by FAB-MS/MS together with Edman degradation, which was corroborated by solid-phase synthesis. The sequence of EMP-AF, Ile-Asn-Leu-Leu-Lys-Ile-Ala-Lys-Gly-Ile-Ile-Lys-Ser-Leu-NH(2), was similar to that of mastoparan, a mast cell degranulating peptide from a hornet venom; tetradecapeptide with C-terminus amidated and rich in hydrophobic and basic amino acids. In fact, EMP-AF exhibited similar activity to mastoparan in stimulating degranulation from rat peritoneal mast cells and RBL-2H3 cells. It also showed significant hemolytic activity in human erythrocytes. Therefore, this is the first example that a mast cell degranulating peptide is found in the solitary wasp venom. Besides the degranulation and hemolytic activity, EMP-AF also affects on neuromuscular transmission in the lobster walking leg preparation. Three analogs EMP-AF-1 approximately 3 were snythesized and biologically tested together with EMP-AF, resulting in the importance of the C-terminal amide structure for biological activities.

  8. Studies on the plasma membrane H sup + -ATPase of oat roots: Preparation and assay, cytological localization, and sulfhydryl chemistry

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Katz, D.B.

    1989-01-01

    Biochemical and cytological studies were performed on the plasma membrane proton pump (H{sup +}-ATPase) of oat roots (Avena sativa cv. Stout). H{sup +}-ATPase activity in oat root plasma membranes is inhibited by N-ethylmaleimide (NEM), a covalent modifier of protein sulfhydryl groups. The rate of inhibition is reduced in the presence of ADP or MgADP. An M{sub r} = 100,000 plasma membrane polypeptide showed reduced labelling by ({sup 3}H)NEM in the presence of ADP. When tryptic peptides from ({sup 3}H)NEM-labeled M{sub r} = 100,000 polypeptide were separated by reverse-phase high-pressure liquid chromatography (HPLC), only one radioactive peak consistently showed labeling inmore » the presence of ADP. In order to determine the location and identity of the NEM-reactive residue, the radioactive peptide in this peak was further purified by HPLC. The amino acid sequence(s) in the resulting sample were then determined by Edman degradation on an automated gas-phase sequenator. The PTH-amino acids released at each cycle of the degradation were separated by HPLC. Analysis of the chromatograms suggested that the radio-labeled residue was located in a peptide of sequence V-E-N-Q-D-A-I-D-A-C{sup *}-M-V-G-M-L-A-D-P-K. The NEM-reactive residue was cysteine, based on the retention time of the radioactivity released. The ATP-hydrolyzing activity observed in electron micrographs by lead-precipitation of enzymically released inorganic phosphate was compared with that observed in in vitro assays of the soluble and plasma membrane fractions of oat root homogenates. Although an ATP-hydrolyzing activity was observed on the plasma membrane in the electron micrographs, its substrate specificity and inhibitor sensitivity was identical to that observed for phosphatase activity.« less

  9. Enterococcus faecium F58, a bacteriocinogenic strain naturally occurring in Jben, a soft, farmhouse goat's cheese made in Morocco.

    PubMed

    Achemchem, F; Martínez-Bueno, M; Abrini, J; Valdivia, E; Maqueda, M

    2005-01-01

    Characterization of Ent F-58 produced by Enterococcus faecium strain F58 isolated from Jben, a soft, farmhouse goat's cheese manufactured without starter cultures. E. faecium strain F58 was isolated because of its broad inhibitory spectrum, including activity against food-borne pathogenic and spoilage bacteria. The antimicrobial substance was produced during the growth phase, with maximum production after 16-20 h of incubation at 30 degrees C, and was stable over a wide pH range (4-8) and at high temperatures (5 min at 100 degrees C). The enterocin was purified to homogeneity using cation exchange and hydrophobic interaction on C-18 and reverse-phase high-performance liquid chromatography. The activity was eluted as two individual active fractions (F-58A and F-58B) and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry analysis showed masses of 5210.5 and 5234.3 Da respectively. Both peptides were partially sequenced by Edman degradation, and amino-acid sequencing revealed high similarity with enterocin L50 (I). PCR-amplified fragments containing the structural genes for F-58 A and B were located in a 22-kb plasmid harboured by this strain. We verified that it also holds the structural gene for P-like enterocin. E. faecium strain F58 from Jben cheese, a producer of enterocin L50, exerts an inhibitory effect against strains of genera such as Listeria, Staphylococcus, Clostridium, Brochothrix and Bacillus. Enterocin was characterized according to its functional and biological properties, purification to homogeneity and an analysis of its amino acid and genetic sequences. E. faecium strain F58 is a newly discovered producer of enterocin L50, the biotechnological characteristics of which indicate its potential for application as a protective agent against pathogens and spoilage bacteria in foods.

  10. Purification and Characterization of Suicin 65, a Novel Class I Type B Lantibiotic Produced by Streptococcus suis.

    PubMed

    Vaillancourt, Katy; LeBel, Geneviève; Frenette, Michel; Fittipaldi, Nahuel; Gottschalk, Marcelo; Grenier, Daniel

    2015-01-01

    Bacteriocins are antimicrobial peptides of bacterial origin that are considered as a promising alternative to the use of conventional antibiotics. Recently, our laboratory reported the purification and characterization of two lantibiotics, suicin 90-1330 and suicin 3908, produced by the swine pathogen and zoonotic agent Streptococcus suis (serotype 2). In this study, a novel bacteriocin produced by S. suis has been identified and characterized. The producing strain S. suis 65 (serotype 2) was found to belong to the sequence type 28, that includes strains known to be weakly or avirulent in a mouse model. The bacteriocin, whose production was only possible following growth on solid culture medium, was purified to homogeneity by cationic exchange and reversed-phase high-pressure liquid chromatography. The bacteriocin, named suicin 65, was heat, pH and protease resistant. Suicin 65 was active against all S. suis isolates tested, including antibiotic resistant strains. Amino acid sequencing of the purified bacteriocin by Edman degradation revealed the presence of modified amino acids suggesting a lantibiotic. Using the partial sequence obtained, a blast was performed against published genomes of S. suis and allowed to identify a putative lantibiotic locus in the genome of S. suis 89-1591. From this genome, primers were designed and the gene cluster involved in the production of suicin 65 by S. suis 65 was amplified by PCR. Sequence analysis revealed the presence of ten open reading frames, including a duplicate of the structural gene. The structural genes (sssA and sssA') of suicin 65 encodes a 25-amino acid residue leader peptide and a 26-amino acid residue mature peptide yielding an active bacteriocin with a deducted molecular mass of 3,005 Da. Mature suicin 65 showed a high degree of identity with class I type B lantibiotics (globular structure) produced by Streptococcus pyogenes (streptococcin FF22; 84.6%), Streptococcus macedonicus (macedocin ACA-DC 198; 84.6%), and Lactococcus lactis subsp. lactis (lacticin 481; 74.1%). Further studies will evaluate the ability of suicin 65 or the producing strain to prevent experimental S. suis infections in pigs.

  11. Purification and Characterization of Suicin 65, a Novel Class I Type B Lantibiotic Produced by Streptococcus suis

    PubMed Central

    Vaillancourt, Katy; LeBel, Geneviève; Frenette, Michel; Fittipaldi, Nahuel; Gottschalk, Marcelo; Grenier, Daniel

    2015-01-01

    Bacteriocins are antimicrobial peptides of bacterial origin that are considered as a promising alternative to the use of conventional antibiotics. Recently, our laboratory reported the purification and characterization of two lantibiotics, suicin 90–1330 and suicin 3908, produced by the swine pathogen and zoonotic agent Streptococcus suis (serotype 2). In this study, a novel bacteriocin produced by S. suis has been identified and characterized. The producing strain S. suis 65 (serotype 2) was found to belong to the sequence type 28, that includes strains known to be weakly or avirulent in a mouse model. The bacteriocin, whose production was only possible following growth on solid culture medium, was purified to homogeneity by cationic exchange and reversed-phase high-pressure liquid chromatography. The bacteriocin, named suicin 65, was heat, pH and protease resistant. Suicin 65 was active against all S. suis isolates tested, including antibiotic resistant strains. Amino acid sequencing of the purified bacteriocin by Edman degradation revealed the presence of modified amino acids suggesting a lantibiotic. Using the partial sequence obtained, a blast was performed against published genomes of S. suis and allowed to identify a putative lantibiotic locus in the genome of S. suis 89–1591. From this genome, primers were designed and the gene cluster involved in the production of suicin 65 by S. suis 65 was amplified by PCR. Sequence analysis revealed the presence of ten open reading frames, including a duplicate of the structural gene. The structural genes (sssA and sssA’) of suicin 65 encodes a 25-amino acid residue leader peptide and a 26-amino acid residue mature peptide yielding an active bacteriocin with a deducted molecular mass of 3,005 Da. Mature suicin 65 showed a high degree of identity with class I type B lantibiotics (globular structure) produced by Streptococcus pyogenes (streptococcin FF22; 84.6%), Streptococcus macedonicus (macedocin ACA-DC 198; 84.6%), and Lactococcus lactis subsp. lactis (lacticin 481; 74.1%). Further studies will evaluate the ability of suicin 65 or the producing strain to prevent experimental S. suis infections in pigs. PMID:26709705

  12. Peptidomics of Three Bothrops Snake Venoms: Insights Into the Molecular Diversification of Proteomes and Peptidomes*

    PubMed Central

    Tashima, Alexandre K.; Zelanis, André; Kitano, Eduardo S.; Ianzer, Danielle; Melo, Robson L.; Rioli, Vanessa; Sant'anna, Sávio S.; Schenberg, Ana C. G.; Camargo, Antônio C. M.; Serrano, Solange M. T.

    2012-01-01

    Snake venom proteomes/peptidomes are highly complex and maintenance of their integrity within the gland lumen is crucial for the expression of toxin activities. There has been considerable progress in the field of venom proteomics, however, peptidomics does not progress as fast, because of the lack of comprehensive venom sequence databases for analysis of MS data. Therefore, in many cases venom peptides have to be sequenced manually by MS/MS analysis or Edman degradation. This is critical for rare snake species, as is the case of Bothrops cotiara (BC) and B. fonsecai (BF), which are regarded as near threatened with extinction. In this study we conducted a comprehensive analysis of the venom peptidomes of BC, BF, and B. jararaca (BJ) using a combination of solid-phase extraction and reversed-phase HPLC to fractionate the peptides, followed by nano-liquid chromatography-tandem MS (LC-MS/MS) or direct infusion electrospray ionization-(ESI)-MS/MS or MALDI-MS/MS analyses. We detected marked differences in the venom peptidomes and identified peptides ranging from 7 to 39 residues in length by de novo sequencing. Forty-four unique sequences were manually identified, out of which 30 are new peptides, including 17 bradykinin-potentiating peptides, three poly-histidine-poly-glycine peptides and interestingly, 10 l-amino acid oxidase fragments. Some of the new bradykinin-potentiating peptides display significant bradykinin potentiating activity. Automated database search revealed fragments from several toxins in the peptidomes, mainly from l-amino acid oxidase, and allowed the determination of the peptide bond specificity of proteinases and amino acid occurrences for the P4-P4′ sites. We also demonstrate that the venom lyophilization/resolubilization process greatly increases the complexity of the peptidome because of the imbalance caused to the venom proteome and the consequent activity of proteinases on venom components. The use of proteinase inhibitors clearly showed different outcomes in the peptidome characterization and suggested that degradomic-peptidomic analysis of snake venoms is highly sensitive to the conditions of sampling procedures. PMID:22869554

  13. Purification and characterisation of an antifungal protein, MCha-Pr, from the intercellular fluid of bitter gourd (Momordica charantia) leaves.

    PubMed

    Zhang, Beibei; Xie, Chengjian; Wei, Yunming; Li, Jing; Yang, Xingyong

    2015-03-01

    An antifungal protein, designated MCha-Pr, was isolated from the intercellular fluid of bitter gourd (Momordica charantia) leaves during a screen for potent antimicrobial proteins from plants. The isolation procedure involved a combination of extraction, ammonium sulphate precipitation, gel filtration on Bio-Gel P-6, ion exchange chromatography on CM-Sephadex, an additional gel filtration on HiLoad 16/60 Superdex 30, and finally, HPLC on a SOURCE 5RPC column. Matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry indicated that the protein had a molecular mass of 25733.46Da. Automated Edman degradation was used to determine the N-terminal sequence of MCha-Pr, and the amino acid sequence was identified as V-E-Y-T-I-T-G-N-A-G-N-T-P-G-G. The MCha-Pr protein has some similarity to the pathogenesis-related proteins from Atropa belladonna (deadly nightshade), Solanum tuberosum (potato), Ricinus communis (castor bean), and Nicotiana tabacum (tobacco). Analysis of the circular dichroism spectra indicated that MCha-Pr predominantly contains α-helix and β-sheet structures. MCha-Pr had inhibitory effects towards a variety of fungal species and the 50% inhibition of fungal growth (IC50) for Alternaria brassicae, Cercospora personata, Fusarium oxysporum, Mucor sp., and Rhizoctonia solani are 33 μM, 42 μM, 37 μM, 40 μM, and 48 μM, respectively. In addition, this antifungal protein can inhibit the germination of A. brassicae spores at 12.5 μM. These results suggest that MCha-Pr in bitter gourd leaves plays a protective role against phytopathogens and has a wide antimicrobial spectrum. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. Purification and Characterization of Plantaricin JLA-9: A Novel Bacteriocin against Bacillus spp. Produced by Lactobacillus plantarum JLA-9 from Suan-Tsai, a Traditional Chinese Fermented Cabbage.

    PubMed

    Zhao, Shengming; Han, Jinzhi; Bie, Xiaomei; Lu, Zhaoxin; Zhang, Chong; Lv, Fengxia

    2016-04-06

    Bacteriocins are ribosomally synthesized peptides with antimicrobial activity produced by numerous bacteria. A novel bacteriocin-producing strain, Lactobacillus plantarum JLA-9, isolated from Suan-Tsai, a traditional Chinese fermented cabbage, was screened and identified by its physiobiochemical characteristics and 16S rDNA sequence analysis. A new bacteriocin, designated plantaricin JLA-9, was purified using butanol extraction, gel filtration, and reverse-phase high-performance liquid chromatography. The molecular mass of plantaricin JLA-9 was shown to be 1044 Da by MALDI-TOF-MS analyses. The amino acid sequence of plantaricin JLA-9 was predicted to be FWQKMSFA by MALDI-TOF-MS/MS, which was confirmed by Edman degradation. This bacteriocin exhibited broad-spectrum antibacterial activity against Gram-positive and Gram-negative bacteria, especially Bacillus spp., high thermal stability (20 min, 121 °C), and narrow pH stability (pH 2.0-7.0). It was sensitive to α-chymotrypsin, pepsin, alkaline protease, and papain. The mode of action of this bacteriocin responsible for outgrowth inhibition of Bacillus cereus spores was studied. Plantaricin JLA-9 had no detectable effects on germination initiation over 1 h on monitoring the hydration, heat resistance, and 2,6-pyridinedicarboxylic acid (DPA) release of spores. Rather, germination initiation is a prerequisite for the action of plantaricin JLA-9. Plantaricin JLA-9 inhibited growth by preventing the establishment of oxidative metabolism and disrupting membrane integrity in germinating spores within 2 h. The results suggest that plantaricin JLA-9 has potential applications in the control of Bacillus spp. in the food industry.

  15. Bioinsecticidal activity of a novel Kunitz trypsin inhibitor from Catanduva (Piptadenia moniliformis) seeds.

    PubMed

    Cruz, Ana C B; Massena, Fábio S; Migliolo, Ludovico; Macedo, Leonardo L P; Monteiro, Norberto K V; Oliveira, Adeliana S; Macedo, Francisco P; Uchoa, Adriana F; Grossi de Sá, Maria F; Vasconcelos, Ilka M; Murad, Andre M; Franco, Octavio L; Santos, Elizeu A

    2013-09-01

    The present study aims to provide new in vitro and in vivo biochemical information about a novel Kunitz trypsin inhibitor purified from Piptadenia moniliformis seeds. The purification process was performed using TCA precipitation, Trypsin-Sepharose and reversed-phase C18 HPLC chromatography. The inhibitor, named PmTKI, showed an apparent molecular mass of around 19 kDa, visualized by SDS-PAGE, which was confirmed by mass spectrometry MALDI-ToF demonstrating a monoisotopic mass of 19.296 Da. The inhibitor was in vitro active against trypsin, chymotrypsin and papain. Moreover, kinetic enzymatic studies were performed aiming to understand the inhibition mode of PmTKI, which competitively inhibits the target enzyme, presenting Ki values of 1.5 × 10(-8) and 3.0 × 10(-1) M against trypsin and chymotrypsin, respectively. Also, the inhibitory activity was assayed at different pH ranges, temperatures and reduction environments (DTT). The inhibitor was stable in all conditions maintaining an 80% residual activity. N-terminal sequence was obtained by Edman degradation and the primary sequence presented identity with members of Kunitz-type inhibitors from the same subfamily. Finally after biochemical characterization the inhibitory effect was evaluated in vitro on insect digestive enzymes from different orders, PmTKI demonstrated remarkable activity against enzymes from Anthonomus grandis (90%), Plodia interpuncptella (60%), and Ceratitis capitata (70%). Furthermore, in vivo bioinsecticidal assays of C. capitata larvae were also performed and the concentration of PmTKI (w/w) in an artificial diet required to LD50 and ED50 larvae were 0.37 and 0.3% respectively. In summary, data reported here shown the biotechnological potential of PmTKI for insect pest control. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  16. Design and testing for a nontagged F1-V fusion protein as vaccine antigen against bubonic and pneumonic plague.

    PubMed

    Powell, Bradford S; Andrews, Gerard P; Enama, Jeffrey T; Jendrek, Scott; Bolt, Chris; Worsham, Patricia; Pullen, Jeffrey K; Ribot, Wilson; Hines, Harry; Smith, Leonard; Heath, David G; Adamovicz, Jeffrey J

    2005-01-01

    A two-component recombinant fusion protein antigen was re-engineered and tested as a medical counter measure against the possible biological threat of aerosolized Yersinia pestis. The active component of the proposed subunit vaccine combines the F1 capsular protein and V virulence antigen of Y. pestis and improves upon the design of an earlier histidine-tagged fusion protein. In the current study, different production strains were screened for suitable expression and a purification process was optimized to isolate an F1-V fusion protein absent extraneous coding sequences. Soluble F1-V protein was isolated to 99% purity by sequential liquid chromatography including capture and refolding of urea-denatured protein via anion exchange, followed by hydrophobic interaction, concentration, and then transfer into buffered saline for direct use after frozen storage. Protein identity and primary structure were verified by mass spectrometry and Edman sequencing, confirming a purified product of 477 amino acids and removal of the N-terminal methionine. Purity, quality, and higher-order structure were compared between lots using RP-HPLC, intrinsic fluorescence, CD spectroscopy, and multi-angle light scattering spectroscopy, all of which indicated a consistent and properly folded product. As formulated with aluminum hydroxide adjuvant and administered in a single subcutaneous dose, this new F1-V protein also protected mice from wild-type and non-encapsulated Y. pestis challenge strains, modeling prophylaxis against pneumonic and bubonic plague. These findings confirm that the fusion protein architecture provides superior protection over the former licensed product, establish a foundation from which to create a robust production process, and set forth assays for the development of F1-V as the active pharmaceutical ingredient of the next plague vaccine.

  17. Turkish scorpion Buthacus macrocentrus: general characterization of the venom and description of Bu1, a potent mammalian Na⁺-channel α-toxin.

    PubMed

    Caliskan, F; Quintero-Hernández, V; Restano-Cassulini, R; Batista, C V F; Zamudio, F Z; Coronas, F I; Possani, L D

    2012-03-01

    The venom of the scorpion Buthacus macrocentrus of Turkey was fractionated by high performance liquid chromatography (HPLC) and its mass finger print analysis was obtained by spectrometry. More than 70 different fractions were obtained, allowing the determination of the molecular masses of at least 60 peptides ranging between 648 and 44,336 Da. The venom is enriched with peptides containing molecular masses between 3200-4500 Da, and 6000-7500 Da. They very likely correspond to K⁺-channel and Na⁺-channel specific peptides, respectively, as expected from venoms of scorpions of the family Buthidae, already determined for other species. The major component obtained from HPLC was shown to be lethal to mice and was further purified and characterized. It contains 65 amino acid residues maintained closely packed by 4 disulfide bridges, and shows a molecular weight of 7263 Da. Additionally, a cDNA from the venomous glands of this scorpion was used in conjunction with sequence data from Edman degradation and mass spectrometry for cloning the gene that codes for Bu1 as we named this toxin. This gene codes for a 67 amino acid residues peptide, where the two last are eliminated post-translationally for production of an amidated C-terminal arginine. Its sequence is closely related to toxins from the species Leiurus quinquestriatus, as revealed by a phylogenetic tree analysis. Electrophysiological results conducted with Bu1 using patch-clamp techniques indicate that it modifies the Na⁺ currents, in a similar way as other well known α-scorpion toxins. These results support the conclusion that this species of scorpions is dangerous to humans, having an epidemiological interest for the country. Copyright © 2012 Elsevier Ltd. All rights reserved.

  18. Purification and Characterization of a Novel Anti-Campylobacter Bacteriocin Produced by Lactobacillus curvatus DN317.

    PubMed

    Zommiti, Mohamed; Almohammed, Hamdan; Ferchichi, Mounir

    2016-12-01

    The lactic acid bacteria (LAB) microbiota of Saudi chicken ceca was determined. From 60 samples, 204 isolates of lactic acid bacteria were obtained. Three isolates produced antimicrobial activities against Campylobacter jejuni, Listeria monocytogenes, and Bacillus subtilis. The isolate DN317, which had the highest activity against Campylobacter jejuni ATCC 33560, was identified as Lactobacillus curvatus (GenBank accession numbers: KX353849 and KX353850). Full inhibitory activity was observed after a 2-h incubation with the supernatant at pH values between 4 and 8. Only 16% of the activity was conserved after a treatment at 121 °C for 15 min. The use of proteinase K, pepsin, chymotrypsin, trypsin, papain, and lysozyme drastically reduced the antimicrobial activity. However, lipase, catalase, and lysozyme had no effect on this activity. The active peptide produced by Lactobacillus curvatus DN317 was purified by precipitation with an 80% saturated ammonium sulfate solution, and two steps of reversed phase HPLC on a C18 column. The molecular weight of this peptide was 4448 Da as determined by MALDI-ToF. N-terminal sequence analysis using Edman degradation revealed 47 amino acid residues (UniProt Knowledgebase accession number C0HK82) revealing homology with the amino acid sequences of sakacin P and curvaticin L442. The antimicrobial activity of the bacteriocin, namely curvaticin DN317, was found to be bacteriostatic against Campylobacter jejuni ATCC 33560. The use of microbial antagonism by LAB is one of the best ways to control microorganisms safely in foods. This result constitutes a reasonable advance in the antimicrobial field because of its potential applications in food technology.

  19. Tri-domain Bifunctional Inhibitor of Metallocarboxypeptidases A and Serine Proteases Isolated from Marine Annelid Sabellastarte magnifica*

    PubMed Central

    Alonso-del-Rivero, Maday; Trejo, Sebastian A.; Reytor, Mey L.; Rodriguez-de-la-Vega, Monica; Delfin, Julieta; Diaz, Joaquin; González-González, Yamile; Canals, Francesc; Chavez, Maria Angeles; Aviles, Francesc X.

    2012-01-01

    This study describes a novel bifunctional metallocarboxypeptidase and serine protease inhibitor (SmCI) isolated from the tentacle crown of the annelid Sabellastarte magnifica. SmCI is a 165-residue glycoprotein with a molecular mass of 19.69 kDa (mass spectrometry) and 18 cysteine residues forming nine disulfide bonds. Its cDNA was cloned and sequenced by RT-PCR and nested PCR using degenerated oligonucleotides. Employing this information along with data derived from automatic Edman degradation of peptide fragments, the SmCI sequence was fully characterized, indicating the presence of three bovine pancreatic trypsin inhibitor/Kunitz domains and its high homology with other Kunitz serine protease inhibitors. Enzyme kinetics and structural analyses revealed SmCI to be an inhibitor of human and bovine pancreatic metallocarboxypeptidases of the A-type (but not B-type), with nanomolar Ki values. SmCI is also capable of inhibiting bovine pancreatic trypsin, chymotrypsin, and porcine pancreatic elastase in varying measures. When the inhibitor and its nonglycosylated form (SmCI N23A mutant) were overproduced recombinantly in a Pichia pastoris system, they displayed the dual inhibitory properties of the natural form. Similarly, two bi-domain forms of the inhibitor (recombinant rSmCI D1-D2 and rSmCI D2-D3) as well as its C-terminal domain (rSmCI-D3) were also overproduced. Of these fragments, only the rSmCI D1-D2 bi-domain retained inhibition of metallocarboxypeptidase A but only partially, indicating that the whole tri-domain structure is required for such capability in full. SmCI is the first proteinaceous inhibitor of metallocarboxypeptidases able to act as well on another mechanistic class of proteases (serine-type) and is the first of this kind identified in nature. PMID:22411994

  20. Actiflagelin, a new sperm activator isolated from Walterinnesia aegyptia venom using phenotypic screening.

    PubMed

    Abd El-Aziz, Tarek Mohamed; Al Khoury, Sawsan; Jaquillard, Lucie; Triquigneaux, Mathilde; Martinez, Guillaume; Bourgoin-Voillard, Sandrine; Sève, Michel; Arnoult, Christophe; Beroud, Rémy; De Waard, Michel

    2018-01-01

    Sperm contains a wealth of cell surface receptors and ion channels that are required for most of its basic functions such as motility and acrosome reaction. Conversely, animal venoms are enriched in bioactive compounds that primarily target those ion channels and cell surface receptors. We hypothesized, therefore, that animal venoms should be rich enough in sperm-modulating compounds for a drug discovery program. Our objective was to demonstrate this fact by using a sperm-based phenotypic screening to identify positive modulators from the venom of Walterinnesia aegyptia . Herein, as proof of concept that venoms contain interesting compounds for sperm physiology, we fractionated Walterinnesia aegyptia snake venom by RP-HPLC and screened for bioactive fractions capable of accelerating mouse sperm motility (primary screening). Next, we purified each compound from the positive fraction by cation exchange and identified the bioactive peptide by secondary screening. The peptide sequence was established by Edman sequencing of the reduced/alkylated compound combined to LC-ESI-QTOF MS/MS analyses of reduced/alkylated fragment peptides following trypsin or V8 protease digestion. Using this two-step purification protocol combined to cell phenotypic screening, we identified a new toxin of 7329.38 Da (actiflagelin) that activates sperm motility in vitro from OF1 male mice. Actiflagelin is 63 amino acids in length and contains five disulfide bridges along the proposed pattern of disulfide connectivity C 1 -C 5 , C 2 -C 3 , C 4 -C 6 , C 7 -C 8 and C 9 -C 10 . Modeling of its structure suggests that it belongs to the family of three finger toxins with a noticeable homology with bucandin, a peptide from Bungarus candidus venom. This report demonstrates the feasibility of identifying profertility compounds that may be of therapeutic potential for infertility cases where motility is an issue.

  1. Blood-feeding Behaviors of Anopheles stephensi But Not Phlebotomus papatasi are Influenced by Actively Warming Guinea Pigs (Cavia porcellus) Under General Anesthesia

    DTIC Science & Technology

    2015-01-01

    response in mosquitoes. Proc Natl Acad Sci 108:8026–8029. Davis EE, Sokolove PG. 1975. Temperature responses of antennal receptors of the mosquito, Aedes ...books/NBK54050/ Peterson D, Brown A. 1951. Studies of the responses of the female Aedes mosquito. Part III. The response of Aedes aegypti (L.) to a warm...Hyg 33:1232–1238. Walker ED, Edman JD. 1985. Feeding-site selection and blood-feeding behavior of Aedes triseriatus (Diptera: Culicidae) on rodent

  2. A photoreceptor calcium binding protein is recognized by autoantibodies obtained from patients with cancer-associated retinopathy

    PubMed Central

    1991-01-01

    Cancer-associated retinopathy (CAR), a paraneoplastic syndrome, is characterized by the degeneration of retinal photoreceptors under conditions where the tumor and its metastases have not invaded the eye. The retinopathy often is apparent before the diagnosis of cancer and may be associated with autoantibodies that react with specific sites in the retina. We have examined the sera from patients with CAR to further characterize the retinal antigen. Western blot analysis of human retinal proteins reveals a prominent band at 26 kD that is labeled by the CAR antisera. Antibodies to the 26-kD protein were affinity- purified from complex CAR antisera and used for EM-immunocytochemical localization of the protein to the nuclei, inner and outer segments of both rod and cone cells. Other antibodies obtained from the CAR sera did not label photoreceptors. Using the affinity-purified antibodies for detection, the 26-kD protein, designated p26, was purified to homogeneity from the outer segments of bovine rod photoreceptor cells by Phenyl-Sepharose and ion exchange chromatography. Partial amino acid sequence of p26 was determined by gas phase Edman degradation and revealed extensive homology with a cone-specific protein, visinin. Based upon structural relatedness, both the p26 rod protein and visinin are members of the calmodulin family and contain calcium binding domains of the E-F hand structure. PMID:1999465

  3. Isolation and structure elucidation of neuropeptides of the AKH/RPCH family in long-horned grasshoppers (Ensifera).

    PubMed

    Gäde, G

    1992-11-01

    An identical neuropeptide was isolated by reversed-phase high-performance liquid chromatography from the corpora cardiaca of the king cricket, Libanasidus vittatus, and the two armoured ground crickets, Heterodes namaqua and Acanthoproctus cervinus. The crude gland extracts had adipokinetic activity in migratory locusts, hypertrehalosaemic activity in American cockroaches and a slight hypertrehalosaemic, but no adipokinetic, effect in armoured ground crickets. The primary structure of this neuropeptide was determined by pulsed-liquid phase sequencing employing Edman chemistry after enzymically deblocking the N-terminal 5-oxopyrrolidine-2-carboxylic acid residue. The C-terminus was also blocked, as indicated by the lack of digestion by carboxypeptidase A. The peptide was assigned the structure [symbol: see text]Glu-Leu-Asn-Phe-Ser-Thr-Gly-TrpNH2, previously designated Scg-AKH-II. The corpora cardiaca of the cricket Gryllodes sigillatus contained a neuropeptide which differed in retention time from the one isolated from the king and armoured ground crickets. The structure was assigned as [symbol: see text]Glu-Val-Asn-Phe-Ser-Thr-Gly-TrpNH2, previously designated Grb-AKH. This octapeptide caused hyperlipaemia in its donor species. The presence of the same peptide, Scg-AKH-II, in the two primitive infraorders of Ensifera, and the different peptide, Grb-AKH, in the most advanced infraorder of Ensifera, supports the evolutionary trends assigned formerly from morphological and physiological evidence.

  4. Isolation and characterization of a leech neuropeptide in rat brains: coupling to nitric oxide release in leech, rat and human tissues.

    PubMed

    Salzet, M; Salzet, B; Sáutière, P; Lésage, J; Beauvillain, J C; Bilfinger, T V; Rialas, C; Bjenning, C; Stefano, G B

    1998-03-30

    The osmoregulator peptide (leech osmoregulatory factor, LORF; IPEPYVWD) was first found in the leech central nervous system (CNS). Given the fact that certain peptides can be found in mammals and invertebrates, e.g., opioid, we examined rat brains to determine if LORF was present. This peptide was found and isolated by successive reversed-phase HPLC purification steps and characterized by electrospray mass spectrometry measurement. It was sequenced by Edman degradation and quantified in different tissues by ELISA. Our results demonstrate the presence of LORF in the hypothalamus, thalamus, and striatum (6 pmol/mg of protein extract) and in other brain areas at lower levels. This octapeptide is also present in the rat duodenum and liver (10 to 14 pmol/mg) and at lower levels in heart, lung, pancreas and caudal spinal cord (< 5 pmol/mg). The testes, adrenals and kidneys have the lowest levels of all the tissues examined (ca. 0.5 pmol/mg of protein). Furthermore, we also demonstrate that LORF is coupled to nitric oxide (NO) release in leech CNS, rat hypothalamus and human saphenous vein in a manner which is inhibited by a nitric oxide synthase inhibitor as well as an antibody directed toward LORF. The study demonstrates that LORF, and its function in relation to NO release, has been conserved over more than 400 million years of evolution.

  5. A broad-spectrum antimicrobial activity of Bacillus subtilis RLID 12.1.

    PubMed

    Ramachandran, Ramya; Chalasani, Ajay Ghosh; Lal, Ram; Roy, Utpal

    2014-01-01

    In the present study, an attempt was made to biochemically characterize the antimicrobial substance from the soil isolate designated as RLID 12.1 and explore its potential applications in biocontrol of drug-resistant pathogens. The antimicrobial potential of the wild-type isolate belonging to the genus Bacillus was determined by the cut-well agar assay. The production of antimicrobial compound was recorded maximum at late exponential growth phase. The ultrafiltered concentrate was insensitive to organic solvents, metal salts, surfactants, and proteolytic and nonproteolytic enzymes. The concentrate was highly heat stable and active over a wide range of pH values. Partial purification, zymogram analysis, and TLC were performed to determine the preliminary biochemical nature. The molecular weight of the antimicrobial peptide was determined to be less than 2.5 kDa in 15% SDS-PAGE and in zymogram analysis against Streptococcus pyogenes. The N-terminal amino acid sequence by Edman degradation was partially determined to be T-P-P-Q-S-X-L-X-X-G, which shows very insignificant identity to other antimicrobial peptides from bacteria. The minimum inhibitory concentrations of dialysed and partially purified ion exchange fractions were determined against some selected gram-positive and gram-negative bacteria and some pathogenic yeasts. The presence of three important antimicrobial peptide biosynthesis genes ituc, fend, and bmyb was determined by PCR.

  6. New Kunitz-Type HCRG Polypeptides from the Sea Anemone Heteractis crispa

    PubMed Central

    Gladkikh, Irina; Monastyrnaya, Margarita; Zelepuga, Elena; Sintsova, Oksana; Tabakmakher, Valentin; Gnedenko, Oksana; Ivanov, Alexis; Hua, Kuo-Feng; Kozlovskaya, Emma

    2015-01-01

    Sea anemones are a rich source of Kunitz-type polypeptides that possess not only protease inhibitor activity, but also Kv channels toxicity, analgesic, antihistamine, and anti-inflammatory activities. Two Kunitz-type inhibitors belonging to a new Heteractis crispa RG (HCRG) polypeptide subfamily have been isolated from the sea anemone Heteractis crispa. The amino acid sequences of HCRG1 and HCRG2 identified using the Edman degradation method share up to 95% of their identity with the representatives of the HCGS polypeptide multigene subfamily derived from H. crispa cDNA. Polypeptides are characterized by positively charged Arg at the N-terminus as well as P1 Lys residue at their canonical binding loop, identical to those of bovine pancreatic trypsin inhibitor (BPTI). These polypeptides are shown by our current evidence to be more potent inhibitors of trypsin than the known representatives of the HCGS subfamily with P1Thr. The kinetic and thermodynamic characteristics of the intermolecular interactions between inhibitors and serine proteases were determined by the surface plasmon resonance (SPR) method. Residues functionally important for polypeptide binding to trypsin were revealed using molecular modeling methods. Furthermore, HCRG1 and HCRG2 possess anti-inflammatory activity, reducing tumor necrosis factor-α (TNF-α) and interleukin 6 (IL-6) secretions, as well as proIL-1β expression in lipopolysaccharide (LPS)-activated macrophages. However, there was no effect on nitric oxide (NO) generation. PMID:26404319

  7. Ligatoxin B, a new cytotoxic protein with a novel helix-turn-helix DNA-binding domain from the mistletoe Phoradendron liga.

    PubMed Central

    Li, Shi-Sheng; Gullbo, Joachim; Lindholm, Petra; Larsson, Rolf; Thunberg, Eva; Samuelsson, Gunnar; Bohlin, Lars; Claeson, Per

    2002-01-01

    A new basic protein, designated ligatoxin B, containing 46 amino acid residues has been isolated from the mistletoe Phoradendron liga (Gill.) Eichl. (Viscaceae). The protein's primary structure, determined unambiguously using a combination of automated Edman degradation, trypsin enzymic digestion, and tandem MS analysis, was 1-KSCCPSTTAR-NIYNTCRLTG-ASRSVCASLS-GCKIISGSTC-DSGWNH-46. Ligatoxin B exhibited in vitro cytotoxic activities on the human lymphoma cell line U-937-GTB and the primary multidrug-resistant renal adenocarcinoma cell line ACHN, with IC50 values of 1.8 microM and 3.2 microM respectively. Sequence alignment with other thionins identified a new member of the class 3 thionins, ligatoxin B, which is similar to the earlier described ligatoxin A. As predicted by the method of homology modelling, ligatoxin B shares a three-dimensional structure with the viscotoxins and purothionins and so may have the same mode of cytotoxic action. The novel similarities observed by structural comparison of the helix-turn-helix (HTH) motifs of the thionins, including ligatoxin B, and the HTH DNA-binding proteins, led us to propose the working hypothesis that thionins represent a new group of DNA-binding proteins. This working hypothesis could be useful in further dissecting the molecular mechanisms of thionin cytotoxicity and of thionin opposition to multidrug resistance, and useful in clarifying the physiological function of thionins in plants. PMID:12049612

  8. Characterization of a Cadmium-Binding Complex of Cabbage Leaves 1

    PubMed Central

    Wagner, George J.

    1984-01-01

    The chemical nature of a principal, inducible cadmium-binding complex which accumulates in cabbage leaves (Wagner and Trotter 1982 Plant Physiol 69: 804-809) was studied and compared with that of animal metallothionein and copper-binding proteins isolated from various organisms. The apparent molecular weight of native cabbage complex and carboxymethylated ligand of the complex under native conditions as determined by gel filtration was about 10,000 daltons. Under denaturing conditions their apparent molecular weights were about 2000 daltons. Ligand of native complex contained 37, 28, and 9 residue per cent of glutamic acid-glutamine, cysteine, and glycine, respectively, and low aromatic residue, serine and lysine content. The high acidic and low hydrophobic residue content explain the behavior of complex on electrophoresis in the presence and absence of sodium dodecyl sulfate. Its isoelectric point was below 4.0 and it bound 4 to 6 moles cadmium per mole ligand in what appear to be cadmium-mercaptide chromophores. The complex was found to be heat stable, relatively protease insensitive, and lacking in disulfide bonds. Attempts to determine the primary sequence of reduced native complex and carboxymethylated, cleaved ligand using the Edman degradation procedure were unsuccessful. An electrophoretic procedure is described for preparative isolation of purified complex and a method is described for monitoring ligand of complex as its fluorescent dibromobimane adduct. Images Fig. 1 Fig. 3 PMID:16663927

  9. Substrate specificity of mitochondrial intermediate peptidase analysed by a support-bound peptide library

    PubMed Central

    Marcondes, M.F.M.; Alves, F.M.; Assis, D.M.; Hirata, I.Y.; Juliano, L.; Oliveira, V.; Juliano, M.A.

    2015-01-01

    The substrate specificity of recombinant human mitochondrial intermediate peptidase (hMIP) using a synthetic support-bound FRET peptide library is presented. The collected fluorescent beads, which contained the hydrolysed peptides generated by hMIP, were sequenced by Edman degradation. The results showed that this peptidase presents a remarkable preference for polar uncharged residues at P1 and P1′ substrate positions: Ser = Gln > Thr at P1 and Ser > Thr at P1′. Non-polar residues were frequent at the substrate P3, P2, P2′ and P3′ positions. Analysis of the predicted MIP processing sites in imported mitochondrial matrix proteins shows these cleavages indeed occur between polar uncharged residues. Previous analysis of these processing sites indicated the importance of positions far from the MIP cleavage site, namely the presence of a hydrophobic residue (Phe or Leu) at P8 and a polar uncharged residue (Ser or Thr) at P5. To evaluate this, additional kinetic analyses were carried out, using fluorogenic substrates synthesized based on the processing sites attributed to MIP. The results described here underscore the importance of the P1 and P1′ substrate positions for the hydrolytic activity of hMIP. The information presented in this work will help in the design of new substrate-based inhibitors for this peptidase. PMID:26082885

  10. Identification of E-cadherin signature motifs functioning as cleavage sites for Helicobacter pylori HtrA

    NASA Astrophysics Data System (ADS)

    Schmidt, Thomas P.; Perna, Anna M.; Fugmann, Tim; Böhm, Manja; Jan Hiss; Haller, Sarah; Götz, Camilla; Tegtmeyer, Nicole; Hoy, Benjamin; Rau, Tilman T.; Neri, Dario; Backert, Steffen; Schneider, Gisbert; Wessler, Silja

    2016-03-01

    The cell adhesion protein and tumour suppressor E-cadherin exhibits important functions in the prevention of gastric cancer. As a class-I carcinogen, Helicobacter pylori (H. pylori) has developed a unique strategy to interfere with E-cadherin functions. In previous studies, we have demonstrated that H. pylori secretes the protease high temperature requirement A (HtrA) which cleaves off the E-cadherin ectodomain (NTF) on epithelial cells. This opens cell-to-cell junctions, allowing bacterial transmigration across the polarised epithelium. Here, we investigated the molecular mechanism of the HtrA-E-cadherin interaction and identified E-cadherin cleavage sites for HtrA. Mass-spectrometry-based proteomics and Edman degradation revealed three signature motifs containing the [VITA]-[VITA]-x-x-D-[DN] sequence pattern, which were preferentially cleaved by HtrA. Based on these sites, we developed a substrate-derived peptide inhibitor that selectively bound and inhibited HtrA, thereby blocking transmigration of H. pylori. The discovery of HtrA-targeted signature sites might further explain why we detected a stable 90 kDa NTF fragment during H. pylori infection, but also additional E-cadherin fragments ranging from 105 kDa to 48 kDa in in vitro cleavage experiments. In conclusion, HtrA targets E-cadherin signature sites that are accessible in in vitro reactions, but might be partially masked on epithelial cells through functional homophilic E-cadherin interactions.

  11. Angiotensin I-converting-enzyme-inhibitory and antibacterial peptides from Lactobacillus helveticus PR4 proteinase-hydrolyzed caseins of milk from six species.

    PubMed

    Minervini, F; Algaron, F; Rizzello, C G; Fox, P F; Monnet, V; Gobbetti, M

    2003-09-01

    Sodium caseinates prepared from bovine, sheep, goat, pig, buffalo or human milk were hydrolyzed by a partially purified proteinase of Lactobacillus helveticus PR4. Peptides in each hydrolysate were fractionated by reversed-phase fast-protein liquid chromatography. The fractions which showed the highest angiotensin I-converting-enzyme (ACE)-inhibitory or antibacterial activity were sequenced by mass spectrum and Edman degradation analyses. Various ACE-inhibitory peptides were found in the hydrolysates: the bovine alpha(S1)-casein (alpha(S1)-CN) 24-47 fragment (f24-47), f169-193, and beta-CN f58-76; ovine alpha(S1)-CN f1-6 and alpha(S2)-CN f182-185 and f186-188; caprine beta-CN f58-65 and alpha(S2)-CN f182-187; buffalo beta-CN f58-66; and a mixture of three tripeptides originating from human beta-CN. A mixture of peptides with a C-terminal sequence, Pro-Gly-Pro, was found in the most active fraction of the pig sodium caseinate hydrolysate. The highest ACE-inhibitory activity of some peptides corresponded to the concentration of the ACE inhibitor (S)-N-(1-[ethoxycarbonyl]-3-phenylpropyl)-ala-pro maleate (enalapril) of 49.253 micro g/ml (100 micro mol/liter). Several of the above sequences had features in common with other ACE-inhibitory peptides reported in the literature. The 50% inhibitory concentration (IC(50)) of some of the crude peptide fractions was very low (16 to 100 micro g/ml). Some identified peptides were chemically synthesized, and the ACE-inhibitory activity and IC(50)s were confirmed. An antibacterial peptide corresponding to beta-CN f184-210 was identified in human sodium caseinate hydrolysate. It showed a very large spectrum of inhibition against gram-positive and -negative bacteria, including species of potential clinical interest, such as Enterococcus faecium, Bacillus megaterium, Escherichia coli, Listeria innocua, Salmonella spp., Yersinia enterocolitica, and Staphylococcus aureus. The MIC for E. coli F19 was ca. 50 micro g/ml. Once generated, the bioactive peptides were resistant to further degradation by proteinase of L. helveticus PR4 or by trypsin and chymotrypsin.

  12. Characterization of the first K⁺ channel blockers from the venom of the Moroccan scorpion Buthus occitanus Paris.

    PubMed

    Martin-Eauclaire, Marie-France; Céard, Brigitte; Belghazi, Maya; Lebrun, Régine; Bougis, Pierre E

    2013-12-01

    The availability of a large variety of specific blockers, which inhibit different K(+) currents, would help to elucidate their differences in physiological function. Short peptide toxins isolated from scorpion venoms are able to block voltage-dependent or Ca(2+)-activated K(+) channels. Here, we have studied the venom of the Moroccan scorpion Buthus occitanus Paris (BoP) in order to find new peptides, which could enlarge our structure-function relationship knowledge on the Kv1.3 blocker Kaliotoxin (KTX) that belongs to the α-KTx3.1 family. Indeed and since more a decade, KTX is widely used by international investigators because it exhibits a quite sharp specificity and a high-affinity for the Kv1.3 channel, which is not only a neuronal channel but also a therapeutic target for diverse autoimmune diseases such as multiple sclerosis, type 1 diabetes, and rheumatoid arthritis. The BoP venom was first investigated using HPLC and MALDI-TOF/MS. Further, the HPLC fractions were screened by ELISA with antibodies raised against KTX. These antibodies recognized at least three components toxic in mice by intracerebroventricular injection. They were further pharmacologically characterized by competition using (125)I-KTX bound to its specific binding sites on rat brain synaptosomes. A single component (4161 Da) inhibited totally the (125)I-KTX binding and with high-affinity (IC50 = 0.1 nM), while the two other components poorly competed with (IC50 > 100 nM). These toxins were sequenced in full by Edman's degradation. The high-affinity ligand (BoPKTX) shares 86% sequence identity with KTX and was classified as toxin α-KTx3.17. The two others peptides (BoP1 and BoP2, 4093 Da and 4121 Da, respectively) only differ by a Lys/Arg mutation. Their amino acid sequences were related to Martentoxin, which has been characterized from the Chinese scorpion Buthus martenzi Karch and described as both a BKCa and Kv1.3 blocker. Accordingly, they belong to the α-KTx16 family. Copyright © 2013 Elsevier Ltd. All rights reserved.

  13. 6-phospho-alpha-D-glucosidase from Fusobacterium mortiferum: cloning, expression, and assignment to family 4 of the glycosylhydrolases.

    PubMed Central

    Bouma, C L; Reizer, J; Reizer, A; Robrish, S A; Thompson, J

    1997-01-01

    The Fusobacterium mortiferum malH gene, encoding 6-phospho-alpha-glucosidase (maltose 6-phosphate hydrolase; EC 3.2.1.122), has been isolated, characterized, and expressed in Escherichia coli. The relative molecular weight of the polypeptide encoded by malH (441 residues; Mr of 49,718) was in agreement with the estimated value (approximately 49,000) obtained by sodium dodecyl sulfate-polyacrylamide gel electrophoresis for the enzyme purified from F. mortiferum. The N-terminal sequence of the MalH protein obtained by Edman degradation corresponded to the first 32 amino acids deduced from the malH sequence. The enzyme produced by the strain carrying the cloned malH gene cleaved [U-14C]maltose 6-phosphate to glucose 6-phosphate (Glc6P) and glucose. The substrate analogs p-nitrophenyl-alpha-D-glucopyranoside 6-phosphate (pNP alphaGlc6P) and 4-methylumbelliferyl-alpha-D-glucopyranoside 6-phosphate (4MU alphaGlc6P) were hydrolyzed to yield Glc6P and the yellow p-nitrophenolate and fluorescent 4-methylumbelliferyl aglycons, respectively. The 6-phospho-alpha-glucosidase expressed in E. coli (like the enzyme purified from F. mortiferum) required Fe2+, Mn2+, Co2+, or Ni2+ for activity and was inhibited in air. Synthesis of maltose 6-phosphate hydrolase from the cloned malH gene in E. coli was modulated by addition of various sugars to the growth medium. Computer-based analyses of MalH and its homologs revealed that the phospho-alpha-glucosidase from F. mortiferum belongs to the seven-member family 4 of the glycosylhydrolase superfamily. The cloned 2.2-kb Sau3AI DNA fragment from F. mortiferum contained a second partial open reading frame of 83 residues (designated malB) that was located immediately upstream of malH. The high degree of sequence identity of MalB with IIB(Glc)-like proteins of the phosphoenol pyruvate dependent:sugar phosphotransferase system suggests participation of MalB in translocation of maltose and related alpha-glucosides in F. mortiferum. PMID:9209025

  14. LC-MS/MS screening strategy for unknown adducts to N-terminal valine in hemoglobin applied to smokers and nonsmokers.

    PubMed

    Carlsson, Henrik; von Stedingk, Hans; Nilsson, Ulrika; Törnqvist, Margareta

    2014-12-15

    Electrophilically reactive compounds have the ability to form adducts with nucleophilic sites in DNA and proteins, constituting a risk for toxic effects. Mass spectrometric detection of adducts to N-terminal valine in hemoglobin (Hb) after detachment by modified Edman degradation procedures is one approach for in vivo monitoring of exposure to electrophilic compounds/metabolites. So far, applications have been limited to one or a few selected reactive species, such as acrylamide and its metabolite glycidamide. This article presents a novel screening strategy for unknown Hb adducts to be used as a basis for an adductomic approach. The method is based on a modified Edman procedure, FIRE, specifically developed for LC-MS/MS analysis of N-terminal valine adducts in Hb detached as fluorescein thiohydantoin (FTH) derivatives. The aim is to detect and identify a priori unknown Hb adducts in human blood samples. Screening of valine adducts was performed by stepwise scanning of precursor ions in small mass increments, monitoring four fragments common for the FTH derivative of valine with different N-substitutions in the multiple-reaction mode, covering a mass range of 135 Da (m/z 503-638). Samples from six smokers and six nonsmokers were analyzed. Control experiments were performed to compare these results with known adducts and to check for artifactual formation of adducts. In all samples of smokers and nonsmokers, seven adducts were identified, of which six have previously been studied. Nineteen unknown adducts were observed, and 14 of those exhibited fragmentation patterns similar to earlier studied FTH derivatives of adducts to valine. Identification of the unknown adducts will be the focus of future work. The presented methodology is a promising screening tool using Hb adducts to indicate exposure to potentially toxic electrophilic compounds and metabolites.

  15. Atypical Genetic Locus Associated with Constitutive Production of Enterocin B by Enterococcus faecium BFE 900

    PubMed Central

    Franz, Charles M. A. P.; Worobo, Randy W.; Quadri, Luis E. N.; Schillinger, Ulrich; Holzapfel, Wilhelm H.; Vederas, John C.; Stiles, Michael E.

    1999-01-01

    A purified bacteriocin produced by Enterococcus faecium BFE 900 isolated from black olives was shown by Edman degradation and mass spectrometric analyses to be identical to enterocin B produced by E. faecium T136 from meat (P. Casaus, T. Nilsen, L. M. Cintas, I. F. Nes, P. E. Hernández, and H. Holo, Microbiology 143:2287–2294, 1997). The structural gene was located on a 2.2-kb HindIII fragment and a 12.0-kb EcoRI chromosomal fragment. The genetic characteristics and production of EntB by E. faecium BFE 900 differed from that described so far by the presence of a conserved sequence like a regulatory box upstream of the EntB gene, and its production was constitutive and not regulated. The 2.2-kb chromosomal fragment contained the hitherto undetected immunity gene for EntB in an atypical orientation that is the reverse of that of the structural gene. Typical transport and other genes associated with bacteriocin production were not detected on the 12.0-kb chromosomal fragment containing the EntB structural gene. This makes the EntB genetic system different from most other bacteriocin systems, where transport and possible regulatory genes are clustered. EntB was subcloned and expressed by the dedicated secretion machinery of Carnobacterium piscicola LV17A. The structural gene was amplified by PCR, fused to the divergicin A signal peptide, and expressed by the general secretory pathway in Enterococcus faecalis ATCC 19433. PMID:10224016

  16. Structural and mechanistic studies on. beta. -hydroxydecanoly thioester dehydrase and its inhibition by the suicide substrate, 3-decynoic acid, n-acetylcysteamine thioester

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, W.B.

    ..beta..-Hydroxydecanoyl thioester dehydrase catalyzes the interconversion of thioesters of (R)-3-hydroxydecanoic acid, (E)-2-decenoic acid, and (Z)-3-decenoic acid. Dehydrase is irreversibly inactivated by the N-acetylcysteamine thioester of 3-decynoic acid (3-decynoyl-NAC). This is the classic example of suicide enzyme inactivation. The structure of the dehydrase-inactivator adduct is still unclear. The purpose of this thesis is to determine the structure of the inactivator moiety and the stoichiometry of the inactivation of this dimeric enzyme, as well as to conduct structural studies on dehydrase itself. 3-(2-/sup 13/C)Decynoyl-NAC was synthesized and incubated with homogeneous dehydrase. The spectra showed that dehydrase adds to the inactivator so asmore » to quickly produce an (E)-3-(N/sup im/-histidinyl)-3-decenoyl thioester adduct at the active site. This species is slowly converted to the 2-decenoyl thioester congener. Titration of dehydrase with 3-(2/sup 13/C)decynoyl-NAC under these conditions clearly indicated that 2 moles of inactivator are bound to each mole of dehydrase dimer. These experiments provide a self-consistent picture of dehydrase inactivation by 3-decynoyl-NAC and normal dehydrase-catalyzed reactions. Dehydrase was cleaved by chemical fragmentation, and the resulting mixture of peptides were separated by reversed-phase HPLC. Partial N-terminal sequences of purified peptides were obtained by automated Edman technology« less

  17. Molecular Cloning and Pharmacological Properties of an Acidic PLA2 from Bothrops pauloensis Snake Venom

    PubMed Central

    Ferreira, Francis Barbosa; Gomes, Mário Sérgio Rocha; Naves de Souza, Dayane Lorena; Gimenes, Sarah Natalie Cirilo; Castanheira, Letícia Eulalio; Borges, Márcia Helena; Rodrigues, Renata Santos; Yoneyama, Kelly Aparecida Geraldo; Homsi Brandeburgo, Maria Inês; Rodrigues, Veridiana M.

    2013-01-01

    In this work, we describe the molecular cloning and pharmacological properties of an acidic phospholipase A2 (PLA2) isolated from Bothrops pauloensis snake venom. This enzyme, denominated BpPLA2-TXI, was purified by four chromatographic steps and represents 2.4% of the total snake venom protein content. BpPLA2-TXI is a monomeric protein with a molecular mass of 13.6 kDa, as demonstrated by Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF) analysis and its theoretical isoelectric point was 4.98. BpPLA2-TXI was catalytically active and showed some pharmacological effects such as inhibition of platelet aggregation induced by collagen or ADP and also induced edema and myotoxicity. BpPLA2-TXI displayed low cytotoxicity on TG-180 (CCRF S 180 II) and Ovarian Carcinoma (OVCAR-3), whereas no cytotoxicity was found in regard to MEF (Mouse Embryonic Fibroblast) and Sarcoma 180 (TIB-66). The N-terminal sequence of forty-eight amino acid residues was determined by Edman degradation. In addition, the complete primary structure of 122 amino acids was deduced by cDNA from the total RNA of the venom gland using specific primers, and it was significantly similar to other acidic D49 PLA2s. The phylogenetic analyses showed that BpPLA2-TXI forms a group with other acidic D49 PLA2s from the gender Bothrops, which are characterized by a catalytic activity associated with anti-platelet effects. PMID:24304676

  18. Purification, properties, and N-terminal amino acid sequence of homogeneous Escherichia coli 2-amino-3-ketobutyrate CoA ligase, a pyridoxal phosphate-dependent enzyme.

    PubMed

    Mukherjee, J J; Dekker, E E

    1987-10-25

    Starting with 100 g (wet weight) of a mutant of Escherichia coli K-12 forced to grow on L-threonine as sole carbon source, we developed a 6-step procedure that provides 30-40 mg of homogeneous 2-amino-3-ketobutyrate CoA ligase (also called aminoacetone synthetase or synthase). This ligase, which catalyzes the cleavage/condensation reaction between 2-amino-3-ketobutyrate (the presumed product of the L-threonine dehydrogenase-catalyzed reaction) and glycine + acetyl-CoA, has an apparent molecular weight approximately equal to 85,000 and consists of two identical (or nearly identical) subunits with Mr = 42,000. Computer analysis of amino acid composition data, which gives the best fit nearest integer ratio for each residue, indicates a total of 387 amino acids/subunit with a calculated Mr = 42,093. Stepwise Edman degradation provided the N-terminal sequence of the first 21 amino acids. It is a pyridoxal phosphate-dependent enzyme since (a) several carbonyl reagents caused greater than 90% loss of activity, (b) dialysis against buffer containing hydroxylamine resulted in 89% loss of activity coincident with an 86% decrease in absorptivity at 428 nm, (c) incubation of the apoenzyme with 20 microM pyridoxal phosphate showed a parallel recovery (greater than 90%) of activity and 428-nm absorptivity, and (d) reduction of the holoenzyme with NaBH4 resulted in complete inactivation, disappearance of a new absorption maximum at 333 nm. Strict specificity for glycine is shown but acetyl-CoA (100%), n-propionyl-CoA (127%), or n-butyryl-CoA (16%) is utilized in the condensation reaction. Apparent Km values for acetyl-CoA, n-propionyl-CoA, and glycine are 59 microM, 80 microM, and 12 mM, respectively; the pH optimum = 7.5. Added divalent metal ions or sulfhydryl compounds inhibited catalysis of the condensation reaction.

  19. Biochemical, physicochemical and molecular characterization of a genuine 2-Cys-peroxiredoxin purified from cowpea [Vigna unguiculata (L.) Walpers] leaves.

    PubMed

    Silva, Fredy D A; Vasconcelos, Ilka M; Lobo, Marina D P; de Castro, Patrícia G; Magalhães, Vladimir G; de Freitas, Cléverson D T; Carlini, Célia R R S; Pinto, Paulo M; Beltramini, Leila M; Filho, José H A; Barros, Eduardo B; Alencar, Luciana M R; Grangeiro, Thalles B; Oliveira, José T A

    2012-07-01

    Peroxiredoxins have diverse functions in cellular defense-signaling pathways. 2-Cys-peroxiredoxins (2-Cys-Prx) reduce H2O2 and alkyl-hydroperoxide. This study describes the purification and characterization of a genuine 2-Cys-Prx from Vigna unguiculata (Vu-2-Cys-Prx). Vu-2-Cys-Prx was purified from leaves by ammonium sulfate fractionation, chitin affinity and ion exchange chromatography. Vu-2-Cys-Prx reduces H2O2 using NADPH and DTT. Vu-2-Cys-Prx is a 44 kDa (SDS-PAGE)/46 kDa (exclusion chromatography) protein that appears as a 22 kDa molecule under reducing conditions, indicating that it is a homodimer linked intermolecularly by disulfide bonds and has a pI range of 4.56–4.72; its NH2-terminal sequence was similar to 2-Cys-Prx from Phaseolus vulgaris (96%) and Populus tricocarpa (96%). Analysis by ESI-Q-TOF MS/MS showed a molecular mass/pI of 28.622 kDa/5.18. Vu-2-Cys-Prx has 8% α-helix, 39% β-sheet, 22% of turns and 31% of unordered forms. Vu-2-Cys-Prx was heat stable, has optimal activity at pH 7.0, and prevented plasmid DNA degradation. Atomic force microscopy shows that Vu-2-Cys-Prx oligomerized in decamers which might be associated with its molecular chaperone activity that prevented denaturation of insulin and citrate synthase. Its cDNA analysis showed that the redox-active Cys52 residue and the amino acids Pro45, Thr49 and Arg128 are conserved as in other 2-Cys-Prx. The biochemical and molecular features of Vu-2-Cys-Prx are similar to other members of 2-Cys-Prx family. To date, only one publication reported on the purification of native 2-Cys-Prx from leaves and the subsequent analysis by N-terminal Edman sequencing, which is crucial for construction of stromal recombinant 2-Cys-Prx proteins.

  20. Sequence of the radioactive tryptic peptide obtained after inactivating the F1-ATPase of the thermophilic bacterium PS3 with 5'-p-fluorosulfonylbenzoyl(3H)adenosine at 65 degrees C

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bullough, D.A.; Yoshida, M.; Allison, W.S.

    1986-02-01

    Following a lag of about 30 min, the F1-ATPase from the thermophilic bacterium, PS3 (TF1), was inactivated slowly by 0.8 mM 5'-p-fluorosulfonylbenzoyladenosine (FSBA) at 23 degrees C and pH 7.0. When the enzyme was treated with 0.2 mM FSBA at pH 7.0 and 23 degrees C for 15 min and gel-filtered, no enzyme activity was lost. However, the lag in inactivation was abolished when the enzyme was subsequently incubated with 2.0 mM FSBA at 23 degrees C in the pH range from 6.8 to 10.0. The pH-inactivation profile obtained under these conditions revealed a pK alpha of about 9.3 whichmore » was associated with the inactivation. When pretreated TF1 was inactivated at 23 degrees C with (3H)FSBA by about 90%, greater than 20 mol of (3H)SBA was incorporated per mole of enzyme. TF1 was inactivated rapidly by 0.8 mM FSBA at pH 6.4 and 65 degrees C, and no lag was observed. Following inactivation of TF1 with 0.8 mM (3H)FSBA at 65 degrees C and pH 6.4, about 10 mol of (3H)SBA was incorporated per mole of enzyme. When a tryptic digest of the labeled enzyme was fractionated by reversed-phase high-performance liquid chromatography, a single major radioactive peptide was isolated. When subjected to automatic Edman degradation, this peptide was shown to have the amino acid sequence: A-L-A-P-E-I-V-G-E-E-H-X-Q-V-A-R, where X indicates that a phenylthiohydantoin derivative was not detected in cycle 12. However, from the DNA sequence of the gene encoding the subunit of TF1 (Y. Kagawa, M. Ishizuka, T. Saishu, and S. Nakao (1985)), this position has been shown to be occupied by tyrosine. This tyrosine is homologous with beta-Tyr-368 of the bovine mitochondrial F1-ATPase (MF1) the modification of which is responsible for the inactivation MF1 by FSBA.« less

  1. Angiotensin I-Converting-Enzyme-Inhibitory and Antibacterial Peptides from Lactobacillus helveticus PR4 Proteinase-Hydrolyzed Caseins of Milk from Six Species

    PubMed Central

    Minervini, F.; Algaron, F.; Rizzello, C. G.; Fox, P. F.; Monnet, V.; Gobbetti, M.

    2003-01-01

    Sodium caseinates prepared from bovine, sheep, goat, pig, buffalo or human milk were hydrolyzed by a partially purified proteinase of Lactobacillus helveticus PR4. Peptides in each hydrolysate were fractionated by reversed-phase fast-protein liquid chromatography. The fractions which showed the highest angiotensin I-converting-enzyme (ACE)-inhibitory or antibacterial activity were sequenced by mass spectrum and Edman degradation analyses. Various ACE-inhibitory peptides were found in the hydrolysates: the bovine αS1-casein (αS1-CN) 24-47 fragment (f24-47), f169-193, and β-CN f58-76; ovine αS1-CN f1-6 and αS2-CN f182-185 and f186-188; caprine β-CN f58-65 and αS2-CN f182-187; buffalo β-CN f58-66; and a mixture of three tripeptides originating from human β-CN. A mixture of peptides with a C-terminal sequence, Pro-Gly-Pro, was found in the most active fraction of the pig sodium caseinate hydrolysate. The highest ACE-inhibitory activity of some peptides corresponded to the concentration of the ACE inhibitor (S)-N-(1-[ethoxycarbonyl]-3-phenylpropyl)-ala-pro maleate (enalapril) of 49.253 μg/ml (100 μmol/liter). Several of the above sequences had features in common with other ACE-inhibitory peptides reported in the literature. The 50% inhibitory concentration (IC50) of some of the crude peptide fractions was very low (16 to 100 μg/ml). Some identified peptides were chemically synthesized, and the ACE-inhibitory activity and IC50s were confirmed. An antibacterial peptide corresponding to β-CN f184-210 was identified in human sodium caseinate hydrolysate. It showed a very large spectrum of inhibition against gram-positive and -negative bacteria, including species of potential clinical interest, such as Enterococcus faecium, Bacillus megaterium, Escherichia coli, Listeria innocua, Salmonella spp., Yersinia enterocolitica, and Staphylococcus aureus. The MIC for E. coli F19 was ca. 50 μg/ml. Once generated, the bioactive peptides were resistant to further degradation by proteinase of L. helveticus PR4 or by trypsin and chymotrypsin. PMID:12957917

  2. Purification and partial characterization of an early pregnancy factor-induced suppressor factor (EPF-S1).

    PubMed

    Rolfe, B A; Athanasas-Platsis, S; Hoskin, M J; Morton, H; Cavanagh, A C

    1995-06-01

    The immunomodulatory properties of early pregnancy factor (EPF) are mediated through induction of at least two lymphokines, designated EPF-S1 and EPF-S2 (previously estimated M(r) 15,000 and 55,000 respectively). The activity of the former is MHC-restricted while the latter is restricted to a locus (or loci) outside the MHC. The present study established further criteria by which EPF-S1 and EPF-S2 might be distinguished from each other and compared with other suppressor factors. In addition, techniques have been developed to purify EPF-S1 to homogeneity. Congenic mouse strains were used to map the genetic restriction of EPF-S2 in the rosette inhibition test and high performance gel permeation chromatography was used to demonstrate that EPF-S1 induces EPF-S2 but not vice versa. Further studies then focused on isolation of this first component of the cascade, EPF-S1, from immune ascites (from growth in athymic mice of the anti-EPF-S1 producing rat-mouse hybridoma R2T gamma, in which EPF-S1 is complexed to antibody). Techniques used were acidification followed by application to Sep-pak C18 cartridges, high performance cation-exchange chromatography and two reversed-phased HPLC steps on a C3 column. Purified material was analyzed by SDS-PAGE and Edman degradation. Approximately 10 micrograms EPF-S1 were isolated fom 60 ml ascitic fluid. Homogeneity of the purified material was demonstrated by SDS-PAGE, where it ran as a single band of approximate M(r) 12,000 coincident with biological activity. Attempts at Edman degradation indicate that the molecule is N-blocked. Definitive primary characterization of EPF-S1 must await the preparation and isolation of proteolytic fragments of the molecule, but the present studies establish conditions which make such structural analysis possible.

  3. Maltaricin CPN, a new class IIa bacteriocin produced by Carnobacterium maltaromaticum CPN isolated from mould-ripened cheese.

    PubMed

    Hammi, I; Delalande, F; Belkhou, R; Marchioni, E; Cianferani, S; Ennahar, S

    2016-11-01

    The purpose of this study was to isolate, characterize and determine the structure and the antibacterial activities of a bacteriocin produced by Carnobacterium maltaromaticum CPN, a strain isolated from unpasteurized milk Camembert cheese. This bacteriocin, termed maltaricin CPN, was produced at higher amounts in MRS broth at temperatures between 15°C and 25°C. It was purified to homogeneity from culture supernatant by using a simple method consisting of cation-exchange and reversed-phase chromatographies. Mass spectrometry showed that maltaricin was a 4427·29 Da bacteriocin. Its amino acid sequence was determined by Edman degradation which showed that it had close similarity with bacteriocins of the class IIa. Maltaricin CPN consisted in fact of 44 unmodified amino acids including two cysteine residues at positions 9 and 14 linked by a disulphide bond. The antimicrobial activity of maltaricin CPN covered a range of bacteria, with strong activity against many species of Gram-positive bacteria, especially the food-borne pathogen Listeria monocytogenes, but no activity against Gram-negative ones. In the studied conditions, C. maltaromaticum CPN produced a new class IIa bacteriocin with strong anti-Listeria activity. The study covers the purification and the structural characterization of a new bacteriocin produced by strain C. maltaromaticum CPN isolated from Camembert cheese. Its activity against strains of L. monocytogenes and higher production rates at relatively low temperatures show potential technological applications to improve the safety of refrigerated food. © 2016 The Society for Applied Microbiology.

  4. Some biochemical and histochemical properties of human liver serine dehydratase.

    PubMed

    Kashii, Tatsuhiko; Gomi, Tomoharu; Oya, Takeshi; Ishii, Yoko; Oda, Hirofumi; Maruyama, Muneharu; Kobayashi, Masashi; Masuda, Tohru; Yamazaki, Mitsuaki; Nagata, Takuya; Tsukada, Kazuhiro; Nakajima, Akinori; Tatsu, Kazuhito; Mori, Hisashi; Takusagawa, Fusao; Ogawa, Hirofumi; Pitot, Henry C

    2005-03-01

    In rat, serine dehydratase (SDH) is abundant in the liver and known to be a gluconeogenic enzyme, while there is little information about the biochemical property of human liver serine dehydratase because of its low content and difficulty in obtaining fresh materials. To circumvent these problems, we purified recombinant enzyme from Escherichia coli, and compared some properties between human and rat liver serine dehydratases. Edman degradation showed that the N-terminal sequence of about 75% of human serine dehydratase starts from MetSTART-Met2-Ser3- and the rest from Ser3-, whereas the N-terminus of rat enzyme begins from the second codon of MetSTART-Ala2-. The heterogeneity of the purified preparation was totally confirmed by mass spectrometry. Accordingly, this observation in part fails to follow the general rule that the first Met is not removed when the side chain of the penultimate amino acid is bulky such as Met, Arg, Lys, etc. There existed the obvious differences in the local structures between the two enzymes as revealed by limited-proteolysis experiments using trypsin and Staphylococcus aureus V8 protease. The most prominent difference was found histochemically: expression of rat liver serine dehydratase is confined to the periportal region in which many enzymes involved in gluconeogenesis and urea cycle are known to coexist, whereas human liver serine dehydratase resides predominantly in the perivenous region. These findings provide an additional support to the previous notion suggested by physiological experiments that contribution of serine dehydratase to gluconeogenesis is negligible or little in human liver.

  5. The human gastrin precursor. Characterization of phosphorylated forms and fragments.

    PubMed Central

    Varro, A; Desmond, H; Pauwels, S; Gregory, H; Young, J; Dockray, G J

    1988-01-01

    There is a potential phosphorylation site in the C-terminal region of the precursor for the acid-stimulating hormone gastrin, which is immediately adjacent to an important cleavage point. In the present study we have sought to identify, separate, quantify and characterize phosphorylated and unphosphorylated forms of human progastrin and its fragments. Identification was made by two radioimmunoassays: (a) a novel assay employing an antibody raised to intact human progastrin; and (b) an assay using antibody reacting with the C-terminal tryptic fragment of human progastrin, as well as progastrin itself. Two forms of human progastrin isolated from a gastrinoma were separated by ion-exchange h.p.l.c., and had similar elution positions on reverse-phase h.p.l.c. and on gel filtration. The more acidic peptide contained close to equimolar amounts of phosphate. On trypsinization, peptides were released that co-eluted on ion-exchange h.p.l.c. with, and had the immunochemical properties of, naturally occurring C-terminal fragments of progastrin. One of the latter was isolated and shown by Edman degradation after derivatization with ethanethiol to have the sequence Ser (P)-Ala-Glu-Asp-Glu-Asn. Similar peptides occur in antral mucosa resected from ulcer patients. The unphosphorylated forms of progastrin predominated, whereas the phosphorylated forms of the C-terminal fragments were predominant. This distribution could be explained by preferential cleavage of phosphorylated progastrin. We conclude that in human progastrin, Ser-96 can occur in the phosphorylated form; this residue immediately follows a pair of basic residues (Arg-Arg) that are cleaved during synthesis of the biologically active product. PMID:3223964

  6. Tripeptidyl peptidase II. An oligomeric protease complex from Arabidopsis.

    PubMed

    Book, Adam J; Yang, Peizhen; Scalf, Mark; Smith, Lloyd M; Vierstra, Richard D

    2005-06-01

    The breakdown of most nuclear and cytoplasmic proteins involves their partial cleavage by the 26S proteasome followed by further disassembly to free amino acids by the combined action of endo- and exopeptidases. In animals, one important intermediate exopeptidase is tripeptidyl peptidase (TPP)II, which digests peptide products of the 26S proteasome and other endopeptidases into tripeptides. Here, we describe the purification and characterization of TPPII from Arabidopsis (Arabidopsis thaliana). Like its animal counterparts, Arabidopsis TPPII exists as a soluble, approximately 5- to 9-MD complex. Two related species of 153 and 142 kD are present in the purified preparations that are derived from a single TPP2 gene. Sequencing by Edman degradation of the intact polypeptides and mass spectrometry of proteolytic fragments demonstrated that the 142-kD form mainly differs from the 153-kD form by a truncation at the C-terminal end. This serine protease is a member of the subtilisin superfamily and is sensitive to the inhibitors alanine-alanine-phenylalanine-chloromethylketone and butabindide, which are diagnostic for the TPPII subfamily. The Arabidopsis TPP2 gene is widely expressed in many tissue types with related genes evident in other plant genomes. Whereas the 26S proteasome is essential, TPPII appears not as important for plant physiology. An Arabidopsis T-DNA mutant defective in TPP2 expression displays no phenotypic abnormalities and is not hypersensitive to either amino acid analogs or the 26S proteasome inhibitor MG132. As a consequence, plants likely contain other intermediate exopeptidases that assist in amino acid recycling.

  7. Tripeptidyl Peptidase II. An Oligomeric Protease Complex from Arabidopsis1

    PubMed Central

    Book, Adam J.; Yang, Peizhen; Scalf, Mark; Smith, Lloyd M.; Vierstra, Richard D.

    2005-01-01

    The breakdown of most nuclear and cytoplasmic proteins involves their partial cleavage by the 26S proteasome followed by further disassembly to free amino acids by the combined action of endo- and exopeptidases. In animals, one important intermediate exopeptidase is tripeptidyl peptidase (TPP)II, which digests peptide products of the 26S proteasome and other endopeptidases into tripeptides. Here, we describe the purification and characterization of TPPII from Arabidopsis (Arabidopsis thaliana). Like its animal counterparts, Arabidopsis TPPII exists as a soluble, approximately 5- to 9-MD complex. Two related species of 153 and 142 kD are present in the purified preparations that are derived from a single TPP2 gene. Sequencing by Edman degradation of the intact polypeptides and mass spectrometry of proteolytic fragments demonstrated that the 142-kD form mainly differs from the 153-kD form by a truncation at the C-terminal end. This serine protease is a member of the subtilisin superfamily and is sensitive to the inhibitors alanine-alanine-phenylalanine-chloromethylketone and butabindide, which are diagnostic for the TPPII subfamily. The Arabidopsis TPP2 gene is widely expressed in many tissue types with related genes evident in other plant genomes. Whereas the 26S proteasome is essential, TPPII appears not as important for plant physiology. An Arabidopsis T-DNA mutant defective in TPP2 expression displays no phenotypic abnormalities and is not hypersensitive to either amino acid analogs or the 26S proteasome inhibitor MG132. As a consequence, plants likely contain other intermediate exopeptidases that assist in amino acid recycling. PMID:15908606

  8. Multiple forms of statherin in human salivary secretions.

    PubMed

    Jensen, J L; Lamkin, M S; Troxler, R F; Oppenheim, F G

    1991-01-01

    Sequential chromatography of hydroxyapatite-adsorbed salivary proteins from submandibular/sublingual secretions on Sephadex G-50 and reversed-phase HPLC resulted in the purification of statherin and several statherin variants. Amino acid analysis, Edman degradation and carboxypeptidase digestion of the obtained protein fractions led to the determination of the complete primary structures of statherin SV1, statherin SV2, and statherin SV3. SV1 is identical to statherin but lacks the carboxyl-terminal phenylalanine residue. SV2, lacking residues 6-15, is otherwise identical to statherin. SV3 is identical to SV2 but lacks the carboxyl-terminal phenylalanine. These results provide the first evidence for multiple forms of statherin which are probably derived both by post-translational modification and alternative splicing of the statherin gene.

  9. Preparation and characterization of 5-(4-hydroxy-3-nitrobenzyl)-3-phenyl-2-thiohydantoin, the phenylthiohydantoin derivative of 3-nitrotyrosine.

    PubMed

    Lilova, A; Kleinschmidt, T; Nedkov, P; Braunitzer, G

    1986-10-01

    The phenylthiocarbamoyl derivative of 3-nitrotyrosine was synthesized according to the known Edman method and then converted to its phenylthiohydantoin derivative [5-(4-hydroxy-3-nitrobenzyl)-3-phenyl-2-thiohydantion] by incubation in 0.5M HCl for 24 h at room temperature. After drying over P2O5 the chromatographically pure substance could be obtained by double recrystallization from hot acetic acid. It could be established that a shorter incubation time leads to an incomplete conversion and higher temperatures cause polymerization of the product. The compounds could be characterized by thin-layer and high-performance liquid chromatography, melting point, elemental analysis as well as NMR- and absorption spectroscopy.

  10. Venoms of Centruroides and Tityus species from Panama and their main toxic fractions.

    PubMed

    Salazar, Marcos H; Arenas, Iván; Corrales-García, Ligia L; Miranda, Roberto; Vélez, Sara; Sánchez, Jairo; Mendoza, Karla; Cleghorn, John; Zamudio, Fernando Z; Castillo, Adolfo; Possani, Lourival D; Corzo, Gerardo; Acosta, Hildaura

    2018-01-01

    The scorpionism in Panama is notorious for the confluence and coexistence of buthid scorpions from the genera Centruroides and Tityus. This communication describes an overview of the larger representative toxic venom fractions from eight dangerous buthid scorpion species of Panama: Centruroides (C. granosus, C. bicolor, C. limbatus and C. panamensis) and Tityus (T. (A.) asthenes, T. (A.) festae, T. (T.) cerroazul and T. (A.) pachyurus). Their venoms were separated by HPLC and the corresponding sub-fractions were tested for lethality effects on mice and insects. Many fractions toxic to either mice or insects, or both, were found and have had their molecular masses determined by mass spectrometry analysis. The great majority of the lethal components had a molecular mass close to 7000 Da, assumed to be peptides that recognize Na + -channels, responsible for the toxicity symptoms observed in other buthids scorpion venoms. A toxic peptide isolated from the venom of T. pachyurus was sequenced by Edman degradation, allowing the synthesis of nucleotide probe for cloning the correspondent gene. The mature toxin based on the cDNA sequencing has the C-terminal residue amidated, contains 62 amino acid packed by 4 disulfide linkages, with molecular mass of 7099.1 Da. This same toxic peptide seems to be present in scorpions of the species T. pachyurus collected in 5 different regions of Panama, although the overall HPLC profile is quite different. The most diverse neurotoxic venom components from the genus Centruroides were found in the species C. panamensis, whereas T. cerroazul was the one from the genus Tityus. The most common neurotoxins were observed in the venoms of T. festae, T. asthenes and T. pachyurus with closely related molecular masses of 7099.1 and 7332 Da. The information reported here is considered very important for future generation of a neutralizing antivenom against scorpions from Panama. Furthermore, it will contribute to the growing interest in using bioactive toxins from scorpions for drug discovery purposes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. A computational module assembled from different protease family motifs identifies PI PLC from Bacillus cereus as a putative prolyl peptidase with a serine protease scaffold.

    PubMed

    Rendón-Ramírez, Adela; Shukla, Manish; Oda, Masataka; Chakraborty, Sandeep; Minda, Renu; Dandekar, Abhaya M; Ásgeirsson, Bjarni; Goñi, Félix M; Rao, Basuthkar J

    2013-01-01

    Proteolytic enzymes have evolved several mechanisms to cleave peptide bonds. These distinct types have been systematically categorized in the MEROPS database. While a BLAST search on these proteases identifies homologous proteins, sequence alignment methods often fail to identify relationships arising from convergent evolution, exon shuffling, and modular reuse of catalytic units. We have previously established a computational method to detect functions in proteins based on the spatial and electrostatic properties of the catalytic residues (CLASP). CLASP identified a promiscuous serine protease scaffold in alkaline phosphatases (AP) and a scaffold recognizing a β-lactam (imipenem) in a cold-active Vibrio AP. Subsequently, we defined a methodology to quantify promiscuous activities in a wide range of proteins. Here, we assemble a module which encapsulates the multifarious motifs used by protease families listed in the MEROPS database. Since APs and proteases are an integral component of outer membrane vesicles (OMV), we sought to query other OMV proteins, like phospholipase C (PLC), using this search module. Our analysis indicated that phosphoinositide-specific PLC from Bacillus cereus is a serine protease. This was validated by protease assays, mass spectrometry and by inhibition of the native phospholipase activity of PI-PLC by the well-known serine protease inhibitor AEBSF (IC50 = 0.018 mM). Edman degradation analysis linked the specificity of the protease activity to a proline in the amino terminal, suggesting that the PI-PLC is a prolyl peptidase. Thus, we propose a computational method of extending protein families based on the spatial and electrostatic congruence of active site residues.

  12. Conorfamide-Sr2, a gamma-carboxyglutamate-containing FMRFamide-related peptide from the venom of Conus spurius with activity in mice and mollusks

    PubMed Central

    Aguilar, Manuel B.; Luna-Ramírez, Karen S.; Echeverría, Daniel; Falcón, Andrés; Olivera, Baldomero M.; Heimer de la Cotera, Edgar P.; Maillo, María

    2008-01-01

    A novel peptide, conorfamide-Sr2 (CNF-Sr2), was purified from the venom extract of Conus spurius, collected in the Caribbean Sea off the Yucatan Peninsula. Its primary structure was determined by automated Edman degradation and amino acid analysis, and confirmed by electrospray ionization mass spectrometry. Conorfamide-Sr2 contains 12 amino acids and no Cys residues, and it is only the second FMRFamide-related peptide isolated from a venom. Its primary structure GPMγDPLγIIRI-nh2, (γ, gamma-carboxyglutamate;-nh2, amidated C-terminus; calculated monoisotopic mass, 1,468.72 Da; experimental monoisotopic mass, 1,468.70 Da) shows two features that are unusual among FMRFamide-related peptides (FaRPs, also known as RFamide peptides), namely the novel presence of gamma-carboxyglutamate, and a rather uncommon C-terminal residue, Ile. CNF-Sr2 exhibits paralytic activity in the limpet Patella opea and causes hyperactivity in the freshwater snail Pomacea paludosa and in the mouse. The sequence similarities of CNF-Sr2 with FaRPs from marine and freshwater mollusks and mice might explain its biological effects in these organisms. It also resembles FaRPs from polychaetes (the prey of C. spurius), which suggests a natural biological role. Based on these similarities, CNF-Sr2 might interact with receptors of these three distinct types of FaRPs, G-protein-coupled receptors, Na+ channels activated by FMRFamide (FaNaCs), and acid-sensing ion channels (ASICs). The biological activities of CNF-Sr2 in mollusks and mice make it a potential tool to study molecular targets in these and other organisms. PMID:18201803

  13. Identification of amino acids in the N-terminal SH2 domain of phospholipase C gamma 1 important in the interaction with epidermal growth factor receptor.

    PubMed

    Gergel, J R; McNamara, D J; Dobrusin, E M; Zhu, G; Saltiel, A R; Miller, W T

    1994-12-13

    Photoaffinity labeling and site-directed mutagenesis have been used to identify amino acid residues of the phospholipase C gamma 1 (PLC gamma 1) N-terminal SH2 domain involved in recognition of the activated epidermal growth factor receptor (EGFR). The photoactive amino acid p-benzoylphenylalanine (Bpa) was incorporated into phosphotyrosine-containing peptides derived from EGFR autophosphorylation sites Tyr992 and Tyr1068. Irradiation of these labels in the presence of SH2 domains showed cross-linking which was time-dependent and specific; labeling was inhibited with non-Bpa-containing peptides from EGFR in molar excess. The phosphotyrosine residue on the peptides was important for SH2 recognition, as dephosphorylated peptides did not cross-link. Radiolabeled peptides were used to identify sites of cross-linking to the N-terminal SH2 of PLC gamma 1. Bpa peptide-SH2 complexes were digested with trypsin, and radioactive fragments were purified by HPLC and analyzed by Edman sequencing. These experiments showed Arg562 and an additional site in the alpha A-beta B region of the SH2 domain, most likely Glu587, to be labeled by the Tyr992-derived peptide. Similar analysis of the reaction with the Tyr1068-derived photoaffinity label identified Leu653 as the cross-linked site. Mutation of the neighboring residues of Glu587 decreased photo-cross-linking, emphasizing the importance of this region of the molecule for recognition. These results are consistent with evidence from the v-Src crystal structure and implicate the loop spanning residues Gln640-Ser654 of PLC gamma 1 in specific recognition of phosphopeptides.

  14. Characterization of toxin III of the scorpion Leiurus quinquestriatus quinquestriatus: a new type of alpha-toxin highly toxic both to mammals and insects.

    PubMed

    Kopeyan, C; Mansuelle, P; Martin-Eauclaire, M F; Rochat, H; Miranda, F

    1993-01-01

    The primary structure of toxin III of Leiurus quinquestriatus quinquestriatus (Lqq III) was elucidated by automatic Edman degradation of the reduced and S-carboxymethylated protein and derived tryptic peptides. Like other scorpion toxins that are active on sodium channels, Lqq III, consisting of 64 amino acids, is a 7 kDa single-chain polypeptide crosslinked by four disulfide bridges. It belongs to the alpha-toxin group, as judged by competition experiments with 125I AaH II for binding to rat brain synaptosomes (K0.5 = 7 x 10(-7) M). Lqq III is the first alpha-toxin to be characterized that is highly toxic to mice [LD50 = 50 micrograms (7.1 nmol)/kg body wt], by subcutaneous injection, insects Blatella germanica [LD50 = 60 ng (8.5 pmol)/g body wt.] and Musca domestica [LD50 = 120 ng (17 pmol)/g body wt]. When tested via the intracerebroventricular route, the toxicity for mice [55 micrograms (8 nmol)/kg] was of the same order as that found by subcutaneous injection, indicating that Lqq III has a higher affinity for peripheral sodium channels that for those of the central nervous system. There are three differences between the sequences of Lqq III and Lqh alpha IT, an alpha-toxin isolated from the venom of Leiurus quinquestriatus hebraeus. These substitutions are found at positions 20, 24, and 64 (Ser-->Ala,Asp-->Glu and His-->Arg, respectively). Surprisingly Lqh alpha IT is only weakly active in mice [LD50 = 5 mg (0.7 mumol)/kg], while in insects its toxicity is similar to that of Lqq III [140 ng (20 pmol)/g body wt blowfly larvae]. These observations are relevant to the definition of scorpion toxin structure-activity relationships.

  15. New APETx-like peptides from sea anemone Heteractis crispa modulate ASIC1a channels.

    PubMed

    Kalina, Rimma; Gladkikh, Irina; Dmitrenok, Pavel; Chernikov, Oleg; Koshelev, Sergey; Kvetkina, Aleksandra; Kozlov, Sergey; Kozlovskaya, Emma; Monastyrnaya, Margarita

    2018-06-01

    Sea anemones are an abundant source of various biologically active peptides. The hydrophobic 20% ethanol fraction of tropical sea anemone Heteractis crispa was shown to contain at least 159 peptide compounds including neurotoxins, proteinase and α-amylase inhibitors, as well as modulators of acid-sensing ion channels (ASICs). The three new peptides, π-AnmTX Hcr 1b-2, -3, and -4 (41 aa) (short names Hcr 1b-2, -3, -4), identified by a combination of reversed-phase liquid chromatography and mass spectrometry were found to belong to the class 1b sea anemone neurotoxins. The amino acid sequences of these peptides were determined by Edman degradation and tandem mass spectrometry. The percent of identity of Hcr 1b-2, -3, and -4 with well-known ASIC3 inhibitors Hcr 1b-1 from H. crispa and APETx2 from Anthopleura elegantissima is 95-78% and 46-49%, respectively. Electrophysiological experiments on homomeric ASIC channels expressed in Xenopus laevis oocytes establish that these peptides are the first inhibitors of ASIC1a derived from sea anemone venom. The major peptide, Hcr 1b-2, inhibited both rASIC1a (IC 50 4.8 ± 0.3 μM; nH 0.92 ± 0.05) and rASIC3 (IC 50 15.9 ± 1.1 μM; nH 1.0 ± 0.05). The maximum inhibition at saturating peptide concentrations reached 64% and 81%, respectively. In the model of acid-induced muscle pain Hcr 1b-2 was also shown to exhibit an antihyperalgesic effect, significantly reducing of the pain threshold of experimental animals. Copyright © 2018 Elsevier Inc. All rights reserved.

  16. Purification, characterization and specificity of chondroitin lyases and glycuronidase from Flavobacterium heparinum.

    PubMed Central

    Gu, K; Linhardt, R J; Laliberté, M; Gu, K; Zimmermann, J

    1995-01-01

    The chondroitin lyases from Flavobacterium heparinum (Cytophaga heparinia) have been widely used in depolymerization of glycosaminoglycan and proteoglycan chondroitin sulphates. Oligosaccharide products derived from chondroitin sulphate can be further degraded by glycuronidases and sulphatases obtained from the same organism. There has been no reported purification of these enzymes to homogeneity nor is there any information on their physical and kinetic characteristics. The absence of pure enzymes has resulted in a lack of understanding of the optimal conditions for their catalytic activity and their substrate specificity. This has limited the use of these enzymes as reagents for preparation of oligosaccharides for structure and activity studies. Reproducible schemes to purify a chondroitin AC lyase, a glycuronidase and chondroitin B lyase from Flavobacterium heparinum to apparent homogeneity are described. Chondroitin AC lyase (chondroitinase AC, EC 4.2.2.5), glycuronidase [chondro-(1-->3)-glycuronidase, no EC number] and chondroitin B lyase (chondroitinase B, no EC number) have M(r) values (assessed by SDS/PAGE) of 74,000, 41,800 and 55,200 respectively, and isoelectric points (determined by isoelectric focusing) of 8.85, 9.28 and 9.05 respectively. Chondroitin lyase AC and B contain pyroglutamic acid at their N-termini precluding their analysis by Edman degradation. Deblocking with pyroglutamate aminopeptidase facilitated the determination of their N-terminal sequences. The kinetic properties of these enzymes have been determined as well as the optimum conditions for their catalytic activity. The specificity of the glycouronidase, determined using 17 different disaccharide substrates, shows that it only acts on unsulphated or 6-O-sulphated 1-->3 linkages. The chondroitin lyases are both endolytic enzymes, and oligosaccharide mapping shows their expected specificity towards the chondroitin and dermatan sulphate polymers. Images Figure 2 Figure 4 PMID:8526872

  17. Formation of pyroglutamic acid from N-terminal glutamic acid in immunoglobulin gamma antibodies.

    PubMed

    Chelius, Dirk; Jing, Kay; Lueras, Alexis; Rehder, Douglas S; Dillon, Thomas M; Vizel, Alona; Rajan, Rahul S; Li, Tiansheng; Treuheit, Michael J; Bondarenko, Pavel V

    2006-04-01

    The status of the N-terminus of proteins is important for amino acid sequencing by Edman degradation, protein identification by shotgun and top-down techniques, and to uncover biological functions, which may be associated with modifications. In this study, we investigated the pyroglutamic acid formation from N-terminal glutamic acid residues in recombinant monoclonal antibodies. Almost half the antibodies reported in the literature contain a glutamic acid residue at the N-terminus of the light or the heavy chain. Our reversed-phase high-performance liquid chromatography-mass spectrometry method could separate the pyroglutamic acid-containing light chains from the native light chains of reduced and alkylated recombinant monoclonal antibodies. Tryptic peptide mapping and tandem mass spectrometry of the reduced and alkylated proteins was used for the identification of the pyroglutamic acid. We identified the formation of pyroglutamic acid from N-terminal glutamic acid in the heavy chains and light chains of several antibodies, indicating that this nonenzymatic reaction does occur very commonly and can be detected after a few weeks of incubation at 37 and 45 degrees C. The rate of this reaction was measured in several aqueous buffers with different pH values, showing minimal formation of pyroglutamic acid at pH 6.2 and increased formation of pyroglutamic acid at pH 4 and pH 8. The half-life of the N-terminal glutamic acid was approximately 9 months in a pH 4.1 buffer at 45 degrees C. To our knowledge, we showed for the first time that glutamic acid residues located at the N-terminus of proteins undergo pyroglutamic acid formation in vitro.

  18. Venom from the snake Bothrops asper Garman. Purification and characterization of three phospholipases A2

    PubMed Central

    Anagón, Alejandro C.; Molinar, Ricardo R.; Possani, Lourival D.; Fletcher, Paul L.; Cronan, John E.; Julia, Jordi Z.

    1980-01-01

    The water-soluble venom of Bothrops asper Garman (San Juan Evangelista, Veracruz, México) showed 15 polypeptide bands on polyacrylamide-gel electrophoresis. This material exhibited phospholipase, hyaluronidase, N-benzoyl-l-arginine ethyl hydrolase, N-benzoyl-l-tyrosine ethyl hydrolase and phosphodiesterase activity, but no alkaline phosphatase or acid phosphatase activity. Fractionation on Sephadex G-75 afforded seven protein fractions, which were apparently less toxic than the whole venom (LD50=4.3μg/g mouse wt.). Subsequent separation of the phospholipase-positive fraction (II) on DEAE-cellulose with potassium phosphate buffers (pH7.55) gave several fractions, two being phospholipase-positive (II.6 and II.8). These fractions were further purified on DEAE-cellulose columns with potassium phosphate buffers (pH8.6). Fraction II.8.4 was rechromatographed in the same DEAE-cellulose column, giving a pure protein designated phospholipase 1. The fraction II.6.3 was further separated by gel disc electrophoresis yielding two more pure proteins designated phospholipase 2 and phospholipase 3. Analysis of phospholipids hydrolysed by these enzymes have shown that all three phospholipases belong to type A2. Amino acid analysis has shown that phospholipase A2 (type 1) has 97 residues with a calculated mol.wt. of 10978±11. Phospholipase A2 (type 2) has 96 residues with a mol.wt. of 10959±11. Phospholipase A2 (type 3) has 266 residues with 16 half-cystine residues and a calculated mol.wt of 29042±31. Automated Edman degradation showed the N-terminal sequence to be: Asx-Leu-Trp-Glx-Phe-Gly-Glx-Met-Met-Ser-Asx-Val- Met-Arg-Lys-Asx-Val-Val-Phe-Lys-Tyr-Leu- for phospholipase A2 (type 2). ImagesFig. 1. PMID:7387631

  19. Isolation and biochemical characterization of a γ-type phospholipase A2 inhibitor from Crotalus durissus collilineatus snake serum.

    PubMed

    Gimenes, Sarah Natalie Cirilo; Ferreira, Francis Barbosa; Silveira, Ana Carolina Portella; Rodrigues, Renata Santos; Yoneyama, Kelly Aparecida Geraldo; Izabel Dos Santos, Juliana; Fontes, Marcos Roberto de Mattos; de Campos Brites, Vera Lúcia; Santos, André Luiz Quagliatto; Borges, Márcia Helena; Lopes, Daiana Silva; Rodrigues, Veridiana M

    2014-04-01

    In the present work, we describe the isolation and partial structural and biochemical characterization of the first phospholipase A2 inhibitor (γPLI) from Crotalus durissus collilineatus (Cdc) snake serum. Initially, the Cdc serum was subjected to a Q-Sepharose ion exchange column, producing six peaks at 280 nm absorbance (Q1-Q6). Subsequently, Q4 fraction was submitted to affinity chromatography with immobilized PLA2 BnSP-7, a step that resulted in two fractions (NHS-1 and NHS-2). The latter contained the inhibitor, denominated γCdcPLI. The molecular mass of γCdcPLI, determined by Matrix-Assisted Laser Desorption Ionization Time-of-Flight (MALDI-TOF), was 22,340 Da. Partial sequences obtained by Edman degradation and by mass spectrometry (MALDI-TOF/TOF), showed similarity, as expected, to other related inhibitors. Circular dichroism (CD) analysis showed the presence of approximately 22% alpha helices and 29% beta sheets in the protein secondary structure. Additionally, CD studies also indicated no significant changes in the secondary structure of γCdcPLI when it is complexed to BpPLA2-TXI. On the other hand, dynamic light scattering (DLS) assays showed a temperature-dependent oligomerization behavior for this inhibitor. Biochemical analyses showed γCdcPLI was able to inhibit the enzymatic, cytotoxic and myotoxic activities of PLA2s. Structural and functional studies performed on this inhibitor may elucidate the action mechanisms of PLA2 inhibitors. In addition, we hope this study may contribute to investigating the potential use of these inhibitors for the treatment of snakebite or inflammatory diseases in which PLA2s may be involved. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. cDNA cloning and immunological characterization of a newly identified enolase allergen from Penicillium citrinum and Aspergillus fumigatus.

    PubMed

    Lai, Hsiu-Yu; Tam, Ming F; Tang, Ren-Bin; Chou, Hong; Chang, Ching-Yun; Tsai, Jaw-Ji; Shen, Horng-Der

    2002-03-01

    Penicillium citrinum and Aspergillus fumigatus are prevalent indoor airborne fungal species that have been implicated in human respiratory allergic disorders. It is important to understand the allergenic profile of these fungal species. The purpose of the present study is to characterize a newly identified enolase allergen from P. citrinum and A. fumigatus. Fungal proteins were separated by two-dimensional (2D) gel electrophoresis and blotted onto polyvinylidene difluoride membranes. Protein spots that reacted with IgE antibodies in serum samples from asthmatic patients were identified and the N-terminal amino acid sequences were determined by Edman degradation. The peptide sequences obtained were utilized in cloning the cDNA of the allergen genes by reverse transcriptase-polymerase chain reaction and the 5'- and 3'-rapid amplification cDNA end reactions. Our results from 2D immunoblotting identified a 47-kD IgE-reactive component in the extracts of P. citrinum and A. fumigatus. The N-terminal amino acid sequences of the 47-kD proteins are homologous to those of fungal enolases. The corresponding enolase cDNA from P. citrinum contains 1,552 bp and encodes a protein of 438 residues. In A. fumigatus, the isolated enolase cDNA has 1,649 bp and contains a 438-amino acid open reading frame. The deduced amino acid sequences of these two enolases have 94% identity. These enolases from P. citrinum and A. fumigatus were expressed in Escherichia coli as a His-tagged protein and designated as rPen c 22 and rAsp f 22, respectively. Sera from 7 (30%) of the 23 Penicillium-sensitized asthmatic patients showed IgE binding to the 47-kD P. citrinum component (Pen c 22) and rPen c 22. In addition, six of seven Pen c 22-positive serum samples have IgE immunoblot reactivity to the 47-kD A. fumigatus component (Asp f 22) and rAsp f 22. A polyclonal rabbit antiserum generated against the N-terminal peptide of Pen c 22 can react with Pen c 22, rPen c 22, Asp f 22 and rAsp f 22. In addition, the presence of IgE cross-reactivity between rPen c 22 and rAsp f 22 and between enolases from A. fumigatus and Alternaria alternata was also detected by immunoblot inhibition. These results demonstrated that a novel enolase allergen from P. citrinum (Pen c 22) and A. fumigatus (Asp f 22) was identified. In addition, IgE cross-reactivity between enolase allergens from A. fumigatus and P. citrinum and between enolases from A. fumigatus and A. alternata was also detected. Results obtained provide more information on fungal enolase allergens. Copyright 2002 S. Karger AG, Basel

  1. Primary structure of a guanyl-specific ribonuclease from the fungus Penicillium brevicompactum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kulikov, V.A.; Shlyapnikov, S.V.; Yakovlev, G.I.

    1986-01-01

    By the automatic Edman degradation of the intact S-carboxymethylated protein and a mixture of the products of its proteolytic cleavage at Arg, Lys, and Glu residues, together with results on the kinetics of the proteolysis of the protein under the action of carboxypeptidase Y, the primary structure of the extracellular guanyl-specific RNase of the fungus Penicillium brevicompactum has been determined. The RNase contains 102 amino acid residues: 7 Asp, 7 Asn, 9 Thr, 11 Ser, 4 Glu, 1 Gln, 4 Pro, 10 Gly, 11 Ala, 4 Cys, 7 Val, 4 Ile, 3 Leu, 9 Tyr, 5 Phe, 2 Lys, 3more » His, 1 Arg (M/sub r/ 10,801). It has been established that four hemicystine residues of the P. compactum RNase form, in pairs, two disulfide bonds« less

  2. Biomimetic Synthesis of Macahydantoins A and B from Lepidium meyenii, and Structure Revision of Macahydantoin B as a Class of Thiohydantoin with a 4-Methyl-hexahydropyrrolo[1,2-c]imidazole Skeleton.

    PubMed

    Zhou, Min; Ma, Hang-Ying; Xing, Huan-Huan; Li, Ping; Li, Gan-Peng; Geng, Hui-Chun; Hu, Qiu-Fen; Yang, Guang-Yu

    2017-09-15

    Phytochemical investigation on Lepidium meyenii led to the discovery of macahydantoin C (3), a new thiohydantoin with a 1,3-diazabicyclo[3.3.1]nonane core, the spectral properties of which indicate a potential structural misassignment of its previously reported analogue, macahydantoin B (2a). To probe this hypothesis, a concise, scalable, and biomimetic synthesis of the originally proposed 2a and its revised structure (2b) was efficiently accomplished using the modified Edman degradation as the key step from commercially available materials in 65% (three steps) and 52% (three steps) overall yields, respectively. These synthetic endeavors undoubtedly reassigned the structure of macahydantoin B as an unreported type of thiohydantoin featuring a 4-methyl-hexahydropyrrolo[1,2-c]imidazole scaffold.

  3. Desleucyl-Oritavancin with a Damaged d-Ala-d-Ala Binding Site Inhibits the Transpeptidation Step of Cell-Wall Biosynthesis in Whole Cells of Staphylococcus aureus.

    PubMed

    Kim, Sung Joon; Singh, Manmilan; Sharif, Shasad; Schaefer, Jacob

    2017-03-14

    We have used solid-state nuclear magnetic resonance to characterize the exact nature of the dual mode of action of oritavancin in preventing cell-wall assembly in Staphylococcus aureus. Measurements performed on whole cells labeled selectively in vivo have established that des-N-methylleucyl-N-4-(4-fluorophenyl)benzyl-chloroeremomycin, an Edman degradation product of [ 19 F]oritavancin, which has a damaged d-Ala-d-Ala binding aglycon, is a potent inhibitor of the transpeptidase activity of cell-wall biosynthesis. The desleucyl drug binds to partially cross-linked peptidoglycan by a cleft formed between the drug aglycon and its biphenyl hydrophobic side chain. This type of binding site is present in other oritavancin-like glycopeptides, which suggests that for these drugs a similar transpeptidase inhibition occurs.

  4. Poliovirus RNA synthesis in vitro: structural elements and antibody inhibition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Semler, B.L.; Hanecak, R.; Dorner, L.F.

    1983-01-01

    The poliovirus RNA polymerase complex has been analyzed by immunoautoradiography using antibody probes derived from purified replicase (P3) region viral polypeptides. Antibody preparations made against the polio RNA polymerase, P3-4b, detected a previously unreported cellular protein that copurifies with the RNA polymerase. An IgG fraction purified from rabbit antiserum to polypeptide P3-2, a precursor fo the RNA polymerase, specifically inhibits poliovirus RNA synthesis in vitro. The authors have also immunoprecipitated a 60,000-dalton protein (P3-4a) with antiserum to protein P3-4b and have determined the precise genomic map position of this protein by automated Edman degradation. Protein P3-4a originates by cleavage ofmore » the RNA polymerase precursor at a glutamine-glucine amino acid pair not previously reported to be a viral cleavage site.« less

  5. Characterization of the major cyanogen bromide fragment of alpha-A crystallin

    NASA Technical Reports Server (NTRS)

    Ifeanyi, F.; Takemoto, L.; Spooner, B. S. (Principal Investigator)

    1991-01-01

    Alpha crystallin from the bovine lens has been digested with cyanogen bromide, and the major fragment (CB-1) has been purified using reverse phase HPLC. Characterization of this fragment by Edman degradation and antisera to synthetic peptides indicates that it originates from alpha-A crystallin, but lacks the N-terminal methionine and the last 35 amino acids from the C-terminus of the molecule. The purified CB-1 fragment binds as well as native alpha crystallin to lens membrane, but is unable to self-assemble into the correct size of high molecular weight oligomeric complexes characteristic of the intact alpha-A chain. Together, these results demonstrate that the alpha-A chain is comprised of at least two functional domains, one of which is involved in binding of alpha-A crystallin to lens membrane, and another which is necessary for correct self-assembly of the molecule into high molecular weight oligomers.

  6. Structural characterization of a novel neuropeptide from the central nervous system of the leech Erpobdella octoculata. The leech osmoregulator factor.

    PubMed

    Salzet, M; Bulet, P; Weber, W M; Clauss, W; Verger-Bocquet, M; Malecha, J

    1996-03-22

    Purification of a material immunoreactive to an antiserum against the C-terminal part of the oxytocin (Pro-Leu-Gly-amide) and present in the central nervous system of the Pharyngobdellid leech Erpobdella octoculata was performed by reversed-phase high performance liquid chromatography combined with both enzyme-linked immunosorbent and dot immunobinding assays for oxytocin. The amino acid sequence of the purified peptide (Ile-Pro-Glu-Pro-Tyr-Val-Trp-Asp) was established by Edman degradation and confirmed by electrospray mass spectrometry measurement. When injected in leeches, purified or synthetic peptides exert an anti-diuretic effect, the most effective ranged between 10 pmol and 1 nmol. They provoked an uptake of water 1-2 h post-injection. Furthermore, electrophysiological experiments conducted in the leech Hirudo medicinalis revealed an inhibition of the potency of Na+ conductances of leech skin by this peptide. Immunocytochemical studies with an antiserum against synthetic oxytocin-like molecule provided the cytological basis for existence of a neuropeptide, since large amounts of immunoreactive neurons were detected in the central nervous systems of E. octoculata. The purified molecule is both different to peptides of the oxytocin/vasopressin family and is a novel neuropeptide in the animal kingdom. It was named the leech osmoregulator factor (LORF). An identification of the proteins immunoreactive to an antiserum against oxytocin performed at the level of both central nervous systems extracts and in vitro central nervous system-translated RNA products indicated that in the two cases, a single protein was detected. These proteins with a molecular masses of, respectively, approximately 34 kDa (homodimer of 17 kDa) for the central nervous systems extracts and approximately 19 kDa for in vitro central nervous system-translated RNA products were not recognized by the antiserum against MSEL- and VLDV-neurophysin (proteins associated to oxytocin and vasopressin), confirming that LORF did not belong to the oxytocin/vasopressin family.

  7. Purification of a NAD(P) reductase-like protein from the thermogenic appendix of the Sauromatum guttatum inflorescence.

    PubMed

    Skubatz, Hanna; Howald, William N

    2013-03-01

    A NAD(P) reductase-like protein with a molecular mass of 34.146 ± 34 Da was purified to homogeneity from the appendix of the inflorescence of the Sauromatum guttatum. On-line liquid chromatography/electrospray ionization-mass spectrometry was used to isolate and quantify the protein. For the identification of the protein, liquid chromatography/electrospray ionization-tandem mass spectrometry analysis of tryptic digests of the protein was carried out. The acquired mass spectra were used for database searching, which led to the identification of a single tryptic peptide. The 12 amino acid tryptic peptide (FLPSEFGNDVDR) was found to be identical to amino acid residues at the positions 108-120 of isoflavone reductase in the Arabidopsis genome. A BLAST search identified this sequence region as unique and specific to a class of NAD(P)-dependent reductases involved in phenylpropanoid biosynthesis. Edman degradation revealed that the protein was N-terminally blocked. The amount of the protein (termed RL, NAD(P) reductase-like protein) increased 60-fold from D-4 (4 days before inflorescence-opening, designated as D-day) to D-Day, and declined the following day, when heat-production ceased. When salicylic acid, the endogenous trigger of heat-production in the Sauromatum appendix, was applied to premature appendices, a fivefold decrease in the amount of RL was detected in the treated section relative to the non-treated section. About 40 % of RL was found in the cytoplasm. Another 30 % was detected in Percoll-purified mitochondria and the rest, about 30 % was associated with a low speed centrifugation pellet due to nuclei and amyloplast localization. RL was also found in other thermogenic plants and detected in Arabidopsis leaves. The function of RL in thermogenic and non-thermogenic plants requires further investigation.

  8. Novel haemoglobin-derived antimicrobial peptides from chicken (Gallus gallus) blood: purification, structural aspects and biological activity.

    PubMed

    Vasilchenko, A S; Rogozhin, E A; Vasilchenko, A V; Kartashova, O L; Sycheva, M V

    2016-12-01

    To purify and characterize antimicrobial peptides derived from the acid extract of Gallus gallus blood cells. Two polypeptides (i.e. CHb-1 and CHb-2) with antibacterial activity were detected in the acidic extract of blood cells from chicken (G. gallus). The isolated peptides that possessed a potent antibacterial activity were purified using a two-step chromatography procedure that involved solid-phase extraction of a total protein/peptide extract followed by thin fractionation by reversed-phase high performance liquid chromatography (RP-HPLC). The molecular masses of the purified peptides were similar and were 4824·4 and 4825·2 Da, which have been measured by matrix-assisted laser desorption/ionization mass spectrometry (MALDI TOF MS). Their amino acid sequences were determined by Edman degradation and showed that the peptides were fully identical to the two fragments of G. gallus α-haemoglobin localized into different subunits (A and D respectively). The peptides were active in micromolar concentrations against Gram-negative Escherichia coli K12 TG1. Using the 1-N-phenylnaphthylamine, the FITC-dextran labelled probes and the live/dead staining allowed to show the hemocidin mode of action and estimate the pore size. In this study, for the first time, α-haemoglobin from chicken (G. gallus) has been investigated as a donor of the two high homologous native peptide fragments that possess potent antibacterial activity in vitro. These are membrane-active peptides and their mechanism of action against E. coli involves a toroidal pore formation. The obtained results expand the perception of the role of haemoglobin in a living system, describing it as a source of multifunction substances. Additionally, the data presented in this paper may contribute to the development of new, cost-effective, antimicrobial agents. © 2016 The Society for Applied Microbiology.

  9. Structure Predictions of Two Bauhinia variegata Lectins Reveal Patterns of C-Terminal Properties in Single Chain Legume Lectins

    PubMed Central

    Moreira, Gustavo M. S. G.; Conceição, Fabricio R.; McBride, Alan J. A.; Pinto, Luciano da S.

    2013-01-01

    Bauhinia variegata lectins (BVL-I and BVL-II) are single chain lectins isolated from the plant Bauhinia variegata. Single chain lectins undergo post-translational processing on its N-terminal and C-terminal regions, which determines their physiological targeting, carbohydrate binding activity and pattern of quaternary association. These two lectins are isoforms, BVL-I being highly glycosylated, and thus far, it has not been possible to determine their structures. The present study used prediction and validation algorithms to elucidate the likely structures of BVL-I and -II. The program Bhageerath-H was chosen from among three different structure prediction programs due to its better overall reliability. In order to predict the C-terminal region cleavage sites, other lectins known to have this modification were analysed and three rules were created: (1) the first amino acid of the excised peptide is small or hydrophobic; (2) the cleavage occurs after an acid, polar, or hydrophobic residue, but not after a basic one; and (3) the cleavage spot is located 5-8 residues after a conserved Leu amino acid. These rules predicted that BVL-I and –II would have fifteen C-terminal residues cleaved, and this was confirmed experimentally by Edman degradation sequencing of BVL-I. Furthermore, the C-terminal analyses predicted that only BVL-II underwent α-helical folding in this region, similar to that seen in SBA and DBL. Conversely, BVL-I and -II contained four conserved regions of a GS-I association, providing evidence of a previously undescribed X4+unusual oligomerisation between the truncated BVL-I and the intact BVL-II. This is the first report on the structural analysis of lectins from Bauhinia spp. and therefore is important for the characterisation C-terminal cleavage and patterns of quaternary association of single chain lectins. PMID:24260572

  10. Structure predictions of two Bauhinia variegata lectins reveal patterns of C-terminal properties in single chain legume lectins.

    PubMed

    Moreira, Gustavo M S G; Conceição, Fabricio R; McBride, Alan J A; Pinto, Luciano da S

    2013-01-01

    Bauhinia variegata lectins (BVL-I and BVL-II) are single chain lectins isolated from the plant Bauhinia variegata. Single chain lectins undergo post-translational processing on its N-terminal and C-terminal regions, which determines their physiological targeting, carbohydrate binding activity and pattern of quaternary association. These two lectins are isoforms, BVL-I being highly glycosylated, and thus far, it has not been possible to determine their structures. The present study used prediction and validation algorithms to elucidate the likely structures of BVL-I and -II. The program Bhageerath-H was chosen from among three different structure prediction programs due to its better overall reliability. In order to predict the C-terminal region cleavage sites, other lectins known to have this modification were analysed and three rules were created: (1) the first amino acid of the excised peptide is small or hydrophobic; (2) the cleavage occurs after an acid, polar, or hydrophobic residue, but not after a basic one; and (3) the cleavage spot is located 5-8 residues after a conserved Leu amino acid. These rules predicted that BVL-I and -II would have fifteen C-terminal residues cleaved, and this was confirmed experimentally by Edman degradation sequencing of BVL-I. Furthermore, the C-terminal analyses predicted that only BVL-II underwent α-helical folding in this region, similar to that seen in SBA and DBL. Conversely, BVL-I and -II contained four conserved regions of a GS-I association, providing evidence of a previously undescribed X4+unusual oligomerisation between the truncated BVL-I and the intact BVL-II. This is the first report on the structural analysis of lectins from Bauhinia spp. and therefore is important for the characterisation C-terminal cleavage and patterns of quaternary association of single chain lectins.

  11. Social and behavioral research in genomic sequencing: approaches from the Clinical Sequencing Exploratory Research Consortium Outcomes and Measures Working Group.

    PubMed

    Gray, Stacy W; Martins, Yolanda; Feuerman, Lindsay Z; Bernhardt, Barbara A; Biesecker, Barbara B; Christensen, Kurt D; Joffe, Steven; Rini, Christine; Veenstra, David; McGuire, Amy L

    2014-10-01

    The routine use of genomic sequencing in clinical medicine has the potential to dramatically alter patient care and medical outcomes. To fully understand the psychosocial and behavioral impact of sequencing integration into clinical practice, it is imperative that we identify the factors that influence sequencing-related decision making and patient outcomes. In an effort to develop a collaborative and conceptually grounded approach to studying sequencing adoption, members of the National Human Genome Research Institute's Clinical Sequencing Exploratory Research Consortium formed the Outcomes and Measures Working Group. Here we highlight the priority areas of investigation and psychosocial and behavioral outcomes identified by the Working Group. We also review some of the anticipated challenges to measurement in social and behavioral research related to genomic sequencing; opportunities for instrument development; and the importance of qualitative, quantitative, and mixed-method approaches. This work represents the early, shared efforts of multiple research teams as we strive to understand individuals' experiences with genomic sequencing. The resulting body of knowledge will guide recommendations for the optimal use of sequencing in clinical practice.

  12. Probing the reactivity of nucleophile residues in human 2,3-diphosphoglycerate/deoxy-hemoglobin complex by aspecific chemical modifications.

    PubMed

    Scaloni, A; Ferranti, P; De Simone, G; Mamone, G; Sannolo, N; Malorni, A

    1999-06-11

    The use of aspecific methylation reaction in combination with MS procedures has been employed for the characterization of the nucleophilic residues present on the molecular surface of the human 2,3-diphosphoglycerate/deoxy-hemoglobin complex. In particular, direct molecular weight determinations by ESMS allowed to control the reaction conditions, limiting the number of methyl groups introduced in the modified globin chains. A combined LCESMS-Edman degradation approach for the analysis of the tryptic peptide mixtures yielded to the exact identification of methylation sites together with the quantitative estimation of their degree of modification. The reactivities observed were directly correlated with the pKa and the relative surface accessibility of the nucleophilic residues, calculated from the X-ray crystallographic structure of the protein. The results here described indicate that this methodology can be efficiently used in aspecific modification experiments directed to the molecular characterization of the surface topology in proteins and protein complexes.

  13. Structural characterization of thioether-bridged bacteriocins.

    PubMed

    Lohans, Christopher T; Vederas, John C

    2014-01-01

    Bacteriocins are a group of ribosomally synthesized antimicrobial peptides produced by bacteria, some of which are extensively post-translationally modified. Some bacteriocins, namely the lantibiotics and sactibiotics, contain one or more thioether bridges. However, these modifications complicate the structural elucidation of these bacteriocins using conventional techniques. This review will discuss the techniques and strategies that have been applied to determine the primary structures of lantibiotics and sactibiotics. A major challenge is to identify the topology of thioether bridges in these peptides (i.e., which amino-acid residues are involved in which bridges). Edman degradation, NMR spectroscopy and tandem MS have all been commonly applied to characterize these bacteriocins, but can be incompatible with the post-translational modifications present. Chemical modifications to the modified residues, such as desulfurization and reduction, make the treated bacteriocins more compatible to analysis by these standard peptide analytical techniques. Despite their differences in structure, similar strategies have proved useful to study the structures of both lantibiotics and sactibiotics.

  14. Monitoring abacavir bioactivation in humans: screening for an aldehyde metabolite.

    PubMed

    Grilo, Nádia M; Antunes, Alexandra M M; Caixas, Umbelina; Marinho, Aline T; Charneira, Catarina; Conceição Oliveira, M; Monteiro, Emília C; Matilde Marques, M; Pereira, Sofia A

    2013-05-10

    The anti-HIV drug abacavir is associated with idiosyncratic hypersensitivity reactions and cardiotoxicity. Although the mechanism underlying abacavir-toxicity is not fully understood, drug bioactivation to reactive metabolites may be involved. This work was aimed at identifying abacavir-protein adducts in the hemoglobin of HIV patients as biomarkers of abacavir bioactivation and protein modification. The protocol received prior approval from the Hospital Ethics Committee, patients gave their written informed consent and adherence was controlled through a questionnaire. Abacavir-derived Edman adducts with the N-terminal valine of hemoglobin were analyzed by an established liquid chromatography-electrospray ionization-tandem mass spectrometry method. Abacavir-valine adducts were detected in three out of ten patients. This work represents the first evidence of abacavir-protein adduct formation in humans. The data confirm the ability of abacavir to modify self-proteins and suggest that the molecular mechanism(s) of some abacavir-induced adverse reactions may require bioactivation. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  15. Porifera Lectins: Diversity, Physiological Roles and Biotechnological Potential.

    PubMed

    Gardères, Johan; Bourguet-Kondracki, Marie-Lise; Hamer, Bojan; Batel, Renato; Schröder, Heinz C; Müller, Werner E G

    2015-08-07

    An overview on the diversity of 39 lectins from the phylum Porifera is presented, including 38 lectins, which were identified from the class of demosponges, and one lectin from the class of hexactinellida. Their purification from crude extracts was mainly performed by using affinity chromatography and gel filtration techniques. Other protocols were also developed in order to collect and study sponge lectins, including screening of sponge genomes and expression in heterologous bacterial systems. The characterization of the lectins was performed by Edman degradation or mass spectrometry. Regarding their physiological roles, sponge lectins showed to be involved in morphogenesis and cell interaction, biomineralization and spiculogenesis, as well as host defense mechanisms and potentially in the association between the sponge and its microorganisms. In addition, these lectins exhibited a broad range of bioactivities, including modulation of inflammatory response, antimicrobial and cytotoxic activities, as well as anticancer and neuromodulatory activity. In view of their potential pharmacological applications, sponge lectins constitute promising molecules of biotechnological interest.

  16. Porifera Lectins: Diversity, Physiological Roles and Biotechnological Potential

    PubMed Central

    Gardères, Johan; Bourguet-Kondracki, Marie-Lise; Hamer, Bojan; Batel, Renato; Schröder, Heinz C.; Müller, Werner E. G.

    2015-01-01

    An overview on the diversity of 39 lectins from the phylum Porifera is presented, including 38 lectins, which were identified from the class of demosponges, and one lectin from the class of hexactinellida. Their purification from crude extracts was mainly performed by using affinity chromatography and gel filtration techniques. Other protocols were also developed in order to collect and study sponge lectins, including screening of sponge genomes and expression in heterologous bacterial systems. The characterization of the lectins was performed by Edman degradation or mass spectrometry. Regarding their physiological roles, sponge lectins showed to be involved in morphogenesis and cell interaction, biomineralization and spiculogenesis, as well as host defense mechanisms and potentially in the association between the sponge and its microorganisms. In addition, these lectins exhibited a broad range of bioactivities, including modulation of inflammatory response, antimicrobial and cytotoxic activities, as well as anticancer and neuromodulatory activity. In view of their potential pharmacological applications, sponge lectins constitute promising molecules of biotechnological interest. PMID:26262628

  17. Enrichment of target sequences for next-generation sequencing applications in research and diagnostics.

    PubMed

    Altmüller, Janine; Budde, Birgit S; Nürnberg, Peter

    2014-02-01

    Abstract Targeted re-sequencing such as gene panel sequencing (GPS) has become very popular in medical genetics, both for research projects and in diagnostic settings. The technical principles of the different enrichment methods have been reviewed several times before; however, new enrichment products are constantly entering the market, and researchers are often puzzled about the requirement to take decisions about long-term commitments, both for the enrichment product and the sequencing technology. This review summarizes important considerations for the experimental design and provides helpful recommendations in choosing the best sequencing strategy for various research projects and diagnostic applications.

  18. The sequence of sequencers: The history of sequencing DNA

    PubMed Central

    Heather, James M.; Chain, Benjamin

    2016-01-01

    Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. PMID:26554401

  19. Recent research on the high-probability instructional sequence: A brief review.

    PubMed

    Lipschultz, Joshua; Wilder, David A

    2017-04-01

    The high-probability (high-p) instructional sequence consists of the delivery of a series of high-probability instructions immediately before delivery of a low-probability or target instruction. It is commonly used to increase compliance in a variety of populations. Recent research has described variations of the high-p instructional sequence and examined the conditions under which the sequence is most effective. This manuscript reviews the most recent research on the sequence and identifies directions for future research. Recommendations for practitioners regarding the use of the high-p instructional sequence are also provided. © 2017 Society for the Experimental Analysis of Behavior.

  20. The sequence of sequencers: The history of sequencing DNA.

    PubMed

    Heather, James M; Chain, Benjamin

    2016-01-01

    Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  1. “Shovel-ready” Sequences as a Stimulus for the Next Generation of Life Scientists

    PubMed Central

    Boyle, Michael D.

    2010-01-01

    Genomics and bioinformatics are dynamic fields well-suited for capturing the imagination of undergraduates in both research laboratories and classrooms. Currently, raw nucleotide sequence is being provided, as part of several genomics research initiatives, for undergraduate research and teaching. These initiatives could be easily extended and much more effective if the source of the sequenced material and the subsequent focus of the data analysis were aligned with the research interests of individual faculty at undergraduate institutions. By judicious use of surplus capacity in existing nucleotide sequencing cores, raw sequence data could be generated to support ongoing research efforts involving undergraduates. This would allow these students to participate actively in discovery research, with a goal of making novel contributions to their field through original research while nurturing the next generation of talented research scientists. PMID:23653696

  2. "Shovel-ready" Sequences as a Stimulus for the Next Generation of Life Scientists.

    PubMed

    Boyle, Michael D

    2010-01-01

    Genomics and bioinformatics are dynamic fields well-suited for capturing the imagination of undergraduates in both research laboratories and classrooms. Currently, raw nucleotide sequence is being provided, as part of several genomics research initiatives, for undergraduate research and teaching. These initiatives could be easily extended and much more effective if the source of the sequenced material and the subsequent focus of the data analysis were aligned with the research interests of individual faculty at undergraduate institutions. By judicious use of surplus capacity in existing nucleotide sequencing cores, raw sequence data could be generated to support ongoing research efforts involving undergraduates. This would allow these students to participate actively in discovery research, with a goal of making novel contributions to their field through original research while nurturing the next generation of talented research scientists.

  3. (+)-Meyeniins A-C, Novel Hexahydroimidazo[1,5-c]thiazole Derivatives from the Tubers of Lepidium meyenii: Complete Structural Elucidation by Biomimetic Synthesis and Racemic Crystallization.

    PubMed

    Zhou, Min; Ma, Hang-Ying; Liu, Zhi-Hua; Yang, Guang-Yu; Du, Gang; Ye, Yan-Qing; Li, Gan-Peng; Hu, Qiu-Fen

    2017-03-08

    (+)-Meyeniins A-C (1-3), a novel class of sulfur-containing hexahydroimidazo[1,5-c]thiazole derivatives, were isolated from the tubers of Lepidium meyenii (maca) cultivated in Lijiang, Yunnan province, China. Guided by their biosynthetic hypothesis, a stereocontrolled biomimetic synthesis of meyeniins A-C and their individual enantiomers was efficiently accomplished by a combination of a condensation reaction and Edman degradation. The formation of high-quality crystals for X-ray crystallography occurred much more readily from a racemic mixture of (±)-meyeniin A than with the single enantiomer alone in this case. These extensive strategies, combined with circular dichroism (CD) spectra, allowed the complete structural assignments of (+)-meyeniins A-C. Among them, (+)-meyeniin A showed moderate selective cytotoxicities against the HL-60, A549 and MCF-7 human cell lines with IC 50 values of 14.41, 32.22, and 33.14 μM, respectively. To some extent, these findings support traditional applications of maca as healthy nutritional supplements or functional foods for cancer prevention.

  4. Evaluation of IgE reactivity of active and thermally inactivated actinidin, a biomarker of kiwifruit allergy.

    PubMed

    Grozdanovic, Milica; Popovic, Milica; Polovic, Natalija; Burazer, Lidija; Vuckovic, Olga; Atanaskovic-Markovic, Marina; Lindner, Buko; Petersen, Arnd; Gavrovic-Jankulovic, Marija

    2012-03-01

    Actinidin, an abundant cysteine protease from kiwifruit, is a specific biomarker of isolated allergy to kiwifruit. This study evaluates the IgE-binding properties of biologically active and thermally inactivated actinidin. Employing two different activity assays (caseinolytic assay and zymogram with gelatin) we showed that actinidin obtained from kiwifruit extract under native conditions represents a mixture of inactive and active enzyme. The structural integrity of actinidin was confirmed by SDS-PAGE, Edman degradation, mass fingerprint and Western blot with polyclonal antibodies. Although it was capable of inducing positive skin prick test reactions, we failed to detect IgE reactivity of active actinidin in Western blot with patient sera. Thermally inactivated actinidin exhibited IgE reactivity both in vivo and in vitro, indicating that heat processed kiwifruit products may induce clinical reactivity. These findings imply that apart from the allergenic epitopes on its surface, actinidin also contains hidden epitopes inside the protein which become accessible to IgE upon thermal treatment. Copyright © 2011 Elsevier Ltd. All rights reserved.

  5. Acidic α-galactosidase is the most abundant nectarin in floral nectar of common tobacco (Nicotiana tabacum)

    PubMed Central

    Zha, Hong-Guang; Flowers, V. Lynn; Yang, Min; Chen, Ling-Yang; Sun, Hang

    2012-01-01

    Background and Aims To date, most floral nectarins (nectar proteins) are reported to function in nectar defence, particularly for insect-pollinated outcrossing species. We compared nectarin composition and abundance in selfing common tobacco (Nicotiana tobaccum) with outcrossing ornamental tobacco plants to elucidate the functional difference of nectarins in different reproductive systems. Methods Common tobacco (CT) nectarins were separated by SDS-PAGE and the N terminus of the most abundant nectarin was sequenced via Edman degradation. The full-length nectarin gene was amplified and cloned from genomic DNA and mRNA with hiTail-PCR and RACE (rapid amplification of cDNA ends), and expression patterns were then investigated in different tissues using semi-quantitative reverse transcriptase PCR. Additionally, high-performance liquid chromatography and enzymatic analyses of nectar sugar composition, and other biochemical traits and functions of the novel nectarin were studied. Key Results The most abundant nectarin in CT nectar is an acidic α-galactosidase, here designated NTα-Gal. This compound has a molecular mass of 40 013 Da and a theoretical pI of 5·33. NTα-Gal has a conserved α-Gal characteristic signature, encodes a mature protein of 364 amino acids and is expressed in different organs. Compared with 27 other melliferous plant species from different families, CT floral nectar demonstrated the highest α-Gal activity, which is inhibited by d-galactose. Raffinose family oligosaccharides were not detected in CT nectar, indicating that NTα-Gal does not function in post-secretory hydrolysis. Moreover, tobacco plant fruits did not develop intact skin with galactose inhibition of NTα-Gal activity in nectar, suggesting that NTα-Gal induces cell-wall surface restructuring during the initial stages of fruit development. Conclusions α-Gal was the most abundant nectarin in selfing CT plants, but was not detected in the nectar of strictly outcrossing sister tobacco species. No function was demonstrated in antimicrobial defence. Therefore, floral nectarins in selfing species maintain their functional significance in reproductive organ development. PMID:22271925

  6. Daboxin P, a Major Phospholipase A2 Enzyme from the Indian Daboia russelii russelii Venom Targets Factor X and Factor Xa for Its Anticoagulant Activity

    PubMed Central

    Iyer, Janaki Krishnamurthy; Shih, Norrapat; Majumder, Munmi; Mattaparthi, Venkata Satish Kumar; Mukhopadhyay, Rupak; Doley, Robin

    2016-01-01

    In the present study a major protein has been purified from the venom of Indian Daboia russelii russelii using gel filtration, ion exchange and Rp-HPLC techniques. The purified protein, named daboxin P accounts for ~24% of the total protein of the crude venom and has a molecular mass of 13.597 kDa. It exhibits strong anticoagulant and phospholipase A2 activity but is devoid of any cytotoxic effect on the tested normal or cancerous cell lines. Its primary structure was deduced by N-terminal sequencing and chemical cleavage using Edman degradation and tandem mass spectrometry. It is composed of 121 amino acids with 14 cysteine residues and catalytically active His48 -Asp49 pair. The secondary structure of daboxin P constitutes 42.73% of α-helix and 12.36% of β-sheet. It is found to be stable at acidic (pH 3.0) and neutral pH (pH 7.0) and has a Tm value of 71.59 ± 0.46°C. Daboxin P exhibits anticoagulant effect under in-vitro and in-vivo conditions. It does not inhibit the catalytic activity of the serine proteases but inhibits the activation of factor X to factor Xa by the tenase complexes both in the presence and absence of phospholipids. It also inhibits the tenase complexes when active site residue (His48) was alkylated suggesting its non-enzymatic mode of anticoagulant activity. Moreover, it also inhibits prothrombinase complex when pre-incubated with factor Xa prior to factor Va addition. Fluorescence emission spectroscopy and affinity chromatography suggest the probable interaction of daboxin P with factor X and factor Xa. Molecular docking analysis reveals the interaction of the Ca+2 binding loop; helix C; anticoagulant region and C-terminal region of daboxin P with the heavy chain of factor Xa. This is the first report of a phospholipase A2 enzyme from Indian viper venom which targets both factor X and factor Xa for its anticoagulant activity. PMID:27089306

  7. Okadaic acid-induced, naringin-sensitive phosphorylation of glycine N-methyltransferase in isolated rat hepatocytes.

    PubMed Central

    Møller, Michael T N; Samari, Hamid R; Fengsrud, Monica; Strømhaug, Per E; øStvold, Anne C; Seglen, Per O

    2003-01-01

    Glycine N-methyltransferase (GNMT) is an abundant cytosolic enzyme that catalyses the methylation of glycine into sarcosine, coupled with conversion of the methyl donor, S -adenosylmethionine (AdoMet), into S -adenosylhomocysteine (AdoHcy). GNMT is believed to play a role in monitoring the AdoMet/AdoHcy ratio, and hence the cellular methylation capacity, but regulation of the enzyme itself is not well understood. In the present study, treatment of isolated rat hepatocytes with the protein phosphatase inhibitor okadaic acid, was found to induce an overphosphorylation of GNMT, as shown by proteomic analysis. The analysis comprised two-dimensional gel electrophoretic separation of (32)P-labelled phosphoproteins and identification of individual protein spots by matrix-assisted laser-desorption ionization-time-of-flight mass spectrometry. The identity of GNMT was verified by N-terminal Edman sequencing of tryptic peptides. Chromatographic separation of proteolytic peptides and (32)P-labelled amino acids suggested that GNMT was phosphorylated within a limited region, and only at serine residues. GNMT phosphorylation could be suppressed by naringin, an okadaic acid-antagonistic flavonoid. To assess the possible functional role of GNMT phosphorylation, the effect of okadaic acid on hepatocytic AdoMet and AdoHcy levels was examined, using HPLC separation for metabolite analysis. Surprisingly, okadaic acid was found to have no effect on the basal levels of AdoMet or AdoHcy. An accelerated AdoMet-AdoHcy flux, induced by the addition of methionine (1 mM), was likewise unaffected by okadaic acid. 5-Aminoimidazole-4-carboxamide riboside, an activator of the hepatocytic AMP-activated protein kinase, similarly induced GNMT phosphorylation without affecting AdoMet and AdoHcy levels. Activation of cAMP-dependent protein kinase by dibutyryl-cAMP, reported to cause GNMT phosphorylation under cell-free conditions, also had little effect on hepatocytic AdoMet and AdoHcy levels. Phosphorylation of GNMT would thus seem to play no role in regulation of the intracellular AdoMet/AdoHcy ratio, but could be involved in other GNMT functions, such as the binding of folates or aromatic hydrocarbons. PMID:12697024

  8. Tryptic digestion of human GPIIIa. Isolation and biochemical characterization of the 23 kDa N-terminal glycopeptide carrying the antigenic determinant for a monoclonal antibody (P37) which inhibits platelet aggregation.

    PubMed Central

    Calvete, J J; Rivas, G; Maruri, M; Alvarez, M V; McGregor, J L; Hew, C L; Gonzalez-Rodriguez, J

    1988-01-01

    Early digestion of pure human platelet glycoprotein IIIa (GPIIIa) leads to a single cleavage of the molecule at 23 kDa far from one of the terminal amino acids. Automated Edman degradation demonstrates that GPIIIa and the smaller (23 kDa) tryptic fragment share the same N-terminal amino acid sequence. A further cleavage occurs in the larger fragment (80 kDa), reducing its apparent molecular mass by 10 kDa. The 23 kDa fragment remains attached to the larger ones in unreduced samples. Stepwise reduction of early digested GPIIIa with dithioerythritol selectively reduces the single disulphide bond joining the smaller (23 kDa) to the larger (80/70 kDa) fragments. Two fractions were obtained by size-exclusion chromatography of early digested GPIIIa after partial or full reduction and alkylation. The larger-size fraction contains the 80/70 kDa fragments, while the 23 kDa fragment is isolated in the smaller. The amino acid compositions of these fractions do not differ very significantly from the composition of GPIIIa; however the 23 kDa fragment contains only 10.2% by weight of sugars and is richer in neuraminic acid. Disulphide bonds are distributed four in the 23 kDa glycopeptide and 20-21 in the 80/70 kDa glycopeptide. The epitope for P37, a monoclonal antibody which inhibits platelet aggregation [Melero & González-Rodríguez (1984) Eur. J. Biochem. 141, 421-427] is situated within the first 17 kDa of the N-terminal region of GPIIIa, which gives a special functional interest to this extracellular region of GPIIIa. On the other hand, the epitopes for GPIIIa-specific monoclonal antibodies, P6, P35, P40 and P97, which do not interfere with platelet aggregation, are located within the larger tryptic fragment (80/70 kDa). Thus, the antigenic areas available in the extracellular surface of GPIIIa for these five monoclonal antibodies are now more precisely delineated. Images Fig. 1. Fig. 2. Fig. 3. Fig. 4. PMID:2455507

  9. Identification of tyrosine phosphorylation sites in human Gab-1 protein by EGF receptor kinase in vitro.

    PubMed

    Lehr, S; Kotzka, J; Herkner, A; Klein, E; Siethoff, C; Knebel, B; Noelle, V; Brüning, J C; Klein, H W; Meyer, H E; Krone, W; Müller-Wieland, D

    1999-01-05

    Grb2-associated binder-1 (Gab-1) has been identified recently in a cDNA library of glioblastoma tumors and appears to play a central role in cellular growth response, transformation, and apoptosis. Structural and functional features indicate that Gab-1 is a multisubstrate docking protein downstream in the signaling pathways of different receptor tyrosine kinases, including the epidermal growth factor receptor (EGFR). Therefore, the aim of the study was to characterize the phosphorylation of recombinant human Gab-1 (hGab-1) protein by EGFR in vitro. Using the pGEX system to express the entire protein and different domains of hGab-1 as glutathione S-transferase proteins, kinetic data for phosphorylation of these proteins by wheat germ agglutinine-purified EGFR and the recombinant EGFR (rEGFR) receptor kinase domain were determined. Our data revealed similar affinities of hGab-1-C for both receptor preparations (KM = 2.7 microM for rEGFR vs 3.2 microM for WGA EGFR) as well as for the different recombinant hGab-1 domains. To identify the specific EGFR phosphorylation sites, hGab-1-C was sequenced by Edman degradation and mass spectrometry. The entire protein was phosphorylated by rEGFR at eight tyrosine residues (Y285, Y373, Y406, Y447, Y472, Y619, Y657, and Y689). Fifty percent of the identified radioactivity was incorporated in tyrosine Y657 as the predominant peak in HPLC analysis, a site exhibiting features of a potential Syp (PTP1D) binding site. Accordingly, GST-pull down assays with A431 and HepG2 cell lysates showed that phosphorylated intact hGab-1 was able to bind Syp. This binding appears to be specific, because it was abolished by changing the Y657 of hGab-1 to F657. These results demonstrate that hGab-1 is a high-affinity substrate for the EGFR and the major tyrosine phosphorylation site Y657 in the C terminus is a specific binding site for the tyrosine phosphatase Syp.

  10. Sequencing at sea: challenges and experiences in Ion Torrent PGM sequencing during the 2013 Southern Line Islands Research Expedition

    PubMed Central

    Lim, Yan Wei; Cuevas, Daniel A.; Silva, Genivaldo Gueiros Z.; Aguinaldo, Kristen; Dinsdale, Elizabeth A.; Haas, Andreas F.; Hatay, Mark; Sanchez, Savannah E.; Wegley-Kelly, Linda; Dutilh, Bas E.; Harkins, Timothy T.; Lee, Clarence C.; Tom, Warren; Sandin, Stuart A.; Smith, Jennifer E.; Zgliczynski, Brian; Vermeij, Mark J.A.; Rohwer, Forest

    2014-01-01

    Genomics and metagenomics have revolutionized our understanding of marine microbial ecology and the importance of microbes in global geochemical cycles. However, the process of DNA sequencing has always been an abstract extension of the research expedition, completed once the samples were returned to the laboratory. During the 2013 Southern Line Islands Research Expedition, we started the first effort to bring next generation sequencing to some of the most remote locations on our planet. We successfully sequenced twenty six marine microbial genomes, and two marine microbial metagenomes using the Ion Torrent PGM platform on the Merchant Yacht Hanse Explorer. Onboard sequence assembly, annotation, and analysis enabled us to investigate the role of the microbes in the coral reef ecology of these islands and atolls. This analysis identified phosphonate as an important phosphorous source for microbes growing in the Line Islands and reinforced the importance of L-serine in marine microbial ecosystems. Sequencing in the field allowed us to propose hypotheses and conduct experiments and further sampling based on the sequences generated. By eliminating the delay between sampling and sequencing, we enhanced the productivity of the research expedition. By overcoming the hurdles associated with sequencing on a boat in the middle of the Pacific Ocean we proved the flexibility of the sequencing, annotation, and analysis pipelines. PMID:25177534

  11. Sequencing at sea: challenges and experiences in Ion Torrent PGM sequencing during the 2013 Southern Line Islands Research Expedition.

    PubMed

    Lim, Yan Wei; Cuevas, Daniel A; Silva, Genivaldo Gueiros Z; Aguinaldo, Kristen; Dinsdale, Elizabeth A; Haas, Andreas F; Hatay, Mark; Sanchez, Savannah E; Wegley-Kelly, Linda; Dutilh, Bas E; Harkins, Timothy T; Lee, Clarence C; Tom, Warren; Sandin, Stuart A; Smith, Jennifer E; Zgliczynski, Brian; Vermeij, Mark J A; Rohwer, Forest; Edwards, Robert A

    2014-01-01

    Genomics and metagenomics have revolutionized our understanding of marine microbial ecology and the importance of microbes in global geochemical cycles. However, the process of DNA sequencing has always been an abstract extension of the research expedition, completed once the samples were returned to the laboratory. During the 2013 Southern Line Islands Research Expedition, we started the first effort to bring next generation sequencing to some of the most remote locations on our planet. We successfully sequenced twenty six marine microbial genomes, and two marine microbial metagenomes using the Ion Torrent PGM platform on the Merchant Yacht Hanse Explorer. Onboard sequence assembly, annotation, and analysis enabled us to investigate the role of the microbes in the coral reef ecology of these islands and atolls. This analysis identified phosphonate as an important phosphorous source for microbes growing in the Line Islands and reinforced the importance of L-serine in marine microbial ecosystems. Sequencing in the field allowed us to propose hypotheses and conduct experiments and further sampling based on the sequences generated. By eliminating the delay between sampling and sequencing, we enhanced the productivity of the research expedition. By overcoming the hurdles associated with sequencing on a boat in the middle of the Pacific Ocean we proved the flexibility of the sequencing, annotation, and analysis pipelines.

  12. Applications of nanotechnology, next generation sequencing and microarrays in biomedical research.

    PubMed

    Elingaramil, Sauli; Li, Xiaolong; He, Nongyue

    2013-07-01

    Next-generation sequencing technologies, microarrays and advances in bio nanotechnology have had an enormous impact on research within a short time frame. This impact appears certain to increase further as many biomedical institutions are now acquiring these prevailing new technologies. Beyond conventional sampling of genome content, wide-ranging applications are rapidly evolving for next-generation sequencing, microarrays and nanotechnology. To date, these technologies have been applied in a variety of contexts, including whole-genome sequencing, targeted re sequencing and discovery of transcription factor binding sites, noncoding RNA expression profiling and molecular diagnostics. This paper thus discusses current applications of nanotechnology, next-generation sequencing technologies and microarrays in biomedical research and highlights the transforming potential these technologies offer.

  13. The Genome Sequencer FLX System--longer reads, more applications, straight forward bioinformatics and more complete data sets.

    PubMed

    Droege, Marcus; Hill, Brendon

    2008-08-31

    The Genome Sequencer FLX System (GS FLX), powered by 454 Sequencing, is a next-generation DNA sequencing technology featuring a unique mix of long reads, exceptional accuracy, and ultra-high throughput. It has been proven to be the most versatile of all currently available next-generation sequencing technologies, supporting many high-profile studies in over seven applications categories. GS FLX users have pursued innovative research in de novo sequencing, re-sequencing of whole genomes and target DNA regions, metagenomics, and RNA analysis. 454 Sequencing is a powerful tool for human genetics research, having recently re-sequenced the genome of an individual human, currently re-sequencing the complete human exome and targeted genomic regions using the NimbleGen sequence capture process, and detected low-frequency somatic mutations linked to cancer.

  14. Glutenite bodies sequence division of the upper Es4 in northern Minfeng zone of Dongying Sag, Bohai Bay Basin, China

    NASA Astrophysics Data System (ADS)

    Shao, Xupeng

    2017-04-01

    Glutenite bodies are widely developed in northern Minfeng zone of Dongying Sag. Their litho-electric relationship is not clear. In addition, as the conventional sequence stratigraphic research method drawbacks of involving too many subjective human factors, it has limited deepening of the regional sequence stratigraphic research. The wavelet transform technique based on logging data and the time-frequency analysis technique based on seismic data have advantages of dividing sequence stratigraphy quantitatively comparing with the conventional methods. Under the basis of the conventional sequence research method, this paper used the above techniques to divide the fourth-order sequence of the upper Es4 in northern Minfeng zone of Dongying Sag. The research shows that the wavelet transform technique based on logging data and the time-frequency analysis technique based on seismic data are essentially consistent, both of which divide sequence stratigraphy quantitatively in the frequency domain; wavelet transform technique has high resolutions. It is suitable for areas with wells. The seismic time-frequency analysis technique has wide applicability, but a low resolution. Both of the techniques should be combined; the upper Es4 in northern Minfeng zone of Dongying Sag is a complete set of third-order sequence, which can be further subdivided into 5 fourth-order sequences that has the depositional characteristics of fine-upward sequence in granularity. Key words: Dongying sag, northern Minfeng zone, wavelet transform technique, time-frequency analysis technique ,the upper Es4, sequence stratigraphy

  15. DNA sequencing using polymerase substrate-binding kinetics

    PubMed Central

    Previte, Michael John Robert; Zhou, Chunhong; Kellinger, Matthew; Pantoja, Rigo; Chen, Cheng-Yao; Shi, Jin; Wang, BeiBei; Kia, Amirali; Etchin, Sergey; Vieceli, John; Nikoomanzar, Ali; Bomati, Erin; Gloeckner, Christian; Ronaghi, Mostafa; He, Molly Min

    2015-01-01

    Next-generation sequencing (NGS) has transformed genomic research by decreasing the cost of sequencing. However, whole-genome sequencing is still costly and complex for diagnostics purposes. In the clinical space, targeted sequencing has the advantage of allowing researchers to focus on specific genes of interest. Routine clinical use of targeted NGS mandates inexpensive instruments, fast turnaround time and an integrated and robust workflow. Here we demonstrate a version of the Sequencing by Synthesis (SBS) chemistry that potentially can become a preferred targeted sequencing method in the clinical space. This sequencing chemistry uses natural nucleotides and is based on real-time recording of the differential polymerase/DNA-binding kinetics in the presence of correct or mismatch nucleotides. This ensemble SBS chemistry has been implemented on an existing Illumina sequencing platform with integrated cluster amplification. We discuss the advantages of this sequencing chemistry for targeted sequencing as well as its limitations for other applications. PMID:25612848

  16. Microbes, metagenomes and marine mammals: enabling the next generation of scientist to enter the genomic era

    PubMed Central

    2013-01-01

    Background The revolution in DNA sequencing technology continues unabated, and is affecting all aspects of the biological and medical sciences. The training and recruitment of the next generation of researchers who are able to use and exploit the new technology is severely lacking and potentially negatively influencing research and development efforts to advance genome biology. Here we present a cross-disciplinary course that provides undergraduate students with practical experience in running a next generation sequencing instrument through to the analysis and annotation of the generated DNA sequences. Results Many labs across world are installing next generation sequencing technology and we show that the undergraduate students produce quality sequence data and were excited to participate in cutting edge research. The students conducted the work flow from DNA extraction, library preparation, running the sequencing instrument, to the extraction and analysis of the data. They sequenced microbes, metagenomes, and a marine mammal, the Californian sea lion, Zalophus californianus. The students met sequencing quality controls, had no detectable contamination in the targeted DNA sequences, provided publication quality data, and became part of an international collaboration to investigate carcinomas in carnivores. Conclusions Students learned important skills for their future education and career opportunities, and a perceived increase in students’ ability to conduct independent scientific research was measured. DNA sequencing is rapidly expanding in the life sciences. Teaching undergraduates to use the latest technology to sequence genomic DNA ensures they are ready to meet the challenges of the genomic era and allows them to participate in annotating the tree of life. PMID:24007365

  17. The Human Microbiome and Understanding the 16S rRNA Gene in Translational Nursing Science.

    PubMed

    Ames, Nancy J; Ranucci, Alexandra; Moriyama, Brad; Wallen, Gwenyth R

    As more is understood regarding the human microbiome, it is increasingly important for nurse scientists and healthcare practitioners to analyze these microbial communities and their role in health and disease. 16S rRNA sequencing is a key methodology in identifying these bacterial populations that has recently transitioned from use primarily in research to having increased utility in clinical settings. The objectives of this review are to (a) describe 16S rRNA sequencing and its role in answering research questions important to nursing science; (b) provide an overview of the oral, lung, and gut microbiomes and relevant research; and (c) identify future implications for microbiome research and 16S sequencing in translational nursing science. Sequencing using the 16S rRNA gene has revolutionized research and allowed scientists to easily and reliably characterize complex bacterial communities. This type of research has recently entered the clinical setting, one of the best examples involving the use of 16S sequencing to identify resistant pathogens, thereby improving the accuracy of bacterial identification in infection control. Clinical microbiota research and related requisite methods are of particular relevance to nurse scientists-individuals uniquely positioned to utilize these techniques in future studies in clinical settings.

  18. Design and implementation of a laboratory-based drug design and synthesis advanced pharmacy practice experience.

    PubMed

    Philip, Ashok; Stephens, Mark; Mitchell, Sheila L; Watkins, E Blake

    2015-04-25

    To provide students with an opportunity to participate in medicinal chemistry research within the doctor of pharmacy (PharmD) curriculum. We designed and implemented a 3-course sequence in drug design or drug synthesis for pharmacy students consisting of a 1-month advanced elective followed by two 1-month research advanced pharmacy practice experiences (APPEs). To maximize student involvement, this 3-course sequence was offered to third-year and fourth-year students twice per calendar year. Students were evaluated based on their commitment to the project's success, productivity, and professionalism. Students also evaluated the course sequence using a 14-item course evaluation rubric. Student feedback was overwhelmingly positive. Students found the experience to be a valuable component of their pharmacy curriculum. We successfully designed and implemented a 3-course research sequence that allows PharmD students in the traditional 4-year program to participate in drug design and synthesis research. Students report the sequence enhanced their critical-thinking and problem-solving skills and helped them develop as independent learners. Based on the success achieved with this sequence, efforts are underway to develop research APPEs in other areas of the pharmaceutical sciences.

  19. Illustrative case studies in the return of exome and genome sequencing results

    PubMed Central

    Amendola, Laura M; Lautenbach, Denise; Scollon, Sarah; Bernhardt, Barbara; Biswas, Sawona; East, Kelly; Everett, Jessica; Gilmore, Marian J; Himes, Patricia; Raymond, Victoria M; Wynn, Julia; Hart, Ragan; Jarvik, Gail P

    2015-01-01

    Whole genome and exome sequencing tests are increasingly being ordered in clinical practice, creating a need for research exploring the return of results from these tests. A goal of the Clinical Sequencing and Exploratory Research (CSER) consortium is to gain experience with this process to develop best practice recommendations for offering exome and genome testing and returning results. Genetic counselors in the CSER consortium have an integral role in the return of results from these genomic sequencing tests and have gained valuable insight. We present seven emerging themes related to return of exome and genome sequencing results accompanied by case descriptions illustrating important lessons learned, counseling challenges specific to these tests and considerations for future research and practice. PMID:26478737

  20. EPSE Project 2: Designing and Evaluating Short Teaching Sequences, Informed by Research Evidence.

    ERIC Educational Resources Information Center

    Leach, John; Hind, Andy; Lewis, Jenny; Scott, Phil

    2002-01-01

    Reports on Project 2 from the Evidence-based Practice in Science Education (EPSE) Research Network. In this project, teachers and researchers worked collaboratively on the design of three short teaching sequences on electric circuits. (DDR)

  1. It’s More Than Stamp Collecting: How Genome Sequencing Can Unify Biological Research

    PubMed Central

    Richards, Stephen

    2015-01-01

    The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, whilst the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to “Big Science” survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. PMID:26003218

  2. It's more than stamp collecting: how genome sequencing can unify biological research.

    PubMed

    Richards, Stephen

    2015-07-01

    The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, while the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to 'big science' survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. Self-consistency tests of large-scale dynamics parameterizations for single-column modeling

    DOE PAGES

    Edman, Jacob P.; Romps, David M.

    2015-03-18

    Large-scale dynamics parameterizations are tested numerically in cloud-resolving simulations, including a new version of the weak-pressure-gradient approximation (WPG) introduced by Edman and Romps (2014), the weak-temperature-gradient approximation (WTG), and a prior implementation of WPG. We perform a series of self-consistency tests with each large-scale dynamics parameterization, in which we compare the result of a cloud-resolving simulation coupled to WTG or WPG with an otherwise identical simulation with prescribed large-scale convergence. In self-consistency tests based on radiative-convective equilibrium (RCE; i.e., no large-scale convergence), we find that simulations either weakly coupled or strongly coupled to either WPG or WTG are self-consistent, butmore » WPG-coupled simulations exhibit a nonmonotonic behavior as the strength of the coupling to WPG is varied. We also perform self-consistency tests based on observed forcings from two observational campaigns: the Tropical Warm Pool International Cloud Experiment (TWP-ICE) and the ARM Southern Great Plains (SGP) Summer 1995 IOP. In these tests, we show that the new version of WPG improves upon prior versions of WPG by eliminating a potentially troublesome gravity-wave resonance.« less

  4. The Human Microbiome and Understanding the 16S rRNA Gene in Translational Nursing Science

    PubMed Central

    Ames, Nancy J.; Ranucci, Alexandra; Moriyama, Brad; Wallen, Gwenyth R.

    2017-01-01

    Background As more is understood regarding the human microbiome, it is increasingly important for nurse scientists and health care practitioners to analyze these microbial communities and their role in health and disease.16S rRNA sequencing is a key methodology in identifying these bacterial populations that has recently transitioned from use primarily in research to having increased utility in clinical settings. Objectives The objectives of this review are to: (a) describe 16S rRNA sequencing and its role in answering research questions important to nursing science; (b) provide an overview of the oral, lung and gut microbiomes and relevant research; and (c) identify future implications for microbiome research and 16S sequencing in translational nursing science. Discussion Sequencing using the 16S rRNA gene has revolutionized research and allowed scientists to easily and reliably characterize complex bacterial communities. This type of research has recently entered the clinical setting, one of the best examples involving the use of 16S sequencing to identify resistant pathogens, thereby improving the accuracy of bacterial identification in infection control. Clinical microbiota research and related requisite methods are of particular relevance to nurse scientists—individuals uniquely positioned to utilize these techniques in future studies in clinical settings. PMID:28252578

  5. Multiplexed fragaria chloroplast genome sequencing

    Treesearch

    W. Njuguna; A. Liston; R. Cronn; N.V. Bassil

    2010-01-01

    A method to sequence multiple chloroplast genomes using ultra high throughput sequencing technologies was recently described. Complete chloroplast genome sequences can resolve phylogenetic relationships at low taxonomic levels and identify informative point mutations and indels. The objective of this research was to sequence multiple Fragaria...

  6. Researches on Transcriptome Sequencing in the Study of Traditional Chinese Medicine

    PubMed Central

    Xin, Jie; Zhang, Rong-chao; Wang, Lei

    2017-01-01

    Due to its incomparable advantages, the application of transcriptome sequencing in the study of traditional Chinese medicine attracts more and more attention of researchers, which greatly promote the development of traditional Chinese medicine. In this paper, the applications of transcriptome sequencing in traditional Chinese medicine were summarized by reviewing recent related papers. PMID:28900463

  7. APPLaUD: access for patients and participants to individual level uninterpreted genomic data.

    PubMed

    Thorogood, Adrian; Bobe, Jason; Prainsack, Barbara; Middleton, Anna; Scott, Erick; Nelson, Sarah; Corpas, Manuel; Bonhomme, Natasha; Rodriguez, Laura Lyman; Murtagh, Madeleine; Kleiderman, Erika

    2018-02-17

    There is a growing support for the stance that patients and research participants should have better and easier access to their raw (uninterpreted) genomic sequence data in both clinical and research contexts. We review legal frameworks and literature on the benefits, risks, and practical barriers of providing individuals access to their data. We also survey genomic sequencing initiatives that provide or plan to provide individual access. Many patients and research participants expect to be able to access their health and genomic data. Individuals have a legal right to access their genomic data in some countries and contexts. Moreover, increasing numbers of participatory research projects, direct-to-consumer genetic testing companies, and now major national sequencing initiatives grant individuals access to their genomic sequence data upon request. Drawing on current practice and regulatory analysis, we outline legal, ethical, and practical guidance for genomic sequencing initiatives seeking to offer interested patients and participants access to their raw genomic data.

  8. [Methods, challenges and opportunities for big data analyses of microbiome].

    PubMed

    Sheng, Hua-Fang; Zhou, Hong-Wei

    2015-07-01

    Microbiome is a novel research field related with a variety of chronic inflamatory diseases. Technically, there are two major approaches to analysis of microbiome: metataxonome by sequencing the 16S rRNA variable tags, and metagenome by shot-gun sequencing of the total microbial (mainly bacterial) genome mixture. The 16S rRNA sequencing analyses pipeline includes sequence quality control, diversity analyses, taxonomy and statistics; metagenome analyses further includes gene annotation and functional analyses. With the development of the sequencing techniques, the cost of sequencing will decrease, and big data analyses will become the central task. Data standardization, accumulation, modeling and disease prediction are crucial for future exploit of these data. Meanwhile, the information property in these data, and the functional verification with culture-dependent and culture-independent experiments remain the focus in future research. Studies of human microbiome will bring a better understanding of the relations between the human body and the microbiome, especially in the context of disease diagnosis and therapy, which promise rich research opportunities.

  9. Practical applications of next-generation sequencing for food-safety research

    USDA-ARS?s Scientific Manuscript database

    Next-generation sequencing (NGS) is a transformative technology that is revolutionizing the biological sciences. However, many researchers remain uncertain as to the best ways to harness the power of NGS and apply it to their own research questions. Here we highlight three case studies of how NGS ...

  10. Using Sequence Diagrams to Detect Communication Problems Between Systems

    NASA Technical Reports Server (NTRS)

    Lindvall, Mikael; Ackermann, Chris; Stratton, William C.; Sibol, Deane E.; Ray, Arnab; Yonkwa, Lyly; Kresser, Jan; Godfrey, Sally H.; Knodel, Jens

    2008-01-01

    Many software systems are evolving complex system of systems (SoS) for which inter-system communication is both mission-critical and error-prone. Such communication problems ideally would be detected before deployment. In a NASA-supported Software Assurance Research Program (SARP) project, we are researching a new approach addressing such problems. In this paper, we show that problems in the communication between two systems can be detected by using sequence diagrams to model the planned communication and by comparing the planned sequence to the actual sequence. We identify different kinds of problems that can be addressed by modeling the planned sequence using different level of abstractions.

  11. A knowledge engineering approach to recognizing and extracting sequences of nucleic acids from scientific literature.

    PubMed

    García-Remesal, Miguel; Maojo, Victor; Crespo, José

    2010-01-01

    In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. For such set, we achieved 87.76% precision and 97.70% recall respectively. This method can facilitate different research tasks. These include text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences.

  12. Learning to Count: Structured Practice with Spatial Cues Supports the Development of Counting Sequence Knowledge in 3-Year-Old English-Speaking Children

    ERIC Educational Resources Information Center

    Dunbar, Kristina; Ridha, Aala; Cankaya, Ozlem; Jiménez Lira, Carolina; LeFevre, Jo-Anne

    2017-01-01

    Research Findings: Children who speak English are slower to learn the counting sequence between 11 and 20 compared to children who speak Asian languages. In the present research, we examined whether providing children with spatially relevant information during counting would facilitate their acquisition of the counting sequence. Three-year-olds…

  13. Next-Generation Technologies for Multiomics Approaches Including Interactome Sequencing

    PubMed Central

    Ohashi, Hiroyuki; Miyamoto-Sato, Etsuko

    2015-01-01

    The development of high-speed analytical techniques such as next-generation sequencing and microarrays allows high-throughput analysis of biological information at a low cost. These techniques contribute to medical and bioscience advancements and provide new avenues for scientific research. Here, we outline a variety of new innovative techniques and discuss their use in omics research (e.g., genomics, transcriptomics, metabolomics, proteomics, and interactomics). We also discuss the possible applications of these methods, including an interactome sequencing technology that we developed, in future medical and life science research. PMID:25649523

  14. Ethical and legal implications of whole genome and whole exome sequencing in African populations.

    PubMed

    Wright, Galen E B; Koornhof, Pieter G J; Adeyemo, Adebowale A; Tiffin, Nicki

    2013-05-28

    Rapid advances in high throughput genomic technologies and next generation sequencing are making medical genomic research more readily accessible and affordable, including the sequencing of patient and control whole genomes and exomes in order to elucidate genetic factors underlying disease. Over the next five years, the Human Heredity and Health in Africa (H3Africa) Initiative, funded by the Wellcome Trust (United Kingdom) and the National Institutes of Health (United States of America), will contribute greatly towards sequencing of numerous African samples for biomedical research. Funding agencies and journals often require submission of genomic data from research participants to databases that allow open or controlled data access for all investigators. Access to such genotype-phenotype and pedigree data, however, needs careful control in order to prevent identification of individuals or families. This is particularly the case in Africa, where many researchers and their patients are inexperienced in the ethical issues accompanying whole genome and exome research; and where an historical unidirectional flow of samples and data out of Africa has created a sense of exploitation and distrust. In the current study, we analysed the implications of the anticipated surge of next generation sequencing data in Africa and the subsequent data sharing concepts on the protection of privacy of research subjects. We performed a retrospective analysis of the informed consent process for the continent and the rest-of-the-world and examined relevant legislation, both current and proposed. We investigated the following issues: (i) informed consent, including guidelines for performing culturally-sensitive next generation sequencing research in Africa and availability of suitable informed consent documents; (ii) data security and subject privacy whilst practicing data sharing; (iii) conveying the implications of such concepts to research participants in resource limited settings. We conclude that, in order to meet the unique requirements of performing next generation sequencing-related research in African populations, novel approaches to the informed consent process are required. This will help to avoid infringement of privacy of individual subjects as well as to ensure that informed consent adheres to acceptable data protection levels with regard to use and transfer of such information.

  15. Ethical and legal implications of whole genome and whole exome sequencing in African populations

    PubMed Central

    2013-01-01

    Background Rapid advances in high throughput genomic technologies and next generation sequencing are making medical genomic research more readily accessible and affordable, including the sequencing of patient and control whole genomes and exomes in order to elucidate genetic factors underlying disease. Over the next five years, the Human Heredity and Health in Africa (H3Africa) Initiative, funded by the Wellcome Trust (United Kingdom) and the National Institutes of Health (United States of America), will contribute greatly towards sequencing of numerous African samples for biomedical research. Discussion Funding agencies and journals often require submission of genomic data from research participants to databases that allow open or controlled data access for all investigators. Access to such genotype-phenotype and pedigree data, however, needs careful control in order to prevent identification of individuals or families. This is particularly the case in Africa, where many researchers and their patients are inexperienced in the ethical issues accompanying whole genome and exome research; and where an historical unidirectional flow of samples and data out of Africa has created a sense of exploitation and distrust. In the current study, we analysed the implications of the anticipated surge of next generation sequencing data in Africa and the subsequent data sharing concepts on the protection of privacy of research subjects. We performed a retrospective analysis of the informed consent process for the continent and the rest-of-the-world and examined relevant legislation, both current and proposed. We investigated the following issues: (i) informed consent, including guidelines for performing culturally-sensitive next generation sequencing research in Africa and availability of suitable informed consent documents; (ii) data security and subject privacy whilst practicing data sharing; (iii) conveying the implications of such concepts to research participants in resource limited settings. Summary We conclude that, in order to meet the unique requirements of performing next generation sequencing-related research in African populations, novel approaches to the informed consent process are required. This will help to avoid infringement of privacy of individual subjects as well as to ensure that informed consent adheres to acceptable data protection levels with regard to use and transfer of such information. PMID:23714101

  16. Apollo 13 post-flight Service Module tests to determine reason for explosion

    NASA Technical Reports Server (NTRS)

    1970-01-01

    Sequence photo from 16mm motion picture film of test at Langley Research Center which seeks to determine mechanism by which Apollo 13 panel was separated from Service Module. The test used a 1/2 scale model with a honeycomb sandwich panel and was conducted in a vacuum (41982); Second photograph in sequence of three of panel separation test at Langley Research Center (41983); Full-scale propogation test at the NASA Manned Spacecraft Center of fire inside the Apollo Service Module oxygen tank. The photograph from a motion picture sequence taken from outside the vessel shows failure of tank conduit with abrupt loss of oxygen pressure (41984); Third photograph in sequence of three showing panel separation test at Langley Research Center (41985).

  17. Teaching Research Methodology Using a Project-Based Three Course Sequence Critical Reflections on Practice

    ERIC Educational Resources Information Center

    Braguglia, Kay H.; Jackson, Kanata A.

    2012-01-01

    This article presents a reflective analysis of teaching research methodology through a three course sequence using a project-based approach. The authors reflect critically on their experiences in teaching research methods courses in an undergraduate business management program. The introduction of a range of specific techniques including student…

  18. International Space Station Research Plan, Assembly Sequence Rev., F

    DTIC Science & Technology

    2000-08-01

    muscles ü Higher risk for bone fracture upon return to Earth ü Potential for “slipped discs” ü Diminished ability to quickly respond to emergencies...Office of Biological and Physical Research International Space Station Research Plan Assembly Sequence Rev. F, Aug. 2000l . , . Report...Organization Name(s) and Address(es) NASA, Office of Biological and Physical Research Performing Organization Report Number Sponsoring/Monitoring Agency

  19. Biomolecule Sequencer: Next-Generation DNA Sequencing Technology for In-Flight Environmental Monitoring, Research, and Beyond

    NASA Technical Reports Server (NTRS)

    Smith, David J.; Burton, Aaron; Castro-Wallace, Sarah; John, Kristen; Stahl, Sarah E.; Dworkin, Jason Peter; Lupisella, Mark L.

    2016-01-01

    On the International Space Station (ISS), technologies capable of rapid microbial identification and disease diagnostics are not currently available. NASA still relies upon sample return for comprehensive, molecular-based sample characterization. Next-generation DNA sequencing is a powerful approach for identifying microorganisms in air, water, and surfaces onboard spacecraft. The Biomolecule Sequencer payload, manifested to SpaceX-9 and scheduled on the Increment 4748 research plan (June 2016), will assess the functionality of a commercially-available next-generation DNA sequencer in the microgravity environment of ISS. The MinION device from Oxford Nanopore Technologies (Oxford, UK) measures picoamp changes in electrical current dependent on nucleotide sequences of the DNA strand migrating through nanopores in the system. The hardware is exceptionally small (9.5 x 3.2 x 1.6 cm), lightweight (120 grams), and powered only by a USB connection. For the ISS technology demonstration, the Biomolecule Sequencer will be powered by a Microsoft Surface Pro3. Ground-prepared samples containing lambda bacteriophage, Escherichia coli, and mouse genomic DNA, will be launched and stored frozen on the ISS until experiment initiation. Immediately prior to sequencing, a crew member will collect and thaw frozen DNA samples, connect the sequencer to the Surface Pro3, inject thawed samples into a MinION flow cell, and initiate sequencing. At the completion of the sequencing run, data will be downlinked for ground analysis. Identical, synchronous ground controls will be used for data comparisons to determine sequencer functionality, run-time sequence, current dynamics, and overall accuracy. We will present our latest results from the ISS flight experiment the first time DNA has ever been sequenced in space and discuss the many potential applications of the Biomolecule Sequencer for environmental monitoring, medical diagnostics, higher fidelity and more adaptable Space Biology Human Research Program investigations, and even life detection experiments for astrobiology missions.

  20. Clinical Sequencing Exploratory Research Consortium: Accelerating Evidence-Based Practice of Genomic Medicine.

    PubMed

    Green, Robert C; Goddard, Katrina A B; Jarvik, Gail P; Amendola, Laura M; Appelbaum, Paul S; Berg, Jonathan S; Bernhardt, Barbara A; Biesecker, Leslie G; Biswas, Sawona; Blout, Carrie L; Bowling, Kevin M; Brothers, Kyle B; Burke, Wylie; Caga-Anan, Charlisse F; Chinnaiyan, Arul M; Chung, Wendy K; Clayton, Ellen W; Cooper, Gregory M; East, Kelly; Evans, James P; Fullerton, Stephanie M; Garraway, Levi A; Garrett, Jeremy R; Gray, Stacy W; Henderson, Gail E; Hindorff, Lucia A; Holm, Ingrid A; Lewis, Michelle Huckaby; Hutter, Carolyn M; Janne, Pasi A; Joffe, Steven; Kaufman, David; Knoppers, Bartha M; Koenig, Barbara A; Krantz, Ian D; Manolio, Teri A; McCullough, Laurence; McEwen, Jean; McGuire, Amy; Muzny, Donna; Myers, Richard M; Nickerson, Deborah A; Ou, Jeffrey; Parsons, Donald W; Petersen, Gloria M; Plon, Sharon E; Rehm, Heidi L; Roberts, J Scott; Robinson, Dan; Salama, Joseph S; Scollon, Sarah; Sharp, Richard R; Shirts, Brian; Spinner, Nancy B; Tabor, Holly K; Tarczy-Hornoch, Peter; Veenstra, David L; Wagle, Nikhil; Weck, Karen; Wilfond, Benjamin S; Wilhelmsen, Kirk; Wolf, Susan M; Wynn, Julia; Yu, Joon-Ho

    2016-06-02

    Despite rapid technical progress and demonstrable effectiveness for some types of diagnosis and therapy, much remains to be learned about clinical genome and exome sequencing (CGES) and its role within the practice of medicine. The Clinical Sequencing Exploratory Research (CSER) consortium includes 18 extramural research projects, one National Human Genome Research Institute (NHGRI) intramural project, and a coordinating center funded by the NHGRI and National Cancer Institute. The consortium is exploring analytic and clinical validity and utility, as well as the ethical, legal, and social implications of sequencing via multidisciplinary approaches; it has thus far recruited 5,577 participants across a spectrum of symptomatic and healthy children and adults by utilizing both germline and cancer sequencing. The CSER consortium is analyzing data and creating publically available procedures and tools related to participant preferences and consent, variant classification, disclosure and management of primary and secondary findings, health outcomes, and integration with electronic health records. Future research directions will refine measures of clinical utility of CGES in both germline and somatic testing, evaluate the use of CGES for screening in healthy individuals, explore the penetrance of pathogenic variants through extensive phenotyping, reduce discordances in public databases of genes and variants, examine social and ethnic disparities in the provision of genomics services, explore regulatory issues, and estimate the value and downstream costs of sequencing. The CSER consortium has established a shared community of research sites by using diverse approaches to pursue the evidence-based development of best practices in genomic medicine. Copyright © 2016 American Society of Human Genetics. All rights reserved.

  1. New developments in ancient genomics.

    PubMed

    Millar, Craig D; Huynen, Leon; Subramanian, Sankar; Mohandesan, Elmira; Lambert, David M

    2008-07-01

    Ancient DNA research is on the crest of a 'third wave' of progress due to the introduction of a new generation of DNA sequencing technologies. Here we review the advantages and disadvantages of the four new DNA sequencers that are becoming available to researchers. These machines now allow the recovery of orders of magnitude more DNA sequence data, albeit as short sequence reads. Hence, the potential reassembly of complete ancient genomes seems imminent, and when used to screen libraries of ancient sequences, these methods are cost effective. This new wealth of data is also likely to herald investigations into the functional properties of extinct genes and gene complexes and will improve our understanding of the biological basis of extinct phenotypes.

  2. Genome assembly reborn: recent computational challenges

    PubMed Central

    2009-01-01

    Research into genome assembly algorithms has experienced a resurgence due to new challenges created by the development of next generation sequencing technologies. Several genome assemblers have been published in recent years specifically targeted at the new sequence data; however, the ever-changing technological landscape leads to the need for continued research. In addition, the low cost of next generation sequencing data has led to an increased use of sequencing in new settings. For example, the new field of metagenomics relies on large-scale sequencing of entire microbial communities instead of isolate genomes, leading to new computational challenges. In this article, we outline the major algorithmic approaches for genome assembly and describe recent developments in this domain. PMID:19482960

  3. Integration of Temporal and Ordinal Information During Serial Interception Sequence Learning

    PubMed Central

    Gobel, Eric W.; Sanchez, Daniel J.; Reber, Paul J.

    2011-01-01

    The expression of expert motor skills typically involves learning to perform a precisely timed sequence of movements (e.g., language production, music performance, athletic skills). Research examining incidental sequence learning has previously relied on a perceptually-cued task that gives participants exposure to repeating motor sequences but does not require timing of responses for accuracy. Using a novel perceptual-motor sequence learning task, learning a precisely timed cued sequence of motor actions is shown to occur without explicit instruction. Participants learned a repeating sequence through practice and showed sequence-specific knowledge via a performance decrement when switched to an unfamiliar sequence. In a second experiment, the integration of representation of action order and timing sequence knowledge was examined. When either action order or timing sequence information was selectively disrupted, performance was reduced to levels similar to completely novel sequences. Unlike prior sequence-learning research that has found timing information to be secondary to learning action sequences, when the task demands require accurate action and timing information, an integrated representation of these types of information is acquired. These results provide the first evidence for incidental learning of fully integrated action and timing sequence information in the absence of an independent representation of action order, and suggest that this integrative mechanism may play a material role in the acquisition of complex motor skills. PMID:21417511

  4. RNA sequencing: current and prospective uses in metabolic research.

    PubMed

    Vikman, Petter; Fadista, Joao; Oskolkov, Nikolay

    2014-10-01

    Previous global RNA analysis was restricted to known transcripts in species with a defined transcriptome. Next generation sequencing has transformed transcriptomics by making it possible to analyse expressed genes with an exon level resolution from any tissue in any species without any a priori knowledge of which genes that are being expressed, splice patterns or their nucleotide sequence. In addition, RNA sequencing is a more sensitive technique compared with microarrays with a larger dynamic range, and it also allows for investigation of imprinting and allele-specific expression. This can be done for a cost that is able to compete with that of a microarray, making RNA sequencing a technique available to most researchers. Therefore RNA sequencing has recently become the state of the art with regards to large-scale RNA investigations and has to a large extent replaced microarrays. The only drawback is the large data amounts produced, which together with the complexity of the data can make a researcher spend far more time on analysis than performing the actual experiment. © 2014 Society for Endocrinology.

  5. Research progress of plant population genomics based on high-throughput sequencing.

    PubMed

    Wang, Yun-sheng

    2016-08-01

    Population genomics, a new paradigm for population genetics, combine the concepts and techniques of genomics with the theoretical system of population genetics and improve our understanding of microevolution through identification of site-specific effect and genome-wide effects using genome-wide polymorphic sites genotypeing. With the appearance and improvement of the next generation high-throughput sequencing technology, the numbers of plant species with complete genome sequences increased rapidly and large scale resequencing has also been carried out in recent years. Parallel sequencing has also been done in some plant species without complete genome sequences. These studies have greatly promoted the development of population genomics and deepened our understanding of the genetic diversity, level of linking disequilibium, selection effect, demographical history and molecular mechanism of complex traits of relevant plant population at a genomic level. In this review, I briely introduced the concept and research methods of population genomics and summarized the research progress of plant population genomics based on high-throughput sequencing. I also discussed the prospect as well as existing problems of plant population genomics in order to provide references for related studies.

  6. NGS Catalog: A Database of Next Generation Sequencing Studies in Humans

    PubMed Central

    Xia, Junfeng; Wang, Qingguo; Jia, Peilin; Wang, Bing; Pao, William; Zhao, Zhongming

    2015-01-01

    Next generation sequencing (NGS) technologies have been rapidly applied in biomedical and biological research since its advent only a few years ago, and they are expected to advance at an unprecedented pace in the following years. To provide the research community with a comprehensive NGS resource, we have developed the database Next Generation Sequencing Catalog (NGS Catalog, http://bioinfo.mc.vanderbilt.edu/NGS/index.html), a continually updated database that collects, curates and manages available human NGS data obtained from published literature. NGS Catalog deposits publication information of NGS studies and their mutation characteristics (SNVs, small insertions/deletions, copy number variations, and structural variants), as well as mutated genes and gene fusions detected by NGS. Other functions include user data upload, NGS general analysis pipelines, and NGS software. NGS Catalog is particularly useful for investigators who are new to NGS but would like to take advantage of these powerful technologies for their own research. Finally, based on the data deposited in NGS Catalog, we summarized features and findings from whole exome sequencing, whole genome sequencing, and transcriptome sequencing studies for human diseases or traits. PMID:22517761

  7. Analysis of plant microbe interactions in the era of next generation sequencing technologies

    PubMed Central

    Knief, Claudia

    2014-01-01

    Next generation sequencing (NGS) technologies have impressively accelerated research in biological science during the last years by enabling the production of large volumes of sequence data to a drastically lower price per base, compared to traditional sequencing methods. The recent and ongoing developments in the field allow addressing research questions in plant-microbe biology that were not conceivable just a few years ago. The present review provides an overview of NGS technologies and their usefulness for the analysis of microorganisms that live in association with plants. Possible limitations of the different sequencing systems, in particular sources of errors and bias, are critically discussed and methods are disclosed that help to overcome these shortcomings. A focus will be on the application of NGS methods in metagenomic studies, including the analysis of microbial communities by amplicon sequencing, which can be considered as a targeted metagenomic approach. Different applications of NGS technologies are exemplified by selected research articles that address the biology of the plant associated microbiota to demonstrate the worth of the new methods. PMID:24904612

  8. Fast single-pass alignment and variant calling using sequencing data

    USDA-ARS?s Scientific Manuscript database

    Sequencing research requires efficient computation. Few programs use already known information about DNA variants when aligning sequence data to the reference map. New program findmap.f90 reads the previous variant list before aligning sequence, calling variant alleles, and summing the allele counts...

  9. HASP server: a database and structural visualization platform for comparative models of influenza A hemagglutinin proteins.

    PubMed

    Ambroggio, Xavier I; Dommer, Jennifer; Gopalan, Vivek; Dunham, Eleca J; Taubenberger, Jeffery K; Hurt, Darrell E

    2013-06-18

    Influenza A viruses possess RNA genomes that mutate frequently in response to immune pressures. The mutations in the hemagglutinin genes are particularly significant, as the hemagglutinin proteins mediate attachment and fusion to host cells, thereby influencing viral pathogenicity and species specificity. Large-scale influenza A genome sequencing efforts have been ongoing to understand past epidemics and pandemics and anticipate future outbreaks. Sequencing efforts thus far have generated nearly 9,000 distinct hemagglutinin amino acid sequences. Comparative models for all publicly available influenza A hemagglutinin protein sequences (8,769 to date) were generated using the Rosetta modeling suite. The C-alpha root mean square deviations between a randomly chosen test set of models and their crystallographic templates were less than 2 Å, suggesting that the modeling protocols yielded high-quality results. The models were compiled into an online resource, the Hemagglutinin Structure Prediction (HASP) server. The HASP server was designed as a scientific tool for researchers to visualize hemagglutinin protein sequences of interest in a three-dimensional context. With a built-in molecular viewer, hemagglutinin models can be compared side-by-side and navigated by a corresponding sequence alignment. The models and alignments can be downloaded for offline use and further analysis. The modeling protocols used in the HASP server scale well for large amounts of sequences and will keep pace with expanded sequencing efforts. The conservative approach to modeling and the intuitive search and visualization interfaces allow researchers to quickly analyze hemagglutinin sequences of interest in the context of the most highly related experimental structures, and allow them to directly compare hemagglutinin sequences to each other simultaneously in their two- and three-dimensional contexts. The models and methodology have shown utility in current research efforts and the ongoing aim of the HASP server is to continue to accelerate influenza A research and have a positive impact on global public health.

  10. A test matrix sequencer for research test facility automation

    NASA Technical Reports Server (NTRS)

    Mccartney, Timothy P.; Emery, Edward F.

    1990-01-01

    The hardware and software configuration of a Test Matrix Sequencer, a general purpose test matrix profiler that was developed for research test facility automation at the NASA Lewis Research Center, is described. The system provides set points to controllers and contact closures to data systems during the course of a test. The Test Matrix Sequencer consists of a microprocessor controlled system which is operated from a personal computer. The software program, which is the main element of the overall system is interactive and menu driven with pop-up windows and help screens. Analog and digital input/output channels can be controlled from a personal computer using the software program. The Test Matrix Sequencer provides more efficient use of aeronautics test facilities by automating repetitive tasks that were once done manually.

  11. Web Apollo: a web-based genomic annotation editing platform.

    PubMed

    Lee, Eduardo; Helt, Gregg A; Reese, Justin T; Munoz-Torres, Monica C; Childers, Chris P; Buels, Robert M; Stein, Lincoln; Holmes, Ian H; Elsik, Christine G; Lewis, Suzanna E

    2013-08-30

    Web Apollo is the first instantaneous, collaborative genomic annotation editor available on the web. One of the natural consequences following from current advances in sequencing technology is that there are more and more researchers sequencing new genomes. These researchers require tools to describe the functional features of their newly sequenced genomes. With Web Apollo researchers can use any of the common browsers (for example, Chrome or Firefox) to jointly analyze and precisely describe the features of a genome in real time, whether they are in the same room or working from opposite sides of the world.

  12. Web Apollo: a web-based genomic annotation editing platform

    PubMed Central

    2013-01-01

    Web Apollo is the first instantaneous, collaborative genomic annotation editor available on the web. One of the natural consequences following from current advances in sequencing technology is that there are more and more researchers sequencing new genomes. These researchers require tools to describe the functional features of their newly sequenced genomes. With Web Apollo researchers can use any of the common browsers (for example, Chrome or Firefox) to jointly analyze and precisely describe the features of a genome in real time, whether they are in the same room or working from opposite sides of the world. PMID:24000942

  13. A Molecular Genetics Laboratory Course Applying Bioinformatics and Cell Biology in the Context of Original Research

    PubMed Central

    Pruitt, Wendy M.; Robinson, Lucy C.

    2008-01-01

    Research based laboratory courses have been shown to stimulate student interest in science and to improve scientific skills. We describe here a project developed for a semester-long research-based laboratory course that accompanies a genetics lecture course. The project was designed to allow students to become familiar with the use of bioinformatics tools and molecular biology and genetic approaches while carrying out original research. Students were required to present their hypotheses, experiments, and results in a comprehensive lab report. The lab project concerned the yeast casein kinase 1 (CK1) protein kinase Yck2. CK1 protein kinases are present in all organisms and are well conserved in primary structure. These enzymes display sequence features that differ from other protein kinase subfamilies. Students identified such sequences within the CK1 subfamily, chose a sequence to analyze, used available structural data to determine possible functions for their sequences, and designed mutations within the sequences. After generating the mutant alleles, these were expressed in yeast and tested for function by using two growth assays. The student response to the project was positive, both in terms of knowledge and skills increases and interest in research, and several students are continuing the analysis of mutant alleles as summer projects. PMID:19047427

  14. Whole exome or genome sequencing: nurses need to prepare families for the possibilities.

    PubMed

    Prows, Cynthia A; Tran, Grace; Blosser, Beverly

    2014-12-01

    A discussion of whole exome sequencing and the type of possible results patients and families should be aware of before samples are obtained. To find the genetic cause of a rare disorder, whole exome sequencing analyses all known and suspected human genes from a single sample. Over 20,000 detected DNA variants in each individual exome must be considered as possibly causing disease or disregarded as not relevant to the person's disease. In the process, unexpected gene variants associated with known diseases unrelated to the primary purpose of the test may be incidentally discovered. Because family members' DNA samples are often needed, gene variants associated with known genetic diseases or predispositions for diseases can also be discovered in their samples. Discussion paper. PubMed 2009-2013, list of references in retrieved articles, Google Scholar. Nurses need a general understanding of the scope of potential genomic information that may be revealed with whole exome sequencing to provide support and guidance to individuals and families during their decision-making process, while waiting for results and after disclosure. Nurse scientists who want to use whole exome sequencing in their study design and methods must decide early in study development if they will return primary whole exome sequencing research results and if they will give research participants choices about learning incidental research results. It is critical that nurses translate their knowledge about whole exome sequencing into their patient education and patient advocacy roles and relevant programmes of research. © 2014 John Wiley & Sons Ltd.

  15. A Novel Center Star Multiple Sequence Alignment Algorithm Based on Affine Gap Penalty and K-Band

    NASA Astrophysics Data System (ADS)

    Zou, Quan; Shan, Xiao; Jiang, Yi

    Multiple sequence alignment is one of the most important topics in computational biology, but it cannot deal with the large data so far. As the development of copy-number variant(CNV) and Single Nucleotide Polymorphisms(SNP) research, many researchers want to align numbers of similar sequences for detecting CNV and SNP. In this paper, we propose a novel multiple sequence alignment algorithm based on affine gap penalty and k-band. It can align more quickly and accurately, that will be helpful for mining CNV and SNP. Experiments prove the performance of our algorithm.

  16. Protecting genomic sequence anonymity with generalization lattices.

    PubMed

    Malin, B A

    2005-01-01

    Current genomic privacy technologies assume the identity of genomic sequence data is protected if personal information, such as demographics, are obscured, removed, or encrypted. While demographic features can directly compromise an individual's identity, recent research demonstrates such protections are insufficient because sequence data itself is susceptible to re-identification. To counteract this problem, we introduce an algorithm for anonymizing a collection of person-specific DNA sequences. The technique is termed DNA lattice anonymization (DNALA), and is based upon the formal privacy protection schema of k -anonymity. Under this model, it is impossible to observe or learn features that distinguish one genetic sequence from k-1 other entries in a collection. To maximize information retained in protected sequences, we incorporate a concept generalization lattice to learn the distance between two residues in a single nucleotide region. The lattice provides the most similar generalized concept for two residues (e.g. adenine and guanine are both purines). The method is tested and evaluated with several publicly available human population datasets ranging in size from 30 to 400 sequences. Our findings imply the anonymization schema is feasible for the protection of sequences privacy. The DNALA method is the first computational disclosure control technique for general DNA sequences. Given the computational nature of the method, guarantees of anonymity can be formally proven. There is room for improvement and validation, though this research provides the groundwork from which future researchers can construct genomics anonymization schemas tailored to specific datasharing scenarios.

  17. Legume Genome Initiative at the University of Oklahoma

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bruce A. Roe

    2004-02-27

    Consolidated Appropriations Resolution, 2003 Conference Report for the Department of Energy's Biological and Environmental Research (BER) program provided $481,000 for the Legume Genome Initiative at the University of Oklahoma. These funds were used to support our research that is aimed at determining the entire sequence of the gene rich regions of the genome of the legume, Medicago truncatula, by allowing us to obtain a greater degree of finished BAC sequences from the draft sequences we have already obtained through research funded by the Noble Foundation. During the funding period we increased the number of Medicago truncatula BACs with finished (Bermudamore » standard) sequences from 109 to 359, and the total number of BACs for which we collected sequence data from 584 to 842, 359 of which reached phase 2 (ordered and oriented contigs). We also sequenced a series of pooled BAC clones that cover additional euchromatic (gene rich) genomic regions. This work resulted in 6 refereed publications, see below. Genes whose sequence was determined during this study included multiple members of the plant disease resistance (R-gene) family as well as several genes involved in flavinoid biosynthesis, nitrogen fixation and plant-microbial symbosis. This work also served as a prelude to obtaining NSF funding for the international collaborative effort to complete the entire sequence of the Medicago truncatula genomic euchromatic regions using a BAC based approach.« less

  18. Replication Research in Pedagogical Approaches to Formulaic Sequences: Jones & Haywood (2004) and Alali & Schmitt (2012)

    ERIC Educational Resources Information Center

    Coxhead, Averil

    2018-01-01

    Research into the formulaic nature of language has grown in size and scale in the last 20 years or more, much of it based in corpus studies and involving the identification and categorisation of formulas. Research suggests that there are benefits for second and foreign language learners recognising formulaic sequences when listening and reading,…

  19. ASGARD: an open-access database of annotated transcriptomes for emerging model arthropod species.

    PubMed

    Zeng, Victor; Extavour, Cassandra G

    2012-01-01

    The increased throughput and decreased cost of next-generation sequencing (NGS) have shifted the bottleneck genomic research from sequencing to annotation, analysis and accessibility. This is particularly challenging for research communities working on organisms that lack the basic infrastructure of a sequenced genome, or an efficient way to utilize whatever sequence data may be available. Here we present a new database, the Assembled Searchable Giant Arthropod Read Database (ASGARD). This database is a repository and search engine for transcriptomic data from arthropods that are of high interest to multiple research communities but currently lack sequenced genomes. We demonstrate the functionality and utility of ASGARD using de novo assembled transcriptomes from the milkweed bug Oncopeltus fasciatus, the cricket Gryllus bimaculatus and the amphipod crustacean Parhyale hawaiensis. We have annotated these transcriptomes to assign putative orthology, coding region determination, protein domain identification and Gene Ontology (GO) term annotation to all possible assembly products. ASGARD allows users to search all assemblies by orthology annotation, GO term annotation or Basic Local Alignment Search Tool. User-friendly features of ASGARD include search term auto-completion suggestions based on database content, the ability to download assembly product sequences in FASTA format, direct links to NCBI data for predicted orthologs and graphical representation of the location of protein domains and matches to similar sequences from the NCBI non-redundant database. ASGARD will be a useful repository for transcriptome data from future NGS studies on these and other emerging model arthropods, regardless of sequencing platform, assembly or annotation status. This database thus provides easy, one-stop access to multi-species annotated transcriptome information. We anticipate that this database will be useful for members of multiple research communities, including developmental biology, physiology, evolutionary biology, ecology, comparative genomics and phylogenomics. Database URL: asgard.rc.fas.harvard.edu.

  20. Recombinative Generalization: An Exploratory Study in Musical Reading

    PubMed Central

    Perez, William Ferreira; de Rose, Julio C

    2010-01-01

    The present study aimed to extend the findings of recombinative generalization research in alphabetical reading and spelling to the context of musical reading. One participant was taught to respond discriminatively to six two-note sequences, choosing the corresponding notation on the staff in the presence of each sequence. When novel three- and four-note sequences were presented, she selected the corresponding notation. These results suggest the generality of previous research to the context of musical teaching. PMID:22477462

  1. A strategic stakeholder approach for addressing further analysis requests in whole genome sequencing research.

    PubMed

    Thornock, Bradley Steven O

    2016-01-01

    Whole genome sequencing (WGS) can be a cost-effective and efficient means of diagnosis for some children, but it also raises a number of ethical concerns. One such concern is how researchers derive and communicate results from WGS, including future requests for further analysis of stored sequences. The purpose of this paper is to think about what is at stake, and for whom, in any solution that is developed to deal with such requests. To accomplish this task, this paper will utilize stakeholder theory, a common method used in business ethics. Several scenarios that connect stakeholder concerns and WGS will also posited and analyzed. This paper concludes by developing criteria composed of a series of questions that researchers can answer in order to more effectively address requests for further analysis of stored sequences.

  2. Characterization and Amplification of Gene-Based Simple Sequence Repeat (SSR) Markers in Date Palm.

    PubMed

    Zhao, Yongli; Keremane, Manjunath; Prakash, Channapatna S; He, Guohao

    2017-01-01

    The paucity of molecular markers limits the application of genetic and genomic research in date palm (Phoenix dactylifera L.). Availability of expressed sequence tag (EST) sequences in date palm may provide a good resource for developing gene-based markers. This study characterizes a substantial fraction of transcriptome sequences containing simple sequence repeats (SSRs) from the EST sequences in date palm. The EST sequences studied are mainly homologous to those of Elaeis guineensis and Musa acuminata. A total of 911 gene-based SSR markers, characterized with functional annotations, have provided a useful basis not only for discovering candidate genes and understanding genetic basis of traits of interest but also for developing genetic and genomic tools for molecular research in date palm, such as diversity study, quantitative trait locus (QTL) mapping, and molecular breeding. The procedures of DNA extraction, polymerase chain reaction (PCR) amplification of these gene-based SSR markers, and gel electrophoresis of PCR products are described in this chapter.

  3. The application of the high throughput sequencing technology in the transposable elements.

    PubMed

    Liu, Zhen; Xu, Jian-hong

    2015-09-01

    High throughput sequencing technology has dramatically improved the efficiency of DNA sequencing, and decreased the costs to a great extent. Meanwhile, this technology usually has advantages of better specificity, higher sensitivity and accuracy. Therefore, it has been applied to the research on genetic variations, transcriptomics and epigenomics. Recently, this technology has been widely employed in the studies of transposable elements and has achieved fruitful results. In this review, we summarize the application of high throughput sequencing technology in the fields of transposable elements, including the estimation of transposon content, preference of target sites and distribution, insertion polymorphism and population frequency, identification of rare copies, transposon horizontal transfers as well as transposon tagging. We also briefly introduce the major common sequencing strategies and algorithms, their advantages and disadvantages, and the corresponding solutions. Finally, we envision the developing trends of high throughput sequencing technology, especially the third generation sequencing technology, and its application in transposon studies in the future, hopefully providing a comprehensive understanding and reference for related scientific researchers.

  4. Hemoglobin adducts as biomarkers of 1,3-butadiene in occupationally low exposed Italian workers and a few diesel-exposed miners.

    PubMed

    Begemann, P; Upton, P B; Ranasinghe, A; Swenberg, J A; Soleo, L; Vimercati, L; Gelormini, A; Fustinoni, S; Zwirner-Baier, I; Neumann, H G

    2001-06-01

    Hemoglobin adducts were determined as biomarkers of 1,3-butadiene (BD) in 30 workers and 10 controls from an Italian BD plant and in 14 diesel-exposed miners. N-(2,3,4-trihydroxybutyl)valine (THBVal), an N-terminal valine globin adduct of reactive butadiene metabolites, was analyzed by gas chromatography/high resolution mass spectrometry after a modified Edman degradation and further acetylation. The BD exposure for the plant workers was 31 microg/m(3) (personal sampling). Whereas there was no detectable difference in hemoglobin adduct levels (range 17.7-61.4 pmol/g globin) between the total group of exposed and controls, slight but significant differences could be found between two subgroups of workers from different production units as well as one subgroup and controls (P<0.05), between smoking (n=13) and non-smoking exposed workers (n=17; P=0.066) as well as between smoking exposed workers and controls (P=0.055). Adduct levels of the miners (all non-smokers) were in the same range as those of the Italian BD-workers and controls. The internal exposure and strain measured by THBVal levels resulting from a very low occupational BD exposure was in the range of the contribution of moderate smoking.

  5. DEPRESSION AND INTERNALLY DIRECTED AGGRESSION: GENETIC AND ENVIRONMENTAL CONTRIBUTIONS

    PubMed Central

    Haddad, Suzanne K.; Neiderhiser, Jenae M.; Spotts, Erica L.; Ganiban, Jody; Lichtenstein, Paul; Reiss, David

    2013-01-01

    This study uses behavior genetic (BG) methodology to investigate Freud’s theory of depression as aggression directed toward the self (1930) and the extent to which genetically and environmentally influenced aggressive tendencies contribute to depressive symptoms. Data from the Twin and Offspring Study in Sweden (TOSS) is used to demonstrate how, in estimating shared and unique environmental influences, BG methods can inform psychoanalytic theory and practice, particularly because of their shared emphasis on the importance of individual experience in development. The TOSS sample consists of 909 pairs of adult twins, their partners, and one adolescent child per couple. The Center for Epidemiologic Studies Depression Scale (Radloff 1977) was used to measure depressive symptoms and the Karolinska Scales of Personality (Schalling and Edman 1993) to measure internally directed aggression. Genetic analyses indicated that for both men and women, their unique experiences as well as genetic factors contributed equally to the association between internally directed aggression and depressive symptoms. These findings support Freud’s theory that constitutionally based differences in aggression, along with individual experiences, contribute to a person’s depressive symptoms. Establishing that an individual’s unique, not shared, experiences and perceptions contribute to depressive symptoms and internally directed aggression reinforces the use of patient-specific treatment approaches implemented in psychoanalytic psychotherapy or psychoanalysis. PMID:18515705

  6. Historical Perspective, Development and Applications of Next-Generation Sequencing in Plant Virology

    PubMed Central

    Barba, Marina; Czosnek, Henryk; Hadidi, Ahmed

    2014-01-01

    Next-generation high throughput sequencing technologies became available at the onset of the 21st century. They provide a highly efficient, rapid, and low cost DNA sequencing platform beyond the reach of the standard and traditional DNA sequencing technologies developed in the late 1970s. They are continually improved to become faster, more efficient and cheaper. They have been used in many fields of biology since 2004. In 2009, next-generation sequencing (NGS) technologies began to be applied to several areas of plant virology including virus/viroid genome sequencing, discovery and detection, ecology and epidemiology, replication and transcription. Identification and characterization of known and unknown viruses and/or viroids in infected plants are currently among the most successful applications of these technologies. It is expected that NGS will play very significant roles in many research and non-research areas of plant virology. PMID:24399207

  7. Gambling on a shortcut to genome sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Roberts, L.

    1991-06-21

    Almost from the start of the Human Genome Project, a debate has been raging over whether to sequence the entire human genome, all 3 billion bases, or just the genes - a mere 2% or 3% of the genome, and by far the most interesting part. In England, Sydney Brenner convinced the Medical Research Council (MRC) to start with the expressed genes, or complementary DNAs. But the US stance has been that the entire sequence is essential if we are to understand the blueprint of man. Craig Venter of the National Institute of Neurological Disorders and Stroke says that focusingmore » on the expressed genes may be even more useful than expected. His strategy involves randomly selecting clones from cDNA libraries which theoretically contain all the genes that are switched on at a particular time in a particular tissue. Then the researchers sequence just a short stretch of each clone, about 400 to 500 bases, to create can expressed sequence tag or EST. The sequences of these ESTs are then stored in a database. Using that information, other researchers can then recreate that EST by using polymerase chain reaction techniques.« less

  8. Single-Cell Sequencing for Precise Cancer Research: Progress and Prospects.

    PubMed

    Zhang, Xiaoyan; Marjani, Sadie L; Hu, Zhaoyang; Weissman, Sherman M; Pan, Xinghua; Wu, Shixiu

    2016-03-15

    Advances in genomic technology have enabled the faithful detection and measurement of mutations and the gene expression profile of cancer cells at the single-cell level. Recently, several single-cell sequencing methods have been developed that permit the comprehensive and precise analysis of the cancer-cell genome, transcriptome, and epigenome. The use of these methods to analyze cancer cells has led to a series of unanticipated discoveries, such as the high heterogeneity and stochastic changes in cancer-cell populations, the new driver mutations and the complicated clonal evolution mechanisms, and the novel identification of biomarkers of variant tumors. These methods and the knowledge gained from their utilization could potentially improve the early detection and monitoring of rare cancer cells, such as circulating tumor cells and disseminated tumor cells, and promote the development of personalized and highly precise cancer therapy. Here, we discuss the current methods for single cancer-cell sequencing, with a strong focus on those practically used or potentially valuable in cancer research, including single-cell isolation, whole genome and transcriptome amplification, epigenome profiling, multi-dimensional sequencing, and next-generation sequencing and analysis. We also examine the current applications, challenges, and prospects of single cancer-cell sequencing. ©2016 American Association for Cancer Research.

  9. Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment

    PubMed Central

    2011-01-01

    Background Many Bioinformatics studies begin with a multiple sequence alignment as the foundation for their research. This is because multiple sequence alignment can be a useful technique for studying molecular evolution and analyzing sequence structure relationships. Results In this paper, we have proposed a Vertical Decomposition with Genetic Algorithm (VDGA) for Multiple Sequence Alignment (MSA). In VDGA, we divide the sequences vertically into two or more subsequences, and then solve them individually using a guide tree approach. Finally, we combine all the subsequences to generate a new multiple sequence alignment. This technique is applied on the solutions of the initial generation and of each child generation within VDGA. We have used two mechanisms to generate an initial population in this research: the first mechanism is to generate guide trees with randomly selected sequences and the second is shuffling the sequences inside such trees. Two different genetic operators have been implemented with VDGA. To test the performance of our algorithm, we have compared it with existing well-known methods, namely PRRP, CLUSTALX, DIALIGN, HMMT, SB_PIMA, ML_PIMA, MULTALIGN, and PILEUP8, and also other methods, based on Genetic Algorithms (GA), such as SAGA, MSA-GA and RBT-GA, by solving a number of benchmark datasets from BAliBase 2.0. Conclusions The experimental results showed that the VDGA with three vertical divisions was the most successful variant for most of the test cases in comparison to other divisions considered with VDGA. The experimental results also confirmed that VDGA outperformed the other methods considered in this research. PMID:21867510

  10. Genome Science: A Video Tour of the Washington University Genome Sequencing Center for High School and Undergraduate Students

    PubMed Central

    2005-01-01

    Sequencing of the human genome has ushered in a new era of biology. The technologies developed to facilitate the sequencing of the human genome are now being applied to the sequencing of other genomes. In 2004, a partnership was formed between Washington University School of Medicine Genome Sequencing Center's Outreach Program and Washington University Department of Biology Science Outreach to create a video tour depicting the processes involved in large-scale sequencing. “Sequencing a Genome: Inside the Washington University Genome Sequencing Center” is a tour of the laboratory that follows the steps in the sequencing pipeline, interspersed with animated explanations of the scientific procedures used at the facility. Accompanying interviews with the staff illustrate different entry levels for a career in genome science. This video project serves as an example of how research and academic institutions can provide teachers and students with access and exposure to innovative technologies at the forefront of biomedical research. Initial feedback on the video from undergraduate students, high school teachers, and high school students provides suggestions for use of this video in a classroom setting to supplement present curricula. PMID:16341256

  11. De novo characterization of Lentinula edodes C(91-3) transcriptome by deep Solexa sequencing.

    PubMed

    Zhong, Mintao; Liu, Ben; Wang, Xiaoli; Liu, Lei; Lun, Yongzhi; Li, Xingyun; Ning, Anhong; Cao, Jing; Huang, Min

    2013-02-01

    Lentinula edodes, has been utilized as food, as well as, in popular medicine, moreover, its extract isolated from its mycelium and fruiting body have shown several therapeutic properties. Yet little is understood about its genes involved in these properties, and the absence of L.edodes genomes has been a barrier to the development of functional genomics research. However, high throughput sequencing technologies are now being widely applied to non-model species. To facilitate research on L.edodes, we leveraged Solexa sequencing technology in de novo assembly of L.edodes C(91-3) transcriptome. In a single run, we produced more than 57 million sequencing reads. These reads were assembled into 28,923 unigene sequences (mean size=689bp) including 18,120 unigenes with coding sequence (CDS). Based on similarity search with known proteins, assembled unigene sequences were annotated with gene descriptions, gene ontology (GO) and clusters of orthologous group (COG) terms. Our data provides the first comprehensive sequence resource available for functional genomics studies in L.edodes, and demonstrates the utility of Illumina/Solexa sequencing for de novo transcriptome characterization and gene discovery in a non-model mushroom. Copyright © 2012 Elsevier Inc. All rights reserved.

  12. Identification of optimum sequencing depth especially for de novo genome assembly of small genomes using next generation sequencing data.

    PubMed

    Desai, Aarti; Marwah, Veer Singh; Yadav, Akshay; Jha, Vineet; Dhaygude, Kishor; Bangar, Ujwala; Kulkarni, Vivek; Jere, Abhay

    2013-01-01

    Next Generation Sequencing (NGS) is a disruptive technology that has found widespread acceptance in the life sciences research community. The high throughput and low cost of sequencing has encouraged researchers to undertake ambitious genomic projects, especially in de novo genome sequencing. Currently, NGS systems generate sequence data as short reads and de novo genome assembly using these short reads is computationally very intensive. Due to lower cost of sequencing and higher throughput, NGS systems now provide the ability to sequence genomes at high depth. However, currently no report is available highlighting the impact of high sequence depth on genome assembly using real data sets and multiple assembly algorithms. Recently, some studies have evaluated the impact of sequence coverage, error rate and average read length on genome assembly using multiple assembly algorithms, however, these evaluations were performed using simulated datasets. One limitation of using simulated datasets is that variables such as error rates, read length and coverage which are known to impact genome assembly are carefully controlled. Hence, this study was undertaken to identify the minimum depth of sequencing required for de novo assembly for different sized genomes using graph based assembly algorithms and real datasets. Illumina reads for E.coli (4.6 MB) S.kudriavzevii (11.18 MB) and C.elegans (100 MB) were assembled using SOAPdenovo, Velvet, ABySS, Meraculous and IDBA-UD. Our analysis shows that 50X is the optimum read depth for assembling these genomes using all assemblers except Meraculous which requires 100X read depth. Moreover, our analysis shows that de novo assembly from 50X read data requires only 6-40 GB RAM depending on the genome size and assembly algorithm used. We believe that this information can be extremely valuable for researchers in designing experiments and multiplexing which will enable optimum utilization of sequencing as well as analysis resources.

  13. Comet: an open-source MS/MS sequence database search tool.

    PubMed

    Eng, Jimmy K; Jahan, Tahmina A; Hoopmann, Michael R

    2013-01-01

    Proteomics research routinely involves identifying peptides and proteins via MS/MS sequence database search. Thus the database search engine is an integral tool in many proteomics research groups. Here, we introduce the Comet search engine to the existing landscape of commercial and open-source database search tools. Comet is open source, freely available, and based on one of the original sequence database search tools that has been widely used for many years. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Special Focus

    PubMed Central

    Nawrocki, Eric P.; Burge, Sarah W.

    2013-01-01

    The development of RNA bioinformatic tools began more than 30 y ago with the description of the Nussinov and Zuker dynamic programming algorithms for single sequence RNA secondary structure prediction. Since then, many tools have been developed for various RNA sequence analysis problems such as homology search, multiple sequence alignment, de novo RNA discovery, read-mapping, and many more. In this issue, we have collected a sampling of reviews and original research that demonstrate some of the many ways bioinformatics is integrated with current RNA biology research. PMID:23948768

  15. Nanopore sequencing in microgravity

    PubMed Central

    McIntyre, Alexa B R; Rizzardi, Lindsay; Yu, Angela M; Alexander, Noah; Rosen, Gail L; Botkin, Douglas J; Stahl, Sarah E; John, Kristen K; Castro-Wallace, Sarah L; McGrath, Ken; Burton, Aaron S; Feinberg, Andrew P; Mason, Christopher E

    2016-01-01

    Rapid DNA sequencing and analysis has been a long-sought goal in remote research and point-of-care medicine. In microgravity, DNA sequencing can facilitate novel astrobiological research and close monitoring of crew health, but spaceflight places stringent restrictions on the mass and volume of instruments, crew operation time, and instrument functionality. The recent emergence of portable, nanopore-based tools with streamlined sample preparation protocols finally enables DNA sequencing on missions in microgravity. As a first step toward sequencing in space and aboard the International Space Station (ISS), we tested the Oxford Nanopore Technologies MinION during a parabolic flight to understand the effects of variable gravity on the instrument and data. In a successful proof-of-principle experiment, we found that the instrument generated DNA reads over the course of the flight, including the first ever sequenced in microgravity, and additional reads measured after the flight concluded its parabolas. Here we detail modifications to the sample-loading procedures to facilitate nanopore sequencing aboard the ISS and in other microgravity environments. We also evaluate existing analysis methods and outline two new approaches, the first based on a wave-fingerprint method and the second on entropy signal mapping. Computationally light analysis methods offer the potential for in situ species identification, but are limited by the error profiles (stays, skips, and mismatches) of older nanopore data. Higher accuracies attainable with modified sample processing methods and the latest version of flow cells will further enable the use of nanopore sequencers for diagnostics and research in space. PMID:28725742

  16. Making Strategic Decisions: Conducting and Using Research on the Impact of Sequenced Library Instruction

    ERIC Educational Resources Information Center

    Lundstrom, Kacy; Martin, Pamela; Cochran, Dory

    2016-01-01

    This study explores the relationship between course grades and sequenced library instruction interventions throughout psychology students' curriculum. Researchers conducted this study to inform decisions about sustaining and improving program integrations for first- and second-year composition courses and to improve discipline-level integrations.…

  17. Next-Generation Sequencing in the Mycology Lab.

    PubMed

    Zoll, Jan; Snelders, Eveline; Verweij, Paul E; Melchers, Willem J G

    New state-of-the-art techniques in sequencing offer valuable tools in both detection of mycobiota and in understanding of the molecular mechanisms of resistance against antifungal compounds and virulence. Introduction of new sequencing platform with enhanced capacity and a reduction in costs for sequence analysis provides a potential powerful tool in mycological diagnosis and research. In this review, we summarize the applications of next-generation sequencing techniques in mycology.

  18. Music and language perception: expectations, structural integration, and cognitive sequencing.

    PubMed

    Tillmann, Barbara

    2012-10-01

    Music can be described as sequences of events that are structured in pitch and time. Studying music processing provides insight into how complex event sequences are learned, perceived, and represented by the brain. Given the temporal nature of sound, expectations, structural integration, and cognitive sequencing are central in music perception (i.e., which sounds are most likely to come next and at what moment should they occur?). This paper focuses on similarities in music and language cognition research, showing that music cognition research provides insight into the understanding of not only music processing but also language processing and the processing of other structured stimuli. The hypothesis of shared resources between music and language processing and of domain-general dynamic attention has motivated the development of research to test music as a means to stimulate sensory, cognitive, and motor processes. Copyright © 2012 Cognitive Science Society, Inc.

  19. Refining the structure and content of clinical genomic reports.

    PubMed

    Dorschner, Michael O; Amendola, Laura M; Shirts, Brian H; Kiedrowski, Lesli; Salama, Joseph; Gordon, Adam S; Fullerton, Stephanie M; Tarczy-Hornoch, Peter; Byers, Peter H; Jarvik, Gail P

    2014-03-01

    To effectively articulate the results of exome and genome sequencing we refined the structure and content of molecular test reports. To communicate results of a randomized control trial aimed at the evaluation of exome sequencing for clinical medicine, we developed a structured narrative report. With feedback from genetics and non-genetics professionals, we developed separate indication-specific and incidental findings reports. Standard test report elements were supplemented with research study-specific language, which highlighted the limitations of exome sequencing and provided detailed, structured results, and interpretations. The report format we developed to communicate research results can easily be transformed for clinical use by removal of research-specific statements and disclaimers. The development of clinical reports for exome sequencing has shown that accurate and open communication between the clinician and laboratory is ideally an ongoing process to address the increasing complexity of molecular genetic testing. © 2014 Wiley Periodicals, Inc.

  20. History, applications, and challenges of immune repertoire research.

    PubMed

    Liu, Xiao; Wu, Jinghua

    2018-02-27

    The diversity of T and B cells in terms of their receptor sequences is huge in the vertebrate's immune system and provides broad protection against the vast diversity of pathogens. Immune repertoire is defined as the sum of T cell receptors and B cell receptors (also named immunoglobulin) that makes the organism's adaptive immune system. Before the emergence of high-throughput sequencing, the studies on immune repertoire were limited by the underdeveloped methodologies, since it was impossible to capture the whole picture by the low-throughput tools. The massive paralleled sequencing technology suits perfectly the researches on immune repertoire. In this article, we review the history of immune repertoire studies, in terms of technologies and research applications. Particularly, we discuss several aspects of challenges in this field and highlight the efforts to develop potential solutions, in the era of high-throughput sequencing of the immune repertoire.

  1. Refining the Structure and Content of Clinical Genomic Reports

    PubMed Central

    DORSCHNER, MICHAEL O.; AMENDOLA, LAURA M.; SHIRTS, BRIAN H.; KIEDROWSKI, LESLI; SALAMA, JOSEPH; GORDON, ADAM S.; FULLERTON, STEPHANIE M.; TARCZY-HORNOCH, PETER; BYERS, PETER H.; JARVIK, GAIL P.

    2014-01-01

    To effectively articulate the results of exome and genome sequencing we refined the structure and content of molecular test reports. To communicate results of a randomized control trial aimed at the evaluation of exome sequencing for clinical medicine, we developed a structured narrative report. With feedback from genetics and non-genetics professionals, we developed separate indication-specific and incidental findings reports. Standard test report elements were supplemented with research study-specific language, which highlighted the limitations of exome sequencing and provided detailed, structured results, and interpretations. The report format we developed to communicate research results can easily be transformed for clinical use by removal of research-specific statements and disclaimers. The development of clinical reports for exome sequencing has shown that accurate and open communication between the clinician and laboratory is ideally an ongoing process to address the increasing complexity of molecular genetic testing. PMID:24616401

  2. Evaluating and Redesigning Teaching Learning Sequences at the Introductory Physics Level

    ERIC Educational Resources Information Center

    Guisasola, Jenaro; Zuza, Kristina; Ametller, Jaume; Gutierrez-Berraondo, José

    2017-01-01

    In this paper we put forward a proposal for the design and evaluation of teaching and learning sequences in upper secondary school and university. We will connect our proposal with relevant contributions on the design of teaching sequences, ground it on the design-based research methodology, and discuss how teaching and learning sequences designed…

  3. The Sequence of Study Changes What Information Is Attended to, Encoded, and Remembered during Category Learning

    ERIC Educational Resources Information Center

    Carvalho, Paulo F.; Goldstone, Robert L.

    2017-01-01

    The sequence of study influences how we learn. Previous research has identified different sequences as potentially beneficial for learning in different contexts and with different materials. Here we investigate the mechanisms involved in inductive category learning that give rise to these sequencing effects. Across 3 experiments we show evidence…

  4. A field ornithologist’s guide to genomics: Practical considerations for ecology and conservation

    USGS Publications Warehouse

    Oyler-McCance, Sara J.; Oh, Kevin; Langin, Kathryn; Aldridge, Cameron L.

    2016-01-01

    Vast improvements in sequencing technology have made it practical to simultaneously sequence millions of nucleotides distributed across the genome, opening the door for genomic studies in virtually any species. Ornithological research stands to benefit in three substantial ways. First, genomic methods enhance our ability to parse and simultaneously analyze both neutral and non-neutral genomic regions, thus providing insight into adaptive evolution and divergence. Second, the sheer quantity of sequence data generated by current sequencing platforms allows increased precision and resolution in analyses. Third, high-throughput sequencing can benefit applications that focus on a small number of loci that are otherwise prohibitively expensive, time-consuming, and technically difficult using traditional sequencing methods. These advances have improved our ability to understand evolutionary processes like speciation and local adaptation, but they also offer many practical applications in the fields of population ecology, migration tracking, conservation planning, diet analyses, and disease ecology. This review provides a guide for field ornithologists interested in incorporating genomic approaches into their research program, with an emphasis on techniques related to ecology and conservation. We present a general overview of contemporary genomic approaches and methods, as well as important considerations when selecting a genomic technique. We also discuss research questions that are likely to benefit from utilizing high-throughput sequencing instruments, highlighting select examples from recent avian studies.

  5. Gel-seq: A Method for Simultaneous Sequencing Library Preparation of DNA and RNA Using Hydrogel Matrices.

    PubMed

    Hoople, Gordon D; Richards, Andrew; Wu, Yan; Pisano, Albert P; Zhang, Kun

    2018-03-26

    The ability to amplify and sequence either DNA or RNA from small starting samples has only been achieved in the last five years. Unfortunately, the standard protocols for generating genomic or transcriptomic libraries are incompatible and researchers must choose whether to sequence DNA or RNA for a particular sample. Gel-seq solves this problem by enabling researchers to simultaneously prepare libraries for both DNA and RNA starting with 100 - 1000 cells using a simple hydrogel device. This paper presents a detailed approach for the fabrication of the device as well as the biological protocol to generate paired libraries. We designed Gel-seq so that it could be easily implemented by other researchers; many genetics labs already have the necessary equipment to reproduce the Gel-seq device fabrication. Our protocol employs commonly-used kits for both whole-transcript amplification (WTA) and library preparation, which are also likely to be familiar to researchers already versed in generating genomic and transcriptomic libraries. Our approach allows researchers to bring to bear the power of both DNA and RNA sequencing on a single sample without splitting and with negligible added cost.

  6. Approaching the taxonomic affiliation of unidentified sequences in public databases--an example from the mycorrhizal fungi.

    PubMed

    Nilsson, R Henrik; Kristiansson, Erik; Ryberg, Martin; Larsson, Karl-Henrik

    2005-07-18

    During the last few years, DNA sequence analysis has become one of the primary means of taxonomic identification of species, particularly so for species that are minute or otherwise lack distinct, readily obtainable morphological characters. Although the number of sequences available for comparison in public databases such as GenBank increases exponentially, only a minuscule fraction of all organisms have been sequenced, leaving taxon sampling a momentous problem for sequence-based taxonomic identification. When querying GenBank with a set of unidentified sequences, a considerable proportion typically lack fully identified matches, forming an ever-mounting pile of sequences that the researcher will have to monitor manually in the hope that new, clarifying sequences have been submitted by other researchers. To alleviate these concerns, a project to automatically monitor select unidentified sequences in GenBank for taxonomic progress through repeated local BLAST searches was initiated. Mycorrhizal fungi--a field where species identification often is prohibitively complex--and the much used ITS locus were chosen as test bed. A Perl script package called emerencia is presented. On a regular basis, it downloads select sequences from GenBank, separates the identified sequences from those insufficiently identified, and performs BLAST searches between these two datasets, storing all results in an SQL database. On the accompanying web-service http://emerencia.math.chalmers.se, users can monitor the taxonomic progress of insufficiently identified sequences over time, either through active searches or by signing up for e-mail notification upon disclosure of better matches. Other search categories, such as listing all insufficiently identified sequences (and their present best fully identified matches) publication-wise, are also available. The ever-increasing use of DNA sequences for identification purposes largely falls back on the assumption that public sequence databases contain a thorough sampling of taxonomically well-annotated sequences. Taxonomy, held by some to be an old-fashioned trade, has accordingly never been more important. emerencia does not automate the taxonomic process, but it does allow researchers to focus their efforts elsewhere than countless manual BLAST runs and arduous sieving of BLAST hit lists. The emerencia system is available on an open source basis for local installation with any organism and gene group as targets.

  7. Cloud computing for genomic data analysis and collaboration.

    PubMed

    Langmead, Ben; Nellore, Abhinav

    2018-04-01

    Next-generation sequencing has made major strides in the past decade. Studies based on large sequencing data sets are growing in number, and public archives for raw sequencing data have been doubling in size every 18 months. Leveraging these data requires researchers to use large-scale computational resources. Cloud computing, a model whereby users rent computers and storage from large data centres, is a solution that is gaining traction in genomics research. Here, we describe how cloud computing is used in genomics for research and large-scale collaborations, and argue that its elasticity, reproducibility and privacy features make it ideally suited for the large-scale reanalysis of publicly available archived data, including privacy-protected data.

  8. Ray Wu as Fifth Business: Deconstructing collective memory in the history of DNA sequencing.

    PubMed

    Onaga, Lisa A

    2014-06-01

    The concept of 'Fifth Business' is used to analyze a minority standpoint and bring serious attention to the role of scientists who play a galvanizing role in a science but for multiple reasons appear less prominently in more common recounts of any particular development. Biochemist Ray Wu (1928-2008) published a DNA sequencing experiment in March 1970 using DNA polymerase catalysis and specific nucleotide labeling, both of which are foundational to general sequencing methods today. The scant mention of Wu's work from textbooks, research articles, and other accounts of DNA sequencing calls into question how scientific collective memory forms. This alternative history seeks to understand why a key figure in nucleic acid sequence analysis has remained less visibly connected or peripheral to solidifying narratives about the history of DNA sequencing. The study resists predictable dismissals of Wu's work in order to seriously examine the formation of his nucleic acid sequence analysis research program and how he shared his knowledge of sequencing during a period of rapid advancement in the field. An analysis of Wu's work on sequencing the cohesive ends of lambda bacteriophage in the 1960s and 1970s exemplifies how a variety of individuals and groups attempted to develop protocol for sequencing the order of nucleotide base pairs comprising DNA. This historical examination of the sociality of scientific research suggests a way to understand how Wu and others contributed to the very collective memory of DNA sequencing that Wu eventually tried to repair. The study of Wu, who was a Chinese immigrant to the United States, provides a foundation for further critical scholarship on the heterogeneous histories of Asian American bioscientists, the sociality of their scientific works, and how the resulting knowledge produced is preserved, if not evenly, in a scientific field's collective memory. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Mind the gap; seven reasons to close fragmented genome assemblies

    USDA-ARS?s Scientific Manuscript database

    Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including tho...

  10. SeqHound: biological sequence and structure database as a platform for bioinformatics research

    PubMed Central

    2002-01-01

    Background SeqHound has been developed as an integrated biological sequence, taxonomy, annotation and 3-D structure database system. It provides a high-performance server platform for bioinformatics research in a locally-hosted environment. Results SeqHound is based on the National Center for Biotechnology Information data model and programming tools. It offers daily updated contents of all Entrez sequence databases in addition to 3-D structural data and information about sequence redundancies, sequence neighbours, taxonomy, complete genomes, functional annotation including Gene Ontology terms and literature links to PubMed. SeqHound is accessible via a web server through a Perl, C or C++ remote API or an optimized local API. It provides functionality necessary to retrieve specialized subsets of sequences, structures and structural domains. Sequences may be retrieved in FASTA, GenBank, ASN.1 and XML formats. Structures are available in ASN.1, XML and PDB formats. Emphasis has been placed on complete genomes, taxonomy, domain and functional annotation as well as 3-D structural functionality in the API, while fielded text indexing functionality remains under development. SeqHound also offers a streamlined WWW interface for simple web-user queries. Conclusions The system has proven useful in several published bioinformatics projects such as the BIND database and offers a cost-effective infrastructure for research. SeqHound will continue to develop and be provided as a service of the Blueprint Initiative at the Samuel Lunenfeld Research Institute. The source code and examples are available under the terms of the GNU public license at the Sourceforge site http://sourceforge.net/projects/slritools/ in the SLRI Toolkit. PMID:12401134

  11. Acoustic sequences in non-human animals: a tutorial review and prospectus.

    PubMed

    Kershenbaum, Arik; Blumstein, Daniel T; Roch, Marie A; Akçay, Çağlar; Backus, Gregory; Bee, Mark A; Bohn, Kirsten; Cao, Yan; Carter, Gerald; Cäsar, Cristiane; Coen, Michael; DeRuiter, Stacy L; Doyle, Laurance; Edelman, Shimon; Ferrer-i-Cancho, Ramon; Freeberg, Todd M; Garland, Ellen C; Gustison, Morgan; Harley, Heidi E; Huetz, Chloé; Hughes, Melissa; Hyland Bruno, Julia; Ilany, Amiyaal; Jin, Dezhe Z; Johnson, Michael; Ju, Chenghui; Karnowski, Jeremy; Lohr, Bernard; Manser, Marta B; McCowan, Brenda; Mercado, Eduardo; Narins, Peter M; Piel, Alex; Rice, Megan; Salmi, Roberta; Sasahara, Kazutoshi; Sayigh, Laela; Shiu, Yu; Taylor, Charles; Vallejo, Edgar E; Waller, Sara; Zamora-Gutierrez, Veronica

    2016-02-01

    Animal acoustic communication often takes the form of complex sequences, made up of multiple distinct acoustic units. Apart from the well-known example of birdsong, other animals such as insects, amphibians, and mammals (including bats, rodents, primates, and cetaceans) also generate complex acoustic sequences. Occasionally, such as with birdsong, the adaptive role of these sequences seems clear (e.g. mate attraction and territorial defence). More often however, researchers have only begun to characterise - let alone understand - the significance and meaning of acoustic sequences. Hypotheses abound, but there is little agreement as to how sequences should be defined and analysed. Our review aims to outline suitable methods for testing these hypotheses, and to describe the major limitations to our current and near-future knowledge on questions of acoustic sequences. This review and prospectus is the result of a collaborative effort between 43 scientists from the fields of animal behaviour, ecology and evolution, signal processing, machine learning, quantitative linguistics, and information theory, who gathered for a 2013 workshop entitled, 'Analysing vocal sequences in animals'. Our goal is to present not just a review of the state of the art, but to propose a methodological framework that summarises what we suggest are the best practices for research in this field, across taxa and across disciplines. We also provide a tutorial-style introduction to some of the most promising algorithmic approaches for analysing sequences. We divide our review into three sections: identifying the distinct units of an acoustic sequence, describing the different ways that information can be contained within a sequence, and analysing the structure of that sequence. Each of these sections is further subdivided to address the key questions and approaches in that area. We propose a uniform, systematic, and comprehensive approach to studying sequences, with the goal of clarifying research terms used in different fields, and facilitating collaboration and comparative studies. Allowing greater interdisciplinary collaboration will facilitate the investigation of many important questions in the evolution of communication and sociality. © 2014 Cambridge Philosophical Society.

  12. Acoustic sequences in non-human animals: a tutorial review and prospectus

    PubMed Central

    Kershenbaum, Arik; Blumstein, Daniel T.; Roch, Marie A.; Akçay, Çağlar; Backus, Gregory; Bee, Mark A.; Bohn, Kirsten; Cao, Yan; Carter, Gerald; Cäsar, Cristiane; Coen, Michael; DeRuiter, Stacy L.; Doyle, Laurance; Edelman, Shimon; Ferrer-i-Cancho, Ramon; Freeberg, Todd M.; Garland, Ellen C.; Gustison, Morgan; Harley, Heidi E.; Huetz, Chloé; Hughes, Melissa; Bruno, Julia Hyland; Ilany, Amiyaal; Jin, Dezhe Z.; Johnson, Michael; Ju, Chenghui; Karnowski, Jeremy; Lohr, Bernard; Manser, Marta B.; McCowan, Brenda; Mercado, Eduardo; Narins, Peter M.; Piel, Alex; Rice, Megan; Salmi, Roberta; Sasahara, Kazutoshi; Sayigh, Laela; Shiu, Yu; Taylor, Charles; Vallejo, Edgar E.; Waller, Sara; Zamora-Gutierrez, Veronica

    2015-01-01

    Animal acoustic communication often takes the form of complex sequences, made up of multiple distinct acoustic units. Apart from the well-known example of birdsong, other animals such as insects, amphibians, and mammals (including bats, rodents, primates, and cetaceans) also generate complex acoustic sequences. Occasionally, such as with birdsong, the adaptive role of these sequences seems clear (e.g. mate attraction and territorial defence). More often however, researchers have only begun to characterise – let alone understand – the significance and meaning of acoustic sequences. Hypotheses abound, but there is little agreement as to how sequences should be defined and analysed. Our review aims to outline suitable methods for testing these hypotheses, and to describe the major limitations to our current and near-future knowledge on questions of acoustic sequences. This review and prospectus is the result of a collaborative effort between 43 scientists from the fields of animal behaviour, ecology and evolution, signal processing, machine learning, quantitative linguistics, and information theory, who gathered for a 2013 workshop entitled, “Analysing vocal sequences in animals”. Our goal is to present not just a review of the state of the art, but to propose a methodological framework that summarises what we suggest are the best practices for research in this field, across taxa and across disciplines. We also provide a tutorial-style introduction to some of the most promising algorithmic approaches for analysing sequences. We divide our review into three sections: identifying the distinct units of an acoustic sequence, describing the different ways that information can be contained within a sequence, and analysing the structure of that sequence. Each of these sections is further subdivided to address the key questions and approaches in that area. We propose a uniform, systematic, and comprehensive approach to studying sequences, with the goal of clarifying research terms used in different fields, and facilitating collaboration and comparative studies. Allowing greater interdisciplinary collaboration will facilitate the investigation of many important questions in the evolution of communication and sociality. PMID:25428267

  13. The Bivariate (Complex) Fibonacci and Lucas Polynomials: An Historical Investigation with the Maple's Help

    ERIC Educational Resources Information Center

    Alves, Francisco Regis Vieira; Catarino, Paula Maria Machado Cruz

    2016-01-01

    The current research around the Fibonacci's and Lucas' sequence evidences the scientific vigor of both mathematical models that continue to inspire and provide numerous specializations and generalizations, especially from the sixties. One of the current of research and investigations around the Generalized Sequence of Lucas, involves it's…

  14. Technology-Enhanced Research in the Science Classroom.

    ERIC Educational Resources Information Center

    Francis, Joseph W.

    1997-01-01

    Describes a project where students use the Internet as a research tool. Discusses using e-mail to access molecular biology databases and identify proteins using amino acid sequences, obtaining complete amino acid sequences using the world wide web, using telnet to access library resources on the Internet, and various stages of protein analysis…

  15. Real-time DNA barcoding in a rainforest using nanopore sequencing: opportunities for rapid biodiversity assessments and local capacity building.

    PubMed

    Pomerantz, Aaron; Peñafiel, Nicolás; Arteaga, Alejandro; Bustamante, Lucas; Pichardo, Frank; Coloma, Luis A; Barrio-Amorós, César L; Salazar-Valenzuela, David; Prost, Stefan

    2018-04-01

    Advancements in portable scientific instruments provide promising avenues to expedite field work in order to understand the diverse array of organisms that inhabit our planet. Here, we tested the feasibility for in situ molecular analyses of endemic fauna using a portable laboratory fitting within a single backpack in one of the world's most imperiled biodiversity hotspots, the Ecuadorian Chocó rainforest. We used portable equipment, including the MinION nanopore sequencer (Oxford Nanopore Technologies) and the miniPCR (miniPCR), to perform DNA extraction, polymerase chain reaction amplification, and real-time DNA barcoding of reptile specimens in the field. We demonstrate that nanopore sequencing can be implemented in a remote tropical forest to quickly and accurately identify species using DNA barcoding, as we generated consensus sequences for species resolution with an accuracy of >99% in less than 24 hours after collecting specimens. The flexibility of our mobile laboratory further allowed us to generate sequence information at the Universidad Tecnológica Indoamérica in Quito for rare, endangered, and undescribed species. This includes the recently rediscovered Jambato toad, which was thought to be extinct for 28 years. Sequences generated on the MinION required as few as 30 reads to achieve high accuracy relative to Sanger sequencing, and with further multiplexing of samples, nanopore sequencing can become a cost-effective approach for rapid and portable DNA barcoding. Overall, we establish how mobile laboratories and nanopore sequencing can help to accelerate species identification in remote areas to aid in conservation efforts and be applied to research facilities in developing countries. This opens up possibilities for biodiversity studies by promoting local research capacity building, teaching nonspecialists and students about the environment, tackling wildlife crime, and promoting conservation via research-focused ecotourism.

  16. Motor programming when sequencing multiple elements of the same duration.

    PubMed

    Magnuson, Curt E; Robin, Donald A; Wright, David L

    2008-11-01

    Motor programming at the self-select paradigm was adopted in 2 experiments to examine the processing demands of independent processes. One process (INT) is responsible for organizing the internal features of the individual elements in a movement (e.g., response duration). The 2nd process (SEQ) is responsible for placing the elements into the proper serial order before execution. Participants in Experiment 1 performed tasks involving 1 key press or sequences of 4 key presses of the same duration. Implementing INT and SEQ was more time consuming for key-pressing sequences than for single key-press tasks. Experiment 2 examined whether the INT costs resulting from the increase in sequence length observed in Experiment 1 resulted from independent planning of each sequence element or via a separate "multiplier" process that handled repetitions of elements of the same duration. Findings from Experiment 2, in which participants performed single key presses or double or triple key sequences of the same duration, suggested that INT is involved with the independent organization of each element contained in the sequence. Researchers offer an elaboration of the 2-process account of motor programming to incorporate the present findings and the findings from other recent sequence-learning research.

  17. Next-generation sequencing for endocrine cancers: Recent advances and challenges.

    PubMed

    Suresh, Padmanaban S; Venkatesh, Thejaswini; Tsutsumi, Rie; Shetty, Abhishek

    2017-05-01

    Contemporary molecular biology research tools have enriched numerous areas of biomedical research that address challenging diseases, including endocrine cancers (pituitary, thyroid, parathyroid, adrenal, testicular, ovarian, and neuroendocrine cancers). These tools have placed several intriguing clues before the scientific community. Endocrine cancers pose a major challenge in health care and research despite considerable attempts by researchers to understand their etiology. Microarray analyses have provided gene signatures from many cells, tissues, and organs that can differentiate healthy states from diseased ones, and even show patterns that correlate with stages of a disease. Microarray data can also elucidate the responses of endocrine tumors to therapeutic treatments. The rapid progress in next-generation sequencing methods has overcome many of the initial challenges of these technologies, and their advantages over microarray techniques have enabled them to emerge as valuable aids for clinical research applications (prognosis, identification of drug targets, etc.). A comprehensive review describing the recent advances in next-generation sequencing methods and their application in the evaluation of endocrine and endocrine-related cancers is lacking. The main purpose of this review is to illustrate the concepts that collectively constitute our current view of the possibilities offered by next-generation sequencing technological platforms, challenges to relevant applications, and perspectives on the future of clinical genetic testing of patients with endocrine tumors. We focus on recent discoveries in the use of next-generation sequencing methods for clinical diagnosis of endocrine tumors in patients and conclude with a discussion on persisting challenges and future objectives.

  18. Sequence skill learning in persons who stutter: implications for cortico-striato-thalamo-cortical dysfunction.

    PubMed

    Smits-Bandstra, Sarah; De Nil, Luc F

    2007-01-01

    The basal ganglia and cortico-striato-thalamo-cortical connections are known to play a critical role in sequence skill learning and increasing automaticity over practice. The current paper reviews four studies comparing the sequence skill learning and the transition to automaticity of persons who stutter (PWS) and fluent speakers (PNS) over practice. Studies One and Two found PWS to have poor finger tap sequencing skill and nonsense syllable sequencing skill after practice, and on retention and transfer tests relative to PNS. Studies Three and Four found PWS to be significantly less accurate and/or significantly slower after practice on dual tasks requiring concurrent sequencing and colour recognition over practice relative to PNS. Evidence of PWS' deficits in sequence skill learning and automaticity development support the hypothesis that dysfunction in cortico-striato-thalamo-cortical connections may be one etiological component in the development and maintenance of stuttering. As a result of this activity, the reader will: (1) be able to articulate the research regarding the basal ganglia system relating to sequence skill learning; (2) be able to summarize the research on stuttering with indications of sequence skill learning deficits; and (3) be able to discuss basal ganglia mechanisms with relevance for theory of stuttering.

  19. [Development of laboratory sequence analysis software based on WWW and UNIX].

    PubMed

    Huang, Y; Gu, J R

    2001-01-01

    Sequence analysis tools based on WWW and UNIX were developed in our laboratory to meet the needs of molecular genetics research in our laboratory. General principles of computer analysis of DNA and protein sequences were also briefly discussed in this paper.

  20. Control of automated behavior: insights from the discrete sequence production task

    PubMed Central

    Abrahamse, Elger L.; Ruitenberg, Marit F. L.; de Kleine, Elian; Verwey, Willem B.

    2013-01-01

    Work with the discrete sequence production (DSP) task has provided a substantial literature on discrete sequencing skill over the last decades. The purpose of the current article is to provide a comprehensive overview of this literature and of the theoretical progress that it has prompted. We start with a description of the DSP task and the phenomena that are typically observed with it. Then we propose a cognitive model, the dual processor model (DPM), which explains performance of (skilled) discrete key-press sequences. Key features of this model are the distinction between a cognitive processor and a motor system (i.e., motor buffer and motor processor), the interplay between these two processing systems, and the possibility to execute familiar sequences in two different execution modes. We further discuss how this model relates to several related sequence skill research paradigms and models, and we outline outstanding questions for future research throughout the paper. We conclude by sketching a tentative neural implementation of the DPM. PMID:23515430

  1. Research Techniques Made Simple: Single-Cell RNA Sequencing and its Applications in Dermatology.

    PubMed

    Wu, Xiaojun; Yang, Bin; Udo-Inyang, Imo; Ji, Suyun; Ozog, David; Zhou, Li; Mi, Qing-Sheng

    2018-05-01

    RNA sequencing is one of the most highly reliable and reproducible methods of assessing the cell transcriptome. As high-throughput RNA sequencing libraries at the single cell level have recently developed, single cell RNA sequencing has become more feasible and popular in biology research. Single cell RNA sequencing allows investigators to evaluate cell transcriptional profiles at the single cell level. It has become a very useful tool to perform investigations that could not be addressed by other methodologies, such as the assessment of cell-to-cell variation, the identification of rare populations, and the determination of heterogeneity within a cell population. So far, the single cell RNA sequencing technique has been widely applied to embryonic development, immune cell development, and human disease progress and treatment. Here, we describe the history of single cell technology development and its potential application in the field of dermatology. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  2. Disclosure of Incidental Findings From Next-Generation Sequencing in Pediatric Genomic Research

    PubMed Central

    Abdul-Karim, Ruqayyah; Berkman, Benjamin E.; Wendler, David; Rid, Annette; Khan, Javed; Badgett, Tom

    2013-01-01

    Next-generation sequencing technologies will likely be used with increasing frequency in pediatric research. One consequence will be the increased identification of individual genomic research findings that are incidental to the aims of the research. Although researchers and ethicists have raised theoretical concerns about incidental findings in the context of genetic research, next-generation sequencing will make this once largely hypothetical concern an increasing reality. Most commentators have begun to accept the notion that there is some duty to disclose individual genetic research results to research subjects; however, the scope of that duty remains unclear. These issues are especially complicated in the pediatric setting, where subjects cannot currently but typically will eventually be able to make their own medical decisions at the age of adulthood. This article discusses the management of incidental findings in the context of pediatric genomic research. We provide an overview of the current literature and propose a framework to manage incidental findings in this unique context, based on what we believe is a limited responsibility to disclose. We hope this will be a useful source of guidance for investigators, institutional review boards, and bioethicists that anticipates the complicated ethical issues raised by advances in genomic technology. PMID:23400601

  3. Sma3s: a three-step modular annotator for large sequence datasets.

    PubMed

    Muñoz-Mérida, Antonio; Viguera, Enrique; Claros, M Gonzalo; Trelles, Oswaldo; Pérez-Pulido, Antonio J

    2014-08-01

    Automatic sequence annotation is an essential component of modern 'omics' studies, which aim to extract information from large collections of sequence data. Most existing tools use sequence homology to establish evolutionary relationships and assign putative functions to sequences. However, it can be difficult to define a similarity threshold that achieves sufficient coverage without sacrificing annotation quality. Defining the correct configuration is critical and can be challenging for non-specialist users. Thus, the development of robust automatic annotation techniques that generate high-quality annotations without needing expert knowledge would be very valuable for the research community. We present Sma3s, a tool for automatically annotating very large collections of biological sequences from any kind of gene library or genome. Sma3s is composed of three modules that progressively annotate query sequences using either: (i) very similar homologues, (ii) orthologous sequences or (iii) terms enriched in groups of homologous sequences. We trained the system using several random sets of known sequences, demonstrating average sensitivity and specificity values of ~85%. In conclusion, Sma3s is a versatile tool for high-throughput annotation of a wide variety of sequence datasets that outperforms the accuracy of other well-established annotation algorithms, and it can enrich existing database annotations and uncover previously hidden features. Importantly, Sma3s has already been used in the functional annotation of two published transcriptomes. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  4. TriageTools: tools for partitioning and prioritizing analysis of high-throughput sequencing data.

    PubMed

    Fimereli, Danai; Detours, Vincent; Konopka, Tomasz

    2013-04-01

    High-throughput sequencing is becoming a popular research tool but carries with it considerable costs in terms of computation time, data storage and bandwidth. Meanwhile, some research applications focusing on individual genes or pathways do not necessitate processing of a full sequencing dataset. Thus, it is desirable to partition a large dataset into smaller, manageable, but relevant pieces. We present a toolkit for partitioning raw sequencing data that includes a method for extracting reads that are likely to map onto pre-defined regions of interest. We show the method can be used to extract information about genes of interest from DNA or RNA sequencing samples in a fraction of the time and disk space required to process and store a full dataset. We report speedup factors between 2.6 and 96, depending on settings and samples used. The software is available at http://www.sourceforge.net/projects/triagetools/.

  5. Formative Research on the Simplifying Conditions Method (SCM) for Task Analysis and Sequencing.

    ERIC Educational Resources Information Center

    Kim, YoungHwan; Reigluth, Charles M.

    The Simplifying Conditions Method (SCM) is a set of guidelines for task analysis and sequencing of instructional content under the Elaboration Theory (ET). This article introduces the fundamentals of SCM and presents the findings from a formative research study on SCM. It was conducted in two distinct phases: design and instruction. In the first…

  6. Sequence walkers: a graphical method to display how binding proteins interact with DNA or RNA sequences | Center for Cancer Research

    Cancer.gov

    A graphical method is presented for displaying how binding proteins and other macromolecules interact with individual bases of nucleotide sequences. Characters representing the sequence are either oriented normally and placed above a line indicating favorable contact, or upside-down and placed below the line indicating unfavorable contact. The positive or negative height of

  7. An Observational Study of Children's Involvement in Informed Consent for Exome Sequencing Research.

    PubMed

    Miller, Victoria A; Werner-Lin, Allison; Walser, Sarah A; Biswas, Sawona; Bernhardt, Barbara A

    2017-02-01

    The goal of this study was to examine children's involvement in consent sessions for exome sequencing research and associations of involvement with provider and parent communication. Participants included 44 children (8-17 years) from five cohorts who were offered participation in an exome sequencing study. The consent sessions were audiotaped, transcribed, and coded. Providers attempted to facilitate the child's involvement in the majority (73%) of sessions, and most (75%) children also verbally participated. Provider facilitation was strongly associated with likelihood of child participation. These findings underscore that strategies such as asking for children's opinions and soliciting their questions show respect for children and may increase the likelihood that they are engaged and involved in decisions about research participation.

  8. Scientist, Single Cell Analysis Facility | Center for Cancer Research

    Cancer.gov

    The Cancer Research Technology Program (CRTP) develops and implements emerging technology, cancer biology expertise and research capabilities to accomplish NCI research objectives.  The CRTP is an outward-facing, multi-disciplinary hub purposed to enable the external cancer research community and provides dedicated support to NCI’s intramural Center for Cancer Research (CCR).  The dedicated units provide electron microscopy, protein characterization, protein expression, optical microscopy and nextGen sequencing. These research efforts are an integral part of CCR at the Frederick National Laboratory for Cancer Research (FNLCR).  CRTP scientists also work collaboratively with intramural NCI investigators to provide research technologies and expertise. KEY ROLES AND RESPONSIBILITIES We are seeking a highly motivated Scientist II to join the newly established Single Cell Analysis Facility (SCAF) of the Center for Cancer Research (CCR) at NCI. The SCAF will house state of the art single cell sequencing technologies including 10xGenomics Chromium, BD Genomics Rhapsody, DEPPArray, and other emerging single cell technologies. The Scientist: Will interact with close to 200 laboratories within the CCR to design and carry out single cell experiments for cancer research Will work on single cell isolation/preparation from various tissues and cells and related NexGen sequencing library preparation Is expected to author publications in peer reviewed scientific journals

  9. VAMPS: a website for visualization and analysis of microbial population structures.

    PubMed

    Huse, Susan M; Mark Welch, David B; Voorhis, Andy; Shipunova, Anna; Morrison, Hilary G; Eren, A Murat; Sogin, Mitchell L

    2014-02-05

    The advent of next-generation DNA sequencing platforms has revolutionized molecular microbial ecology by making the detailed analysis of complex communities over time and space a tractable research pursuit for small research groups. However, the ability to generate 10⁵-10⁸ reads with relative ease brings with it many downstream complications. Beyond the computational resources and skills needed to process and analyze data, it is difficult to compare datasets in an intuitive and interactive manner that leads to hypothesis generation and testing. We developed the free web service VAMPS (Visualization and Analysis of Microbial Population Structures, http://vamps.mbl.edu) to address these challenges and to facilitate research by individuals or collaborating groups working on projects with large-scale sequencing data. Users can upload marker gene sequences and associated metadata; reads are quality filtered and assigned to both taxonomic structures and to taxonomy-independent clusters. A simple point-and-click interface allows users to select for analysis any combination of their own or their collaborators' private data and data from public projects, filter these by their choice of taxonomic and/or abundance criteria, and then explore these data using a wide range of analytic methods and visualizations. Each result is extensively hyperlinked to other analysis and visualization options, promoting data exploration and leading to a greater understanding of data relationships. VAMPS allows researchers using marker gene sequence data to analyze the diversity of microbial communities and the relationships between communities, to explore these analyses in an intuitive visual context, and to download data, results, and images for publication. VAMPS obviates the need for individual research groups to make the considerable investment in computational infrastructure and bioinformatic support otherwise necessary to process, analyze, and interpret massive amounts of next-generation sequence data. Any web-capable device can be used to upload, process, explore, and extract data and results from VAMPS. VAMPS encourages researchers to share sequence and metadata, and fosters collaboration between researchers of disparate biomes who recognize common patterns in shared data.

  10. YM500: a small RNA sequencing (smRNA-seq) database for microRNA research

    PubMed Central

    Cheng, Wei-Chung; Chung, I-Fang; Huang, Tse-Shun; Chang, Shih-Ting; Sun, Hsing-Jen; Tsai, Cheng-Fong; Liang, Muh-Lii; Wong, Tai-Tong; Wang, Hsei-Wei

    2013-01-01

    MicroRNAs (miRNAs) are small RNAs ∼22 nt in length that are involved in the regulation of a variety of physiological and pathological processes. Advances in high-throughput small RNA sequencing (smRNA-seq), one of the next-generation sequencing applications, have reshaped the miRNA research landscape. In this study, we established an integrative database, the YM500 (http://ngs.ym.edu.tw/ym500/), containing analysis pipelines and analysis results for 609 human and mice smRNA-seq results, including public data from the Gene Expression Omnibus (GEO) and some private sources. YM500 collects analysis results for miRNA quantification, for isomiR identification (incl. RNA editing), for arm switching discovery, and, more importantly, for novel miRNA predictions. Wetlab validation on >100 miRNAs confirmed high correlation between miRNA profiling and RT-qPCR results (R = 0.84). This database allows researchers to search these four different types of analysis results via our interactive web interface. YM500 allows researchers to define the criteria of isomiRs, and also integrates the information of dbSNP to help researchers distinguish isomiRs from SNPs. A user-friendly interface is provided to integrate miRNA-related information and existing evidence from hundreds of sequencing datasets. The identified novel miRNAs and isomiRs hold the potential for both basic research and biotech applications. PMID:23203880

  11. Multimodal sequence learning.

    PubMed

    Kemény, Ferenc; Meier, Beat

    2016-02-01

    While sequence learning research models complex phenomena, previous studies have mostly focused on unimodal sequences. The goal of the current experiment is to put implicit sequence learning into a multimodal context: to test whether it can operate across different modalities. We used the Task Sequence Learning paradigm to test whether sequence learning varies across modalities, and whether participants are able to learn multimodal sequences. Our results show that implicit sequence learning is very similar regardless of the source modality. However, the presence of correlated task and response sequences was required for learning to take place. The experiment provides new evidence for implicit sequence learning of abstract conceptual representations. In general, the results suggest that correlated sequences are necessary for implicit sequence learning to occur. Moreover, they show that elements from different modalities can be automatically integrated into one unitary multimodal sequence. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. Advances in single-cell RNA sequencing and its applications in cancer research.

    PubMed

    Zhu, Sibo; Qing, Tao; Zheng, Yuanting; Jin, Li; Shi, Leming

    2017-08-08

    Unlike population-level approaches, single-cell RNA sequencing enables transcriptomic analysis of an individual cell. Through the combination of high-throughput sequencing and bioinformatic tools, single-cell RNA-seq can detect more than 10,000 transcripts in one cell to distinguish cell subsets and dynamic cellular changes. After several years' development, single-cell RNA-seq can now achieve massively parallel, full-length mRNA sequencing as well as in situ sequencing and even has potential for multi-omic detection. One appealing area of single-cell RNA-seq is cancer research, and it is regarded as a promising way to enhance prognosis and provide more precise target therapy by identifying druggable subclones. Indeed, progresses have been made regarding solid tumor analysis to reveal intratumoral heterogeneity, correlations between signaling pathways, stemness, drug resistance, and tumor architecture shaping the microenvironment. Furthermore, through investigation into circulating tumor cells, many genes have been shown to promote a propensity toward stemness and the epithelial-mesenchymal transition, to enhance anchoring and adhesion, and to be involved in mechanisms of anoikis resistance and drug resistance. This review focuses on advances and progresses of single-cell RNA-seq with regard to the following aspects: 1. Methodologies of single-cell RNA-seq 2. Single-cell isolation techniques 3. Single-cell RNA-seq in solid tumor research 4. Single-cell RNA-seq in circulating tumor cell research 5.

  13. Advances in single-cell RNA sequencing and its applications in cancer research

    PubMed Central

    Zhu, Sibo; Qing, Tao; Zheng, Yuanting; Jin, Li; Shi, Leming

    2017-01-01

    Unlike population-level approaches, single-cell RNA sequencing enables transcriptomic analysis of an individual cell. Through the combination of high-throughput sequencing and bioinformatic tools, single-cell RNA-seq can detect more than 10,000 transcripts in one cell to distinguish cell subsets and dynamic cellular changes. After several years’ development, single-cell RNA-seq can now achieve massively parallel, full-length mRNA sequencing as well as in situ sequencing and even has potential for multi-omic detection. One appealing area of single-cell RNA-seq is cancer research, and it is regarded as a promising way to enhance prognosis and provide more precise target therapy by identifying druggable subclones. Indeed, progresses have been made regarding solid tumor analysis to reveal intratumoral heterogeneity, correlations between signaling pathways, stemness, drug resistance, and tumor architecture shaping the microenvironment. Furthermore, through investigation into circulating tumor cells, many genes have been shown to promote a propensity toward stemness and the epithelial-mesenchymal transition, to enhance anchoring and adhesion, and to be involved in mechanisms of anoikis resistance and drug resistance. This review focuses on advances and progresses of single-cell RNA-seq with regard to the following aspects: 1. Methodologies of single-cell RNA-seq 2. Single-cell isolation techniques 3. Single-cell RNA-seq in solid tumor research 4. Single-cell RNA-seq in circulating tumor cell research 5. Perspectives PMID:28881849

  14. Pleurochrysome: A Web Database of Pleurochrysis Transcripts and Orthologs Among Heterogeneous Algae

    PubMed Central

    Fujiwara, Shoko; Takatsuka, Yukiko; Hirokawa, Yasutaka; Tsuzuki, Mikio; Takano, Tomoyuki; Kobayashi, Masaaki; Suda, Kunihiro; Asamizu, Erika; Yokoyama, Koji; Shibata, Daisuke; Tabata, Satoshi; Yano, Kentaro

    2016-01-01

    Pleurochrysis is a coccolithophorid genus, which belongs to the Coccolithales in the Haptophyta. The genus has been used extensively for biological research, together with Emiliania in the Isochrysidales, to understand distinctive features between the two coccolithophorid-including orders. However, molecular biological research on Pleurochrysis such as elucidation of the molecular mechanism behind coccolith formation has not made great progress at least in part because of lack of comprehensive gene information. To provide such information to the research community, we built an open web database, the Pleurochrysome (http://bioinf.mind.meiji.ac.jp/phapt/), which currently stores 9,023 unique gene sequences (designated as UNIGENEs) assembled from expressed sequence tag sequences of P. haptonemofera as core information. The UNIGENEs were annotated with gene sequences sharing significant homology, conserved domains, Gene Ontology, KEGG Orthology, predicted subcellular localization, open reading frames and orthologous relationship with genes of 10 other algal species, a cyanobacterium and the yeast Saccharomyces cerevisiae. This sequence and annotation information can be easily accessed via several search functions. Besides fundamental functions such as BLAST and keyword searches, this database also offers search functions to explore orthologous genes in the 12 organisms and to seek novel genes. The Pleurochrysome will promote molecular biological and phylogenetic research on coccolithophorids and other haptophytes by helping scientists mine data from the primary transcriptome of P. haptonemofera. PMID:26746174

  15. Encoding and choice in the task span paradigm.

    PubMed

    Reiman, Kaitlin M; Weaver, Starla M; Arrington, Catherine M

    2015-03-01

    Cognitive control during sequences of planned behaviors requires both plan-level processes such as generating, maintaining, and monitoring the plan, as well as task-level processes such as selecting, establishing and implementing specific task sets. The task span paradigm (Logan in J Exp Psychol Gen 133:218-236, 2004) combines two common cognitive control paradigms, task switching and working memory span, to investigate the integration of plan-level and task-level processes during control of sequential behavior. The current study expands past task span research to include measures of encoding processes and choice behavior with volitional sequence generation, using the standard task span as well as a novel voluntary task span paradigm. In two experiments, we consider how sequence complexity, defined separately for plan-level and task-level complexity, influences sequence encoding (Experiment 1), sequence choice (Experiment 2), sequence memory, and task performance of planned sequences of action. Results indicate that participants were sensitive to sequence complexity, but that different aspects of behavior are most strongly influenced by different types of complexity. Hierarchical complexity at the plan level best predicts voluntary sequence generation and memory; while switch frequency at the task level best predicts encoding of externally defined sequences and task performance. Furthermore, performance RTs were similar for externally and internally defined plans, whereas memory was improved for internally defined sequences. Finally, participants demonstrated a significant sequence choice bias in the voluntary task span. Consistent with past research on choice behavior, volitional selection of plans was markedly influenced by both the ease of memory and performance.

  16. A descriptive investigation of the impact of student research projects arising from elective research courses.

    PubMed

    Harirforoosh, Sam; Stewart, David W

    2016-01-27

    Pharmacy academicians have noted the need to develop research skills in student pharmacists. At the Gatton College of Pharmacy, significant focus has been placed on the development of research skills through offering elective research courses. In order to evaluate the impact of participation in the research elective(s), we analyzed college records and surveyed faculty members with regard to the number of poster/podium presentations, published peer-reviewed manuscripts, and funded projects. Student enrollment in the research elective sequence has increased over time and has resulted in 81 poster presentations, 14 podium presentations, and 15 peer-reviewed publications. Implementation of a research elective sequence and fostering of a research culture amongst the faculty and students has resulted in increased student engagement in research and related scholarly activities.

  17. Adaptive Learning Resources Sequencing in Educational Hypermedia Systems

    ERIC Educational Resources Information Center

    Karampiperis, Pythagoras; Sampson, Demetrios

    2005-01-01

    Adaptive learning resources selection and sequencing is recognized as among the most interesting research questions in adaptive educational hypermedia systems (AEHS). In order to adaptively select and sequence learning resources in AEHS, the definition of adaptation rules contained in the Adaptation Model, is required. Although, some efforts have…

  18. Sequences for Student Investigation

    ERIC Educational Resources Information Center

    Barton, Jeffrey; Feil, David; Lartigue, David; Mullins, Bernadette

    2004-01-01

    We describe two classes of sequences that give rise to accessible problems for undergraduate research. These problems may be understood with virtually no prerequisites and are well suited for computer-aided investigation. The first sequence is a variation of one introduced by Stephen Wolfram in connection with his study of cellular automata. The…

  19. Open source tools to exploit DNA sequence data from livestock species

    USDA-ARS?s Scientific Manuscript database

    Next-Generation Sequencing (NGS) is a recent technological development that allows researchers to rapidly determine the DNA sequence of an individual. The decrease in cost of NGS has brought the technology into the realm of practical applications in livestock genomics, where it can be used to genera...

  20. Computational complexity of algorithms for sequence comparison, short-read assembly and genome alignment.

    PubMed

    Baichoo, Shakuntala; Ouzounis, Christos A

    A multitude of algorithms for sequence comparison, short-read assembly and whole-genome alignment have been developed in the general context of molecular biology, to support technology development for high-throughput sequencing, numerous applications in genome biology and fundamental research on comparative genomics. The computational complexity of these algorithms has been previously reported in original research papers, yet this often neglected property has not been reviewed previously in a systematic manner and for a wider audience. We provide a review of space and time complexity of key sequence analysis algorithms and highlight their properties in a comprehensive manner, in order to identify potential opportunities for further research in algorithm or data structure optimization. The complexity aspect is poised to become pivotal as we will be facing challenges related to the continuous increase of genomic data on unprecedented scales and complexity in the foreseeable future, when robust biological simulation at the cell level and above becomes a reality. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. UCSC genome browser: deep support for molecular biomedical research.

    PubMed

    Mangan, Mary E; Williams, Jennifer M; Lathe, Scott M; Karolchik, Donna; Lathe, Warren C

    2008-01-01

    The volume and complexity of genomic sequence data, and the additional experimental data required for annotation of the genomic context, pose a major challenge for display and access for biomedical researchers. Genome browsers organize this data and make it available in various ways to extract useful information to advance research projects. The UCSC Genome Browser is one of these resources. The official sequence data for a given species forms the framework to display many other types of data such as expression, variation, cross-species comparisons, and more. Visual representations of the data are available for exploration. Data can be queried with sequences. Complex database queries are also easily achieved with the Table Browser interface. Associated tools permit additional query types or access to additional data sources such as images of in situ localizations. Support for solving researcher's issues is provided with active discussion mailing lists and by providing updated training materials. The UCSC Genome Browser provides a source of deep support for a wide range of biomedical molecular research (http://genome.ucsc.edu).

  2. Advances in high throughput DNA sequence data compression.

    PubMed

    Sardaraz, Muhammad; Tahir, Muhammad; Ikram, Ataul Aziz

    2016-06-01

    Advances in high throughput sequencing technologies and reduction in cost of sequencing have led to exponential growth in high throughput DNA sequence data. This growth has posed challenges such as storage, retrieval, and transmission of sequencing data. Data compression is used to cope with these challenges. Various methods have been developed to compress genomic and sequencing data. In this article, we present a comprehensive review of compression methods for genome and reads compression. Algorithms are categorized as referential or reference free. Experimental results and comparative analysis of various methods for data compression are presented. Finally, key challenges and research directions in DNA sequence data compression are highlighted.

  3. Large-scale transcriptome sequencing and gene analyses in the crab-eating macaque (Macaca fascicularis) for biomedical research

    PubMed Central

    2012-01-01

    Background As a human replacement, the crab-eating macaque (Macaca fascicularis) is an invaluable non-human primate model for biomedical research, but the lack of genetic information on this primate has represented a significant obstacle for its broader use. Results Here, we sequenced the transcriptome of 16 tissues originated from two individuals of crab-eating macaque (male and female), and identified genes to resolve the main obstacles for understanding the biological response of the crab-eating macaque. From 4 million reads with 1.4 billion base sequences, 31,786 isotigs containing genes similar to those of humans, 12,672 novel isotigs, and 348,160 singletons were identified using the GS FLX sequencing method. Approximately 86% of human genes were represented among the genes sequenced in this study. Additionally, 175 tissue-specific transcripts were identified, 81 of which were experimentally validated. In total, 4,314 alternative splicing (AS) events were identified and analyzed. Intriguingly, 10.4% of AS events were associated with transposable element (TE) insertions. Finally, investigation of TE exonization events and evolutionary analysis were conducted, revealing interesting phenomena of human-specific amplified trends in TE exonization events. Conclusions This report represents the first large-scale transcriptome sequencing and genetic analyses of M. fascicularis and could contribute to its utility for biomedical research and basic biology. PMID:22554259

  4. TARGETED CAPTURE IN EVOLUTIONARY AND ECOLOGICAL GENOMICS

    PubMed Central

    Jones, Matthew R.; Good, Jeffrey M.

    2016-01-01

    The rapid expansion of next-generation sequencing has yielded a powerful array of tools to address fundamental biological questions at a scale that was inconceivable just a few years ago. Various genome partitioning strategies to sequence select subsets of the genome have emerged as powerful alternatives to whole genome sequencing in ecological and evolutionary genomic studies. High throughput targeted capture is one such strategy that involves the parallel enrichment of pre-selected genomic regions of interest. The growing use of targeted capture demonstrates its potential power to address a range of research questions, yet these approaches have yet to expand broadly across labs focused on evolutionary and ecological genomics. In part, the use of targeted capture has been hindered by the logistics of capture design and implementation in species without established reference genomes. Here we aim to 1) increase the accessibility of targeted capture to researchers working in non-model taxa by discussing capture methods that circumvent the need of a reference genome, 2) highlight the evolutionary and ecological applications where this approach is emerging as a powerful sequencing strategy, and 3) discuss the future of targeted capture and other genome partitioning approaches in light of the increasing accessibility of whole genome sequencing. Given the practical advantages and increasing feasibility of high-throughput targeted capture, we anticipate an ongoing expansion of capture-based approaches in evolutionary and ecological research, synergistic with an expansion of whole genome sequencing. PMID:26137993

  5. The autism sequencing consortium: large-scale, high-throughput sequencing in autism spectrum disorders.

    PubMed

    Buxbaum, Joseph D; Daly, Mark J; Devlin, Bernie; Lehner, Thomas; Roeder, Kathryn; State, Matthew W

    2012-12-20

    Research during the past decade has seen significant progress in the understanding of the genetic architecture of autism spectrum disorders (ASDs), with gene discovery accelerating as the characterization of genomic variation has become increasingly comprehensive. At the same time, this research has highlighted ongoing challenges. Here we address the enormous impact of high-throughput sequencing (HTS) on ASD gene discovery, outline a consensus view for leveraging this technology, and describe a large multisite collaboration developed to accomplish these goals. Similar approaches could prove effective for severe neurodevelopmental disorders more broadly. Copyright © 2012 Elsevier Inc. All rights reserved.

  6. Fungal genome sequencing: basic biology to biotechnology.

    PubMed

    Sharma, Krishna Kant

    2016-08-01

    The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet's stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research.

  7. [Progress on molecular biology of Isaria farinosa, pathogen of host of Ophiocordyceps sinensis during the artificial culture].

    PubMed

    Liu, Fei; Wu, Xiao-Li; Liu, Ying; Chen, Da-Xia; Zhang, De-Li; Yang, Da-Jian

    2016-02-01

    Isaria farinosa is the pathogen of the host of Ophiocordyceps sinensis. The present research has analyzed the progress on the molecular biology according to the bibliometrics, the sequences (including the gene sequences) of I. farinosa in the NCBI. The results indicated that different country had published different number of the papers, and had landed different kinds and different number of the sequences (including the gene sequences). China had published the most number of the papers, and had landed the most number of the sequences (including the gene sequences). America had landed the most numbers of the function genes. The main content about the pathogen study was focus on the biological controlling. The main content about the molecular study concentrated on the phylogenies classification. In recent years some protease genes and chitinase genes had been researched. With the increase of the effect on the healthy of O. sinensis, and the whole sequence and more and more pharmacological activities of I. farinosa being made known to the public, the study on the molecular biology of the I. farinosa would be deeper and wider. Copyright© by the Chinese Pharmaceutical Association.

  8. Genome Sequencing Technologies and Nursing: What Are the Roles of Nurses and Nurse Scientists?

    PubMed

    Taylor, Jacquelyn Y; Wright, Michelle L; Hickey, Kathleen T; Housman, David E

    Advances in DNA sequencing technology have resulted in an abundance of personalized data with challenging clinical utility and meaning for clinicians. This wealth of data has potential to dramatically impact the quality of healthcare. Nurses are at the focal point in educating patients regarding relevant healthcare needs; therefore, an understanding of sequencing technology and utilizing these data are critical. The objective of this study was to explicate the role of nurses and nurse scientists as integral members of healthcare teams in improving understanding of DNA sequencing data and translational genomics for patients. A history of the nurse role in newborn screening is used as an exemplar. This study serves as an exemplar on how genome sequencing has been utilized in nursing science and incorporates linkages of other omics approaches used by nurses that are included in this special issue. This special issue showcased nurse scientists conducting multi-omic research from various methods, including targeted candidate genes, pharmacogenomics, proteomics, epigenomics, and the microbiome. From this vantage point, we provide an overview of the roles of nurse scientists in genome sequencing research and provide recommendations for the best utilization of nurses and nurse scientists related to genome sequencing.

  9. Problems in turbidite research: A need for COMFAN

    USGS Publications Warehouse

    Normark, W.R.; Mutti, E.; Bouma, A.H.

    1984-01-01

    Comparison of modern submarine fans and ancient turbidite sequences is still in its infancy, mainly because of the incompatibility of study approaches. Research on modern fan systems mainly deals with morphologic aspects and surficial sediments, while observations on ancient turbidite formations are mostly directed to vertical sequences. The lack of a common data set also results from different scales of observation. To review the current status of modern and ancient turbidite research, an international group of specialists formed COMFAN (Committee on Fans) and met in September 1982 at the Gulf Research and Development Company research facilities in Pennsylvania. ?? 1984 Springer-Verlag New York Inc.

  10. Improving transmission efficiency of large sequence alignment/map (SAM) files.

    PubMed

    Sakib, Muhammad Nazmus; Tang, Jijun; Zheng, W Jim; Huang, Chin-Tser

    2011-01-01

    Research in bioinformatics primarily involves collection and analysis of a large volume of genomic data. Naturally, it demands efficient storage and transfer of this huge amount of data. In recent years, some research has been done to find efficient compression algorithms to reduce the size of various sequencing data. One way to improve the transmission time of large files is to apply a maximum lossless compression on them. In this paper, we present SAMZIP, a specialized encoding scheme, for sequence alignment data in SAM (Sequence Alignment/Map) format, which improves the compression ratio of existing compression tools available. In order to achieve this, we exploit the prior knowledge of the file format and specifications. Our experimental results show that our encoding scheme improves compression ratio, thereby reducing overall transmission time significantly.

  11. Development and evaluation of clicker methodology for introductory physics courses

    NASA Astrophysics Data System (ADS)

    Lee, Albert H.

    Many educators understand that lectures are cost effective but not learning efficient, so continue to search for ways to increase active student participation in this traditionally passive learning environment. In-class polling systems, or "clickers", are inexpensive and reliable tools allowing students to actively participate in lectures by answering multiple-choice questions. Students assess their learning in real time by observing instant polling summaries displayed in front of them. This in turn motivates additional discussions which increase the opportunity for active learning. We wanted to develop a comprehensive clicker methodology that creates an active lecture environment for a broad spectrum of students taking introductory physics courses. We wanted our methodology to incorporate many findings of contemporary learning science. It is recognized that learning requires active construction; students need to be actively involved in their own learning process. Learning also depends on preexisting knowledge; students construct new knowledge and understandings based on what they already know and believe. Learning is context dependent; students who have learned to apply a concept in one context may not be able to recognize and apply the same concept in a different context, even when both contexts are considered to be isomorphic by experts. On this basis, we developed question sequences, each involving the same concept but having different contexts. Answer choices are designed to address students preexisting knowledge. These sequences are used with the clickers to promote active discussions and multiple assessments. We have created, validated, and evaluated sequences sufficient in number to populate all of introductory physics courses. Our research has found that using clickers with our question sequences significantly improved student conceptual understanding. Our research has also found how to best measure student conceptual gain using research-based instruments. Finally, we discovered that students need to have full access to the question sequences after lectures to reap the maximum benefit. Chapter 1 provides an introduction to our research. Chapter 2 provides a literature review relevant for our research. Chapter 3 discusses the creation of the clicker question sequences. Chapter 4 provides a picture of the validation process involving both physics experts and the introductory physics students. Chapter 5 describes how the sequences have been used with clickers in lectures. Chapter 6 provides the evaluation of the effectiveness of the clicker methodology. Chapter 7 contains a brief summary of research results and conclusions.

  12. How Incidental Sequence Learning Creates Reportable Knowledge: The Role of Unexpected Events

    ERIC Educational Resources Information Center

    Runger, Dennis; Frensch, Peter A.

    2008-01-01

    Research on incidental sequence learning typically is concerned with the characteristics of implicit or nonconscious learning. In this article, the authors aim to elucidate the cognitive mechanisms that contribute to the generation of explicit, reportable sequence knowledge. According to the unexpected-event hypothesis (P. A. Frensch, H. Haider,…

  13. Identifying Students' Conceptions of Basic Principles in Sequence Stratigraphy

    ERIC Educational Resources Information Center

    Herrera, Juan S.; Riggs, Eric M.

    2013-01-01

    Sequence stratigraphy is a major research subject in the geosciences academia and the oil industry. However, the geoscience education literature addressing students' understanding of the basic concepts of sequence stratigraphy is relatively thin, and the topic has not been well explored. We conducted an assessment of 27 students' conceptions of…

  14. Dynamic Assessment of Microbial Ecology (DAME): A web app for interactive analysis and visualization of microbial sequencing data

    USDA-ARS?s Scientific Manuscript database

    Dynamic Assessment of Microbial Ecology (DAME) is a shiny-based web application for interactive analysis and visualization of microbial sequencing data. DAME provides researchers not familiar with R programming the ability to access the most current R functions utilized for ecology and gene sequenci...

  15. Visual Sequence Learning in Infancy: Domain-General and Domain-Specific Associations with Language

    ERIC Educational Resources Information Center

    Shafto, Carissa L.; Conway, Christopher M.; Field, Suzanne L.; Houston, Derek M.

    2012-01-01

    Research suggests that nonlinguistic sequence learning abilities are an important contributor to language development (Conway, Bauernschmidt, Huang, & Pisoni, 2010). The current study investigated visual sequence learning (VSL) as a possible predictor of vocabulary development in infants. Fifty-eight 8.5-month-old infants were presented with a…

  16. Full Genome Sequence of Giant Panda Rotavirus Strain CH-1

    PubMed Central

    Guo, Ling; Yang, Shaolin; Wang, Chengdong; Chen, Shijie; Yang, Xiaonong; Hou, Rong; Quan, Zifang; Hao, Zhongxiang

    2013-01-01

    We report here the complete genomic sequence of the giant panda rotavirus strain CH-1. This work is the first to document the complete genomic sequence (segments 1 to 11) of the CH-1 strain, which offers an effective platform for providing authentic research experiences to novice scientists. PMID:23469354

  17. Designing and Evaluating Research-Based Instructional Sequences for Introducing Magnetic Fields

    ERIC Educational Resources Information Center

    Guisasola, Jenaro; Almudi, Jose Manuel; Ceberio, Mikel; Zubimendi, Jose Luis

    2009-01-01

    This study examines the didactic suitability of introducing a teaching sequence when teaching the concept of magnetic fields within introductory physics courses at the university level. This instructional sequence was designed taking into account students' common conceptions, an analysis of the course content, and the history of the development of…

  18. Enabling next-gen sequencing and analysis at the USDA-ARS U.S. Meat Animal Research Center with MiniLIMS

    USDA-ARS?s Scientific Manuscript database

    There is a growing need to combine DNA sequencing technologies to address complex problems in genome biology. These genomic studies routinely generate voluminous image, sequence, and mapping files that should be associated with quality control information (gels, spectra, etc.), and other important ...

  19. The complete nucleotide sequence and genomic characterization of tropical soda apple mosaic virus

    USDA-ARS?s Scientific Manuscript database

    Tropical soda apple mosaic virus (TSAMV) was first identified in tropical soda apple (Solanum viarum), a noxious weed, in Florida in 2002. This report provides the first full genome sequence of TSAMV. The full genome sequence of this virus will enable research scientists to develop additional spec...

  20. Whole exome sequencing: a state-of-the-art approach for defining (and exploring!) genetic landscapes in pediatric nephrology.

    PubMed

    Gulati, Ashima; Somlo, Stefan

    2018-05-01

    The genesis of whole exome sequencing as a powerful tool for detailing the protein coding sequence of the human genome was conceptualized based on the availability of next-generation sequencing technology and knowledge of the human reference genome. The field of pediatric nephrology enriched with molecularly unsolved phenotypes is allowing the clinical and research application of whole exome sequencing to enable novel gene discovery and provide amendment of phenotypic misclassification. Recent studies in the field have informed us that newer high-throughput sequencing techniques are likely to be of high yield when applied in conjunction with conventional genomic approaches such as linkage analysis and other strategies used to focus subsequent analysis. They have also emphasized the need for the validation of novel genetic findings in large collaborative cohorts and the production of robust corroborative biological data. The well-structured application of comprehensive genomic testing in clinical and research arenas will hopefully continue to advance patient care and precision medicine, but does call for attention to be paid to its integrated challenges.

  1. Identifiability, genomics and U.K. data protection law.

    PubMed

    Curren, Liam; Boddington, Paula; Gowans, Heather; Hawkins, Naomi; Kanellopoulou, Nadja; Kaye, Jane; Melham, Karen

    2010-09-01

    Analyses of individuals' genomes--their entire DNA sequence--have increased knowledge about the links between genetics and disease. Anticipated advances in 'next generation' DNA-sequencing techniques will see the routine research use of whole genomes, rather than distinct parts, within the next few years. The scientific benefits of genomic research are, however, accompanied by legal and ethical concerns. Despite the assumption that genetic research data can and will be rendered anonymous, participants' identities can sometimes be elucidated, which could cause data protection legislation to apply. We undertake a timely reappraisal of these laws--particularly new penalties--and identifiability in genomic research.

  2. Interaction of healthcare worker hands and portable medical equipment: a sequence analysis to show potential transmission opportunities.

    PubMed

    Jinadatha, Chetan; Villamaria, Frank C; Coppin, John D; Dale, Charles R; Williams, Marjory D; Whitworth, Ryan; Stibich, Mark

    2017-12-28

    While research has demonstrated the importance of a clean health care environment, there is a lack of research on the role portable medical equipment (PME) play in the transmission cycle of healthcare-acquired infections (HAIs). This study investigated the patterns and sequence of contact events among health care workers, patients, surfaces, and medical equipment in a hospital environment. Research staff observed patient care events over six different 24 h periods on six different hospital units. Each encounter was recorded as a sequence of events and analyzed using sequence analysis and visually represented by network plots. In addition, a point prevalence microbial sample was taken from the computer on wheels (COW). The most touched items during patient care was the individual patient (850), bedrail (375), bed-surface (302), and bed side Table (223). Three of the top ten most common subsequences included touching PME and the patient: computer on wheels ➔ patient (62 of 274 total sequences, 22.6%, contained this sequence), patient ➔ COW (20.4%), and patient ➔ IV pump (16.1%). The network plots revealed large interconnectedness among objects in the room, the patient, PME, and the healthcare worker. Our results demonstrated that PME such as COW and IV pump were two of the most highly-touched items during patient care. Even with proper hand sanitization and personal protective equipment, this sequence analysis reveals the potential for contamination from the patient and environment, to a vector such as portable medical equipment, and ultimately to another patient in the hospital.

  3. Library construction for next-generation sequencing: Overviews and challenges

    PubMed Central

    Head, Steven R.; Komori, H. Kiyomi; LaMere, Sarah A.; Whisenant, Thomas; Van Nieuwerburgh, Filip; Salomon, Daniel R.; Ordoukhanian, Phillip

    2014-01-01

    High-throughput sequencing, also known as next-generation sequencing (NGS), has revolutionized genomic research. In recent years, NGS technology has steadily improved, with costs dropping and the number and range of sequencing applications increasing exponentially. Here, we examine the critical role of sequencing library quality and consider important challenges when preparing NGS libraries from DNA and RNA sources. Factors such as the quantity and physical characteristics of the RNA or DNA source material as well as the desired application (i.e., genome sequencing, targeted sequencing, RNA-seq, ChIP-seq, RIP-seq, and methylation) are addressed in the context of preparing high quality sequencing libraries. In addition, the current methods for preparing NGS libraries from single cells are also discussed. PMID:24502796

  4. PCR Amplification Strategies towards full-length HIV-1 Genome sequencing.

    PubMed

    Liu, Chao Chun; Ji, Hezhao

    2018-06-26

    The advent of next generation sequencing has enabled greater resolution of viral diversity and improved feasibility of full viral genome sequencing allowing routine HIV-1 full genome sequencing in both research and diagnostic settings. Regardless of the sequencing platform selected, successful PCR amplification of the HIV-1 genome is essential for sequencing template preparation. As such, full HIV-1 genome amplification is a crucial step in dictating the successful and reliable sequencing downstream. Here we reviewed existing PCR protocols leading to HIV-1 full genome sequencing. In addition to the discussion on basic considerations on relevant PCR design, the advantages as well as the pitfalls of published protocols were reviewed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  5. Single molecule sequencing of the M13 virus genome without amplification

    PubMed Central

    Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X.; Yan, Qin; Deem, Michael W.; He, Jiankui

    2017-01-01

    Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias. PMID:29253901

  6. Single molecule sequencing of the M13 virus genome without amplification.

    PubMed

    Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X; Yan, Qin; Deem, Michael W; He, Jiankui

    2017-01-01

    Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias.

  7. The fsr Quorum-Sensing System and Cognate Gelatinase Orchestrate the Expression and Processing of Proprotein EF_1097 into the Mature Antimicrobial Peptide Enterocin O16.

    PubMed

    Dundar, Halil; Brede, Dag A; La Rosa, Sabina Leanti; El-Gendy, Ahmed Osama; Diep, Dzung B; Nes, Ingolf F

    2015-07-01

    A novel antimicrobial peptide designated enterocin O16 was purified from Enterococcus faecalis. Mass spectrometry showed a monoisotopic mass of 7,231 Da, and N-terminal Edman degradation identified a 29-amino-acid sequence corresponding to residues 90 to 119 of the EF_1097 protein. Bioinformatic analysis showed that enterocin O16 is composed of the 68 most C-terminal residues of the EF_1097 protein. Introduction of an in-frame isogenic deletion in the ef1097 gene abolished the production of enterocin O16. Enterocin O16 has a narrow inhibitory spectrum, as it inhibits mostly lactobacilli. Apparently, E. faecalis is intrinsically resistant to the antimicrobial peptide, as no immunity connected to the production of enterocin O16 could be identified. ef1097 has previously been identified as one of three loci regulated by the fsr quorum-sensing system. The introduction of a nonsense mutation into fsrB consistently impaired enterocin O16 production, but externally added gelatinase biosynthesis-activating pheromone restored the antimicrobial activity. Functional genetic analysis showed that the EF_1097 proprotein is processed extracellularly into enterocin O16 by the metalloprotease GelE. Thus, it is evident that the fsr quorum-sensing system constitutes the regulatory unit that controls the expression of the EF_1097 precursor protein and the protease GelE and that the latter is required for the formation of enterocin O16. On the basis of these results, this study identified antibacterial antagonism as a novel aspect related to the function of fsr and provides a rationale for why ef1097 is part of the fsr regulon. The fsr quorum-sensing system modulates important physiological functions in E. faecalis via the activity of GelE. The present study presents a new facet of fsr signaling. The system controls the expression of three primary target operons (fsrABCD, gelE-sprE, and ef1097-ef1097b). We demonstrate that the concerted expression of these operons constitutes the elements necessary for the production of a bacteriocin-type peptide and that antimicrobial antagonism is an intrinsic function of fsr. The bacteriocin enterocin O16 consists of the 68 most C-terminal residues of the EF_1097 secreted proprotein. The GelE protease processes the EF_1097 proprotein into enterocin O16. In this manner, fsr signaling enables E. faecalis populations to express antimicrobial activity in a cell density-dependent manner. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  8. The fsr Quorum-Sensing System and Cognate Gelatinase Orchestrate the Expression and Processing of Proprotein EF_1097 into the Mature Antimicrobial Peptide Enterocin O16

    PubMed Central

    Dundar, Halil; Brede, Dag A.; La Rosa, Sabina Leanti; El-Gendy, Ahmed Osama; Diep, Dzung B.

    2015-01-01

    ABSTRACT A novel antimicrobial peptide designated enterocin O16 was purified from Enterococcus faecalis. Mass spectrometry showed a monoisotopic mass of 7,231 Da, and N-terminal Edman degradation identified a 29-amino-acid sequence corresponding to residues 90 to 119 of the EF_1097 protein. Bioinformatic analysis showed that enterocin O16 is composed of the 68 most C-terminal residues of the EF_1097 protein. Introduction of an in-frame isogenic deletion in the ef1097 gene abolished the production of enterocin O16. Enterocin O16 has a narrow inhibitory spectrum, as it inhibits mostly lactobacilli. Apparently, E. faecalis is intrinsically resistant to the antimicrobial peptide, as no immunity connected to the production of enterocin O16 could be identified. ef1097 has previously been identified as one of three loci regulated by the fsr quorum-sensing system. The introduction of a nonsense mutation into fsrB consistently impaired enterocin O16 production, but externally added gelatinase biosynthesis-activating pheromone restored the antimicrobial activity. Functional genetic analysis showed that the EF_1097 proprotein is processed extracellularly into enterocin O16 by the metalloprotease GelE. Thus, it is evident that the fsr quorum-sensing system constitutes the regulatory unit that controls the expression of the EF_1097 precursor protein and the protease GelE and that the latter is required for the formation of enterocin O16. On the basis of these results, this study identified antibacterial antagonism as a novel aspect related to the function of fsr and provides a rationale for why ef1097 is part of the fsr regulon. IMPORTANCE The fsr quorum-sensing system modulates important physiological functions in E. faecalis via the activity of GelE. The present study presents a new facet of fsr signaling. The system controls the expression of three primary target operons (fsrABCD, gelE-sprE, and ef1097-ef1097b). We demonstrate that the concerted expression of these operons constitutes the elements necessary for the production of a bacteriocin-type peptide and that antimicrobial antagonism is an intrinsic function of fsr. The bacteriocin enterocin O16 consists of the 68 most C-terminal residues of the EF_1097 secreted proprotein. The GelE protease processes the EF_1097 proprotein into enterocin O16. In this manner, fsr signaling enables E. faecalis populations to express antimicrobial activity in a cell density-dependent manner. PMID:25733609

  9. Production and structural characterization of amino terminally histidine tagged human oncostatin M in E. coli.

    PubMed

    Sporeno, E; Barbato, G; Graziani, R; Pucci, P; Nitti, G; Paonessa, G

    1994-05-01

    Oncostatin M is a cytokine that acts as a growth regulator on a wide variety of cells and has diverse biological activities including acute phase protein induction, LDL receptor up-regulation and cell-specific gene expression. In order to gather information about the Onc M structure, we established a protocol for large scale production and single step purification of this functional cytokine from bacterial cells. The cDNA of human Onc M was cloned by RT-PCR from total RNA of PMA induced U937 cells. After the addition of a six histidine tag at the N-terminus, the coding region of mature Onc M was cloned in the pT7.7 expression vector. Histidine tagged Onc M was overexpressed in bacterial cells and purified to homogeneity in one step on a metal chelating column. We found that recombinant 6xHis-OncM remains fully active in a growth inhibition assay. Structural characterization of the purified protein was performed by electrospray mass spectrometry, automated Edman degradation and peptide mapping by high-pressure liquid chromatography/fast-atom-bombardment mass spectrometry. Thermal and pH stability dependence of Onc M was assessed by circular dichroism spectroscopy; the helical content is about 50%, in agreement with the four helix bundle fold postulated for cytokines that bind haematopoietic receptors of type I.

  10. Real-time DNA barcoding in a rainforest using nanopore sequencing: opportunities for rapid biodiversity assessments and local capacity building

    PubMed Central

    Peñafiel, Nicolás; Arteaga, Alejandro; Bustamante, Lucas; Pichardo, Frank; Coloma, Luis A; Barrio-Amorós, César L; Salazar-Valenzuela, David; Prost, Stefan

    2018-01-01

    Abstract Background Advancements in portable scientific instruments provide promising avenues to expedite field work in order to understand the diverse array of organisms that inhabit our planet. Here, we tested the feasibility for in situ molecular analyses of endemic fauna using a portable laboratory fitting within a single backpack in one of the world's most imperiled biodiversity hotspots, the Ecuadorian Chocó rainforest. We used portable equipment, including the MinION nanopore sequencer (Oxford Nanopore Technologies) and the miniPCR (miniPCR), to perform DNA extraction, polymerase chain reaction amplification, and real-time DNA barcoding of reptile specimens in the field. Findings We demonstrate that nanopore sequencing can be implemented in a remote tropical forest to quickly and accurately identify species using DNA barcoding, as we generated consensus sequences for species resolution with an accuracy of >99% in less than 24 hours after collecting specimens. The flexibility of our mobile laboratory further allowed us to generate sequence information at the Universidad Tecnológica Indoamérica in Quito for rare, endangered, and undescribed species. This includes the recently rediscovered Jambato toad, which was thought to be extinct for 28 years. Sequences generated on the MinION required as few as 30 reads to achieve high accuracy relative to Sanger sequencing, and with further multiplexing of samples, nanopore sequencing can become a cost-effective approach for rapid and portable DNA barcoding. Conclusions Overall, we establish how mobile laboratories and nanopore sequencing can help to accelerate species identification in remote areas to aid in conservation efforts and be applied to research facilities in developing countries. This opens up possibilities for biodiversity studies by promoting local research capacity building, teaching nonspecialists and students about the environment, tackling wildlife crime, and promoting conservation via research-focused ecotourism. PMID:29617771

  11. Complete genome sequence of Cupriavidus basilensis 4G11, isolated from the Oak Ridge Field Research Center site

    DOE PAGES

    Ray, Jayashree; Waters, R. Jordan; Skerker, Jeffrey M.; ...

    2015-05-14

    Cupriavidus basilensis 4G11 was isolated from groundwater at the Oak Ridge Field Research Center (FRC) site. Here, we report the complete genome sequence and annotation of Cupriavidus basilensis 4G11. The genome contains 8,421,483 bp, 7,661 predicted protein-coding genes, and a total GC content of 64.4%.

  12. Toward a Better Compression for DNA Sequences Using Huffman Encoding

    PubMed Central

    Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi

    2017-01-01

    Abstract Due to the significant amount of DNA data that are being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data to a less space and a faster transmission. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. The research suggests an improvement on all DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (AL-Okaily, 2016). PMID:27960065

  13. Toward a Better Compression for DNA Sequences Using Huffman Encoding.

    PubMed

    Al-Okaily, Anas; Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi

    2017-04-01

    Due to the significant amount of DNA data that are being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data to a less space and a faster transmission. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. The research suggests an improvement on all DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (AL-Okaily, 2016 ).

  14. Next-generation sequencing in the clinic: promises and challenges.

    PubMed

    Xuan, Jiekun; Yu, Ying; Qing, Tao; Guo, Lei; Shi, Leming

    2013-11-01

    The advent of next generation sequencing (NGS) technologies has revolutionized the field of genomics, enabling fast and cost-effective generation of genome-scale sequence data with exquisite resolution and accuracy. Over the past years, rapid technological advances led by academic institutions and companies have continued to broaden NGS applications from research to the clinic. A recent crop of discoveries have highlighted the medical impact of NGS technologies on Mendelian and complex diseases, particularly cancer. However, the ever-increasing pace of NGS adoption presents enormous challenges in terms of data processing, storage, management and interpretation as well as sequencing quality control, which hinder the translation from sequence data into clinical practice. In this review, we first summarize the technical characteristics and performance of current NGS platforms. We further highlight advances in the applications of NGS technologies towards the development of clinical diagnostics and therapeutics. Common issues in NGS workflows are also discussed to guide the selection of NGS platforms and pipelines for specific research purposes. Published by Elsevier Ireland Ltd.

  15. Influenza Research Database: an integrated bioinformatics resource for influenza research and surveillance

    PubMed Central

    Squires, R. Burke; Noronha, Jyothi; Hunt, Victoria; García‐Sastre, Adolfo; Macken, Catherine; Baumgarth, Nicole; Suarez, David; Pickett, Brett E.; Zhang, Yun; Larsen, Christopher N.; Ramsey, Alvin; Zhou, Liwei; Zaremba, Sam; Kumar, Sanjeev; Deitrich, Jon; Klem, Edward; Scheuermann, Richard H.

    2012-01-01

    Please cite this paper as: Squires et al. (2012) Influenza research database: an integrated bioinformatics resource for influenza research and surveillance. Influenza and Other Respiratory Viruses 6(6), 404–416. Background  The recent emergence of the 2009 pandemic influenza A/H1N1 virus has highlighted the value of free and open access to influenza virus genome sequence data integrated with information about other important virus characteristics. Design  The Influenza Research Database (IRD, http://www.fludb.org) is a free, open, publicly‐accessible resource funded by the U.S. National Institute of Allergy and Infectious Diseases through the Bioinformatics Resource Centers program. IRD provides a comprehensive, integrated database and analysis resource for influenza sequence, surveillance, and research data, including user‐friendly interfaces for data retrieval, visualization and comparative genomics analysis, together with personal log in‐protected ‘workbench’ spaces for saving data sets and analysis results. IRD integrates genomic, proteomic, immune epitope, and surveillance data from a variety of sources, including public databases, computational algorithms, external research groups, and the scientific literature. Results  To demonstrate the utility of the data and analysis tools available in IRD, two scientific use cases are presented. A comparison of hemagglutinin sequence conservation and epitope coverage information revealed highly conserved protein regions that can be recognized by the human adaptive immune system as possible targets for inducing cross‐protective immunity. Phylogenetic and geospatial analysis of sequences from wild bird surveillance samples revealed a possible evolutionary connection between influenza virus from Delaware Bay shorebirds and Alberta ducks. Conclusions  The IRD provides a wealth of integrated data and information about influenza virus to support research of the genetic determinants dictating virus pathogenicity, host range restriction and transmission, and to facilitate development of vaccines, diagnostics, and therapeutics. PMID:22260278

  16. DraGnET: Software for storing, managing and analyzing annotated draft genome sequence data

    PubMed Central

    2010-01-01

    Background New "next generation" DNA sequencing technologies offer individual researchers the ability to rapidly generate large amounts of genome sequence data at dramatically reduced costs. As a result, a need has arisen for new software tools for storage, management and analysis of genome sequence data. Although bioinformatic tools are available for the analysis and management of genome sequences, limitations still remain. For example, restrictions on the submission of data and use of these tools may be imposed, thereby making them unsuitable for sequencing projects that need to remain in-house or proprietary during their initial stages. Furthermore, the availability and use of next generation sequencing in industrial, governmental and academic environments requires biologist to have access to computational support for the curation and analysis of the data generated; however, this type of support is not always immediately available. Results To address these limitations, we have developed DraGnET (Draft Genome Evaluation Tool). DraGnET is an open source web application which allows researchers, with no experience in programming and database management, to setup their own in-house projects for storing, retrieving, organizing and managing annotated draft and complete genome sequence data. The software provides a web interface for the use of BLAST, allowing users to perform preliminary comparative analysis among multiple genomes. We demonstrate the utility of DraGnET for performing comparative genomics on closely related bacterial strains. Furthermore, DraGnET can be further developed to incorporate additional tools for more sophisticated analyses. Conclusions DraGnET is designed for use either by individual researchers or as a collaborative tool available through Internet (or Intranet) deployment. For genome projects that require genome sequencing data to initially remain proprietary, DraGnET provides the means for researchers to keep their data in-house for analysis using local programs or until it is made publicly available, at which point it may be uploaded to additional analysis software applications. The DraGnET home page is available at http://www.dragnet.cvm.iastate.edu and includes example files for examining the functionalities, a link for downloading the DraGnET setup package and a link to the DraGnET source code hosted with full documentation on SourceForge. PMID:20175920

  17. Design and Analysis of Single-Cell Sequencing Experiments.

    PubMed

    Grün, Dominic; van Oudenaarden, Alexander

    2015-11-05

    Recent advances in single-cell sequencing hold great potential for exploring biological systems with unprecedented resolution. Sequencing the genome of individual cells can reveal somatic mutations and allows the investigation of clonal dynamics. Single-cell transcriptome sequencing can elucidate the cell type composition of a sample. However, single-cell sequencing comes with major technical challenges and yields complex data output. In this Primer, we provide an overview of available methods and discuss experimental design and single-cell data analysis. We hope that these guidelines will enable a growing number of researchers to leverage the power of single-cell sequencing. Copyright © 2015 Elsevier Inc. All rights reserved.

  18. Environmental microbiology through the lens of high-throughput DNA sequencing: synopsis of current platforms and bioinformatics approaches.

    PubMed

    Logares, Ramiro; Haverkamp, Thomas H A; Kumar, Surendra; Lanzén, Anders; Nederbragt, Alexander J; Quince, Christopher; Kauserud, Håvard

    2012-10-01

    The incursion of High-Throughput Sequencing (HTS) in environmental microbiology brings unique opportunities and challenges. HTS now allows a high-resolution exploration of the vast taxonomic and metabolic diversity present in the microbial world, which can provide an exceptional insight on global ecosystem functioning, ecological processes and evolution. This exploration has also economic potential, as we will have access to the evolutionary innovation present in microbial metabolisms, which could be used for biotechnological development. HTS is also challenging the research community, and the current bottleneck is present in the data analysis side. At the moment, researchers are in a sequence data deluge, with sequencing throughput advancing faster than the computer power needed for data analysis. However, new tools and approaches are being developed constantly and the whole process could be depicted as a fast co-evolution between sequencing technology, informatics and microbiologists. In this work, we examine the most popular and recently commercialized HTS platforms as well as bioinformatics methods for data handling and analysis used in microbial metagenomics. This non-exhaustive review is intended to serve as a broad state-of-the-art guide to researchers expanding into this rapidly evolving field. Copyright © 2012 Elsevier B.V. All rights reserved.

  19. Systematization of the protein sequence diversity in enzymes related to secondary metabolic pathways in plants, in the context of big data biology inspired by the KNApSAcK motorcycle database.

    PubMed

    Ikeda, Shun; Abe, Takashi; Nakamura, Yukiko; Kibinge, Nelson; Hirai Morita, Aki; Nakatani, Atsushi; Ono, Naoaki; Ikemura, Toshimichi; Nakamura, Kensuke; Altaf-Ul-Amin, Md; Kanaya, Shigehiko

    2013-05-01

    Biology is increasingly becoming a data-intensive science with the recent progress of the omics fields, e.g. genomics, transcriptomics, proteomics and metabolomics. The species-metabolite relationship database, KNApSAcK Core, has been widely utilized and cited in metabolomics research, and chronological analysis of that research work has helped to reveal recent trends in metabolomics research. To meet the needs of these trends, the KNApSAcK database has been extended by incorporating a secondary metabolic pathway database called Motorcycle DB. We examined the enzyme sequence diversity related to secondary metabolism by means of batch-learning self-organizing maps (BL-SOMs). Initially, we constructed a map by using a big data matrix consisting of the frequencies of all possible dipeptides in the protein sequence segments of plants and bacteria. The enzyme sequence diversity of the secondary metabolic pathways was examined by identifying clusters of segments associated with certain enzyme groups in the resulting map. The extent of diversity of 15 secondary metabolic enzyme groups is discussed. Data-intensive approaches such as BL-SOM applied to big data matrices are needed for systematizing protein sequences. Handling big data has become an inevitable part of biology.

  20. Applications and challenges of next-generation sequencing in Brassica species.

    PubMed

    Wei, Lijuan; Xiao, Meili; Hayward, Alice; Fu, Donghui

    2013-12-01

    Next-generation sequencing (NGS) produces numerous (often millions) short DNA sequence reads, typically varying between 25 and 400 bp in length, at a relatively low cost and in a short time. This revolutionary technology is being increasingly applied in whole-genome, transcriptome, epigenome and small RNA sequencing, molecular marker and gene discovery, comparative and evolutionary genomics, and association studies. The Brassica genus comprises some of the most agro-economically important crops, providing abundant vegetables, condiments, fodder, oil and medicinal products. Many Brassica species have undergone the process of polyploidization, which makes their genomes exceptionally complex and can create difficulties in genomics research. NGS injects new vigor into Brassica research, yet also faces specific challenges in the analysis of complex crop genomes and traits. In this article, we review the advantages and limitations of different NGS technologies and their applications and challenges, using Brassica as an advanced model system for agronomically important, polyploid crops. Specifically, we focus on the use of NGS for genome resequencing, transcriptome sequencing, development of single-nucleotide polymorphism markers, and identification of novel microRNAs and their targets. We present trends and advances in NGS technology in relation to Brassica crop improvement, with wide application for sophisticated genomics research into agronomically important polyploid crops.

  1. Hardware for dynamic quantum computing experiments: Part I

    NASA Astrophysics Data System (ADS)

    Johnson, Blake; Ryan, Colm; Riste, Diego; Donovan, Brian; Ohki, Thomas

    Static, pre-defined control sequences routinely achieve high-fidelity operation on superconducting quantum processors. Efforts toward dynamic experiments depending on real-time information have mostly proceeded through hardware duplication and triggers, requiring a combinatorial explosion in the number of channels. We provide a hardware efficient solution to dynamic control with a complete platform of specialized FPGA-based control and readout electronics; these components enable arbitrary control flow, low-latency feedback and/or feedforward, and scale far beyond single-qubit control and measurement. We will introduce the BBN Arbitrary Pulse Sequencer 2 (APS2) control system and the X6 QDSP readout platform. The BBN APS2 features: a sequencer built around implementing short quantum gates, a sequence cache to allow long sequences with branching structures, subroutines for code re-use, and a trigger distribution module to capture and distribute steering information. The X6 QDSP features a single-stage DSP pipeline that combines demodulation with arbitrary integration kernels, and multiple taps to inspect data flow for debugging and calibration. We will show system performance when putting it all together, including a latency budget for feedforward operations. This research was funded by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA), through the Army Research Office Contract No. W911NF-10-1-0324.

  2. Living laboratory: whole-genome sequencing as a learning healthcare enterprise.

    PubMed

    Angrist, M; Jamal, L

    2015-04-01

    With the proliferation of affordable large-scale human genomic data come profound and vexing questions about management of such data and their clinical uncertainty. These issues challenge the view that genomic research on human beings can (or should) be fully segregated from clinical genomics, either conceptually or practically. Here, we argue that the sharp distinction between clinical care and research is especially problematic in the context of large-scale genomic sequencing of people with suspected genetic conditions. Core goals of both enterprises (e.g. understanding genotype-phenotype relationships; generating an evidence base for genomic medicine) are more likely to be realized at a population scale if both those ordering and those undergoing sequencing for diagnostic reasons are routinely and longitudinally studied. Rather than relying on expensive and lengthy randomized clinical trials and meta-analyses, we propose leveraging nascent clinical-research hybrid frameworks into a broader, more permanent instantiation of exploratory medical sequencing. Such an investment could enlighten stakeholders about the real-life challenges posed by whole-genome sequencing, such as establishing the clinical actionability of genetic variants, returning 'off-target' results to families, developing effective service delivery models and monitoring long-term outcomes. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  3. The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color

    USDA-ARS?s Scientific Manuscript database

    Background: Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results: We describe the sequencing and assembly of...

  4. Dissociation between the Procedural Learning of Letter Names and Motor Sequences in Developmental Dyslexia

    ERIC Educational Resources Information Center

    Gabay, Yafit; Schiff, Rachel; Vakil, Eli

    2012-01-01

    Motor sequence learning has been studied extensively in Developmental dyslexia (DD). The purpose of the present research was to examine procedural learning of letter names and motor sequences in individuals with DD and control groups. Both groups completed the Serial Search Task which enabled the assessment of learning of letter names and motor…

  5. The Effects of Explicit Instruction of Formulaic Sequences on Second-Language Writers

    ERIC Educational Resources Information Center

    Colovic-Markovic, Jelena

    2012-01-01

    The present study investigated the effects of the explicit teaching of formulaic sequences (i.e., academic and topic-induced) on L2 writing. The research examined separately the effects of the treatment on the students' abilities to produce the target formulaic sequences in controlled (i.e., C-tests) and uncontrolled situations (i.e.,…

  6. Complete genome sequence of Tomato mosaic virus isolated from jasmine in the United States

    USDA-ARS?s Scientific Manuscript database

    Tomato mosaic virus (ToMV) was first identified in jasmine in the U.S. in Florida in 1999. This report provides the first full genome sequence of a ToMV isolate from jasmine. The full genome sequence of this virus will enable research scientists to develop additional specific diagnostic tests for ...

  7. Genome sequencing of Deutsch strain of cattle ticks, Rhipicephalus microplus: Raw Pac Bio reads.

    USDA-ARS?s Scientific Manuscript database

    Pac Bio RS II whole genome shotgun sequencing technology was used to sequence the genome of the cattle tick, Rhipicephalus microplus. The DNA was derived from 14 day old eggs from the Deutsch Texas outbreak strain reared at the USDA-ARS Cattle Fever Tick Research Laboratory, Edinburg, TX. Each corre...

  8. The Effects of Meiosis/Genetics Integration and Instructional Sequence on College Biology Student Achievement in Genetics.

    ERIC Educational Resources Information Center

    Browning, Mark

    The purpose of the research was to manipulate two aspects of genetics instruction in order to measure their effects on college, introductory biology students' achievement in genetics. One instructional sequence that was used dealt first with monohybrid autosomal inheritance patterns, then sex-linkage. The alternate sequence was the reverse.…

  9. Early-Stage Chunking of Finger Tapping Sequences by Persons Who Stutter and Fluent Speakers

    ERIC Educational Resources Information Center

    Smits-Bandstra, Sarah; De Nil, Luc F.

    2013-01-01

    This research note explored the hypothesis that chunking differences underlie the slow finger-tap sequencing performance reported in the literature for persons who stutter (PWS) relative to fluent speakers (PNS). Early-stage chunking was defined as an immediate and spontaneous tendency to organize a long sequence into pauses, for motor planning,…

  10. A Teaching Sequence for Learning the Concept of Chemical Equilibrium in Secondary School Education

    ERIC Educational Resources Information Center

    Ghirardi, Marco; Marchetti, Fabio; Pettinari, Claudio; Regis, Alberto; Roletto, Ezio

    2014-01-01

    A novel didactic sequence is proposed for the teaching of chemical equilibrium. This teaching sequence takes into account the historical and epistemological evolution of the concept, the alternative conceptions and learning difficulties highlighted by teaching science and research in education, and the need to focus on both the students'…

  11. Overview of Next-generation Sequencing Platforms Used in Published Draft Plant Genomes in Light of Genotypization of Immortelle Plant (Helichrysium Arenarium)

    PubMed Central

    Hodzic, Jasin; Gurbeta, Lejla; Omanovic-Miklicanin, Enisa; Badnjevic, Almir

    2017-01-01

    Introduction: Major advancements in DNA sequencing methods introduced in the first decade of the new millennium initiated a rapid expansion of sequencing studies, which yielded a tremendous amount of DNA sequence data, including whole sequenced genomes of various species, including plants. A set of novel sequencing platforms, often collectively named as “next-generation sequencing” (NGS) completely transformed the life sciences, by allowing extensive throughput, while greatly reducing the necessary time, labor and cost of any sequencing endeavor. Purpose: of this paper is to present an overview NGS platforms used to produce the current compendium of published draft genomes of various plants, namely the Roche/454, ABI/SOLiD, and Solexa/Illumina, and to determine the most frequently used platform for the whole genome sequencing of plants in light of genotypization of immortelle plant. Materials and methods: 45 papers were selected (with 47 presented plant genome draft sequences), and utilized sequencing techniques and NGS platforms (Roche/454, ABI/SOLiD and Illumina/Solexa) in selected papers were determined. Subsequently, frequency of usage of each platform or combination of platforms was calculated. Results: Illumina/Solexa platforms are by used either as sole sequencing tool in 40.42% of published genomes, or in combination with other platforms - additional 48.94% of published genomes, followed by Roche/454 platforms, used in combination with traditional Sanger sequencing method (10.64%), and never as a sole tool. ABI/SOLiD was only used in combination with Illumina/Solexa and Roche/454 in 4.25% of publications. Conclusions: Illumina/Solexa platforms are by far most preferred by researchers, most probably due to most affordable sequencing costs. Taking into consideration the current economic situation in the Balkans region, Illumina Solexa is the best (if not the only) platform choice if the sequencing of immortelle plant (Helichrysium arenarium) is to be performed by the researchers in this region. PMID:28974852

  12. Real-Time DNA Sequencing in the Antarctic Dry Valleys Using the Oxford Nanopore Sequencer

    PubMed Central

    Johnson, Sarah S.; Zaikova, Elena; Goerlitz, David S.; Bai, Yu; Tighe, Scott W.

    2017-01-01

    The ability to sequence DNA outside of the laboratory setting has enabled novel research questions to be addressed in the field in diverse areas, ranging from environmental microbiology to viral epidemics. Here, we demonstrate the application of offline DNA sequencing of environmental samples using a hand-held nanopore sequencer in a remote field location: the McMurdo Dry Valleys, Antarctica. Sequencing was performed using a MK1B MinION sequencer from Oxford Nanopore Technologies (ONT; Oxford, United Kingdom) that was equipped with software to operate without internet connectivity. One-direction (1D) genomic libraries were prepared using portable field techniques on DNA isolated from desiccated microbial mats. By adequately insulating the sequencer and laptop, it was possible to run the sequencing protocol for up to 2½ h under arduous conditions. PMID:28337073

  13. High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients.

    PubMed

    Kukita, Yoji; Matoba, Ryo; Uchida, Junji; Hamakawa, Takuya; Doki, Yuichiro; Imamura, Fumio; Kato, Kikuya

    2015-08-01

    Circulating tumour DNA (ctDNA) is an emerging field of cancer research. However, current ctDNA analysis is usually restricted to one or a few mutation sites due to technical limitations. In the case of massively parallel DNA sequencers, the number of false positives caused by a high read error rate is a major problem. In addition, the final sequence reads do not represent the original DNA population due to the global amplification step during the template preparation. We established a high-fidelity target sequencing system of individual molecules identified in plasma cell-free DNA using barcode sequences; this system consists of the following two steps. (i) A novel target sequencing method that adds barcode sequences by adaptor ligation. This method uses linear amplification to eliminate the errors introduced during the early cycles of polymerase chain reaction. (ii) The monitoring and removal of erroneous barcode tags. This process involves the identification of individual molecules that have been sequenced and for which the number of mutations have been absolute quantitated. Using plasma cell-free DNA from patients with gastric or lung cancer, we demonstrated that the system achieved near complete elimination of false positives and enabled de novo detection and absolute quantitation of mutations in plasma cell-free DNA. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  14. Expectation, information processing, and subjective duration.

    PubMed

    Simchy-Gross, Rhimmon; Margulis, Elizabeth Hellmuth

    2018-01-01

    In research on psychological time, it is important to examine the subjective duration of entire stimulus sequences, such as those produced by music (Teki, Frontiers in Neuroscience, 10, 2016). Yet research on the temporal oddball illusion (according to which oddball stimuli seem longer than standard stimuli of the same duration) has examined only the subjective duration of single events contained within sequences, not the subjective duration of sequences themselves. Does the finding that oddballs seem longer than standards translate to entire sequences, such that entire sequences that contain oddballs seem longer than those that do not? Is this potential translation influenced by the mode of information processing-whether people are engaged in direct or indirect temporal processing? Two experiments aimed to answer both questions using different manipulations of information processing. In both experiments, musical sequences either did or did not contain oddballs (auditory sliding tones). To manipulate information processing, we varied the task (Experiment 1), the sequence event structure (Experiments 1 and 2), and the sequence familiarity (Experiment 2) independently within subjects. Overall, in both experiments, the sequences that contained oddballs seemed shorter than those that did not when people were engaged in direct temporal processing, but longer when people were engaged in indirect temporal processing. These findings support the dual-process contingency model of time estimation (Zakay, Attention, Perception & Psychophysics, 54, 656-664, 1993). Theoretical implications for attention-based and memory-based models of time estimation, the pacemaker accumulator and coding efficiency hypotheses of time perception, and dynamic attending theory are discussed.

  15. Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions.

    PubMed

    Mai, Te-Lun; Hu, Geng-Ming; Chen, Chi-Ming

    2016-07-01

    Research in the recent decade has demonstrated the usefulness of protein network knowledge in furthering the study of molecular evolution of proteins, understanding the robustness of cells to perturbation, and annotating new protein functions. In this study, we aimed to provide a general clustering approach to visualize the sequence-structure-function relationship of protein networks, and investigate possible causes for inconsistency in the protein classifications based on sequences, structures, and functions. Such visualization of protein networks could facilitate our understanding of the overall relationship among proteins and help researchers comprehend various protein databases. As a demonstration, we clustered 1437 enzymes by their sequences and structures using the minimum span clustering (MSC) method. The general structure of this protein network was delineated at two clustering resolutions, and the second level MSC clustering was found to be highly similar to existing enzyme classifications. The clustering of these enzymes based on sequence, structure, and function information is consistent with each other. For proteases, the Jaccard's similarity coefficient is 0.86 between sequence and function classifications, 0.82 between sequence and structure classifications, and 0.78 between structure and function classifications. From our clustering results, we discussed possible examples of divergent evolution and convergent evolution of enzymes. Our clustering approach provides a panoramic view of the sequence-structure-function network of proteins, helps visualize the relation between related proteins intuitively, and is useful in predicting the structure and function of newly determined protein sequences.

  16. A Rapid Method of Isolating Neoantigen-specific T Cell Receptor Sequences | NCI Technology Transfer Center | TTC

    Cancer.gov

    Recent research has demonstrated that neoantigen-specific T-cell receptors (TCRs) can be isolated from a cancer patient’s lymphocytes. These TCRs may be used to engineer populations of tumor-reactive T cells for cancer immunotherapies. Obtaining sequences of these functional TCRs is a critical initial step in preparing this type of personalized cancer treatment; however, current methods are time-consuming and labor-intensive. Scientists at the National Cancer Institute (NCI) have developed a rapid and robust method of isolating the sequences of mutation-specific TCRs to alleviate these issues; they seek licensing and/or co-development research collaborations for the development of a method for isolating the sequences of tumor-reactive TCRs. For collaboration opportunities, please contact Steven A. Rosenberg, M.D., Ph.D. at sar@nih.gov.

  17. Chronodes: Interactive Multifocus Exploration of Event Sequences

    PubMed Central

    POLACK, PETER J.; CHEN, SHANG-TSE; KAHNG, MINSUK; DE BARBARO, KAYA; BASOLE, RAHUL; SHARMIN, MOUSHUMI; CHAU, DUEN HORNG

    2018-01-01

    The advent of mobile health (mHealth) technologies challenges the capabilities of current visualizations, interactive tools, and algorithms. We present Chronodes, an interactive system that unifies data mining and human-centric visualization techniques to support explorative analysis of longitudinal mHealth data. Chronodes extracts and visualizes frequent event sequences that reveal chronological patterns across multiple participant timelines of mHealth data. It then combines novel interaction and visualization techniques to enable multifocus event sequence analysis, which allows health researchers to interactively define, explore, and compare groups of participant behaviors using event sequence combinations. Through summarizing insights gained from a pilot study with 20 behavioral and biomedical health experts, we discuss Chronodes’s efficacy and potential impact in the mHealth domain. Ultimately, we outline important open challenges in mHealth, and offer recommendations and design guidelines for future research. PMID:29515937

  18. What you learn is more than what you see: what can sequencing effects tell us about inductive category learning?

    PubMed Central

    Carvalho, Paulo F.; Goldstone, Robert L.

    2015-01-01

    Inductive category learning takes place across time. As such, it is not surprising that the sequence in which information is studied has an impact in what is learned and how efficient learning is. In this paper we review research on different learning sequences and how this impacts learning. We analyze different aspects of interleaved (frequent alternation between categories during study) and blocked study (infrequent alternation between categories during study) that might explain how and when one sequence of study results in improved learning. While these different sequences of study differ in the amount of temporal spacing and temporal juxtaposition between items of different categories, these aspects do not seem to account for the majority of the results available in the literature. However, differences in the type of category being studied and the duration of the retention interval between study and test may play an important role. We conclude that there is no single aspect that is able to account for all the evidence available. Understanding learning as a process of sequential comparisons in time and how different sequences fundamentally alter the statistics of this experience offers a promising framework for understanding sequencing effects in category learning. We use this framework to present novel predictions and hypotheses for future research on sequencing effects in inductive category learning. PMID:25983699

  19. Characterization of 47 MHC class I sequences in Filipino cynomolgus macaques

    PubMed Central

    Campbell, Kevin J.; Detmer, Ann M.; Karl, Julie A.; Wiseman, Roger W.; Blasky, Alex J.; Hughes, Austin L.; Bimber, Benjamin N.; O’Connor, Shelby L.; O’Connor, David H.

    2009-01-01

    Cynomolgus macaques (Macaca fascicularis) provide increasingly common models for infectious disease research. Several geographically distinct populations of these macaques from Southeast Asia and the Indian Ocean island of Mauritius are available for pathogenesis studies. Though host genetics may profoundly impact results of such studies, similarities and differences between populations are often overlooked. In this study we identified 47 full-length MHC class I nucleotide sequences in 16 cynomolgus macaques of Filipino origin. The majority of MHC class I sequences characterized (39 of 47) were unique to this regional population. However, we discovered eight sequences with perfect identity and six sequences with close similarity to previously defined MHC class I sequences from other macaque populations. We identified two ancestral MHC haplotypes that appear to be shared between Filipino and Mauritian cynomolgus macaques, notably a Mafa-B haplotype that has previously been shown to protect Mauritian cynomolgus macaques against challenge with a simian/human immunodeficiency virus, SHIV89.6P. We also identified a Filipino cynomolgus macaque MHC class I sequence for which the predicted protein sequence differs from Mamu-B*17 by a single amino acid. This is important because Mamu-B*17 is strongly associated with protection against simian immunodeficiency virus (SIV) challenge in Indian rhesus macaques. These findings have implications for the evolutionary history of Filipino cynomolgus macaques as well as for the use of this model in SIV/SHIV research protocols. PMID:19107381

  20. Designing of deployment sequence for braking and drift systems in atmosphere of Mars and Venus

    NASA Astrophysics Data System (ADS)

    Vorontsov, Victor

    2006-07-01

    Analysis of project development and space research using contact method, namely, by means of automatic descent modules and balloons shows that designing formation of entry, descent and landing (EDL) sequence and operation in the atmosphere are of great importance. This process starts at the very beginning of designing, has undergone a lot of iterations and influences processing of normal operation results. Along with designing of descent module systems, including systems of braking in the atmosphere, designing of flight operation sequence and trajectories of motion in the atmosphere is performed. As the entire operation sequence and transfer from one phase to another was correctly chosen, the probability of experiment success on the whole and efficiency of application of various systems vary. By now the most extensive experience of Russian specialists in research of terrestrial planets has been gained with the help of automatic interplanetary stations “Mars”, “Venera”, “Vega” which had descent modules and drifting in the atmosphere balloons. Particular interest and complicity of formation of EDL and drift sequence in the atmosphere of these planets arise from radically different operation conditions, in particular, strongly rarefied atmosphere of the one planet and extremely dense atmosphere of another. Consequently, this determines the choice of braking systems and their parameters and method of EDL consequence formation. At the same time there are general fundamental methods and designed research techniques that allowed taking general technical approach to designing of EDL and drift sequence in the atmosphere.

  1. Does the sequence of data collection influence participants' responses to closed and open-ended questions? A methodological study.

    PubMed

    Covell, Christine L; Sidani, Souraya; Ritchie, Judith A

    2012-06-01

    The sequence used for collecting quantitative and qualitative data in concurrent mixed-methods research may influence participants' responses. Empirical evidence is needed to determine if the order of data collection in concurrent mixed methods research biases participants' responses to closed and open-ended questions. To examine the influence of the quantitative-qualitative sequence on responses to closed and open-ended questions when assessing the same variables or aspects of a phenomenon simultaneously within the same study phase. A descriptive cross-sectional, concurrent mixed-methods design was used to collect quantitative (survey) and qualitative (interview) data. The setting was a large multi-site health care centre in Canada. A convenience sample of 50 registered nurses was selected and participated in the study. Participants were randomly assigned to one of two sequences for data collection, quantitative-qualitative or qualitative-quantitative. Independent t-tests were performed to compare the two groups' responses to the survey items. Directed content analysis was used to compare the participants' responses to the interview questions. The sequence of data collection did not greatly affect the participants' responses to the closed-ended questions (survey items) or the open-ended questions (interview questions). The sequencing of data collection, when using both survey and semi-structured interviews, may not bias participants' responses to closed or open-ended questions. Additional research is required to confirm these findings. Copyright © 2011 Elsevier Ltd. All rights reserved.

  2. VaDiR: an integrated approach to Variant Detection in RNA.

    PubMed

    Neums, Lisa; Suenaga, Seiji; Beyerlein, Peter; Anders, Sara; Koestler, Devin; Mariani, Andrea; Chien, Jeremy

    2018-02-01

    Advances in next-generation DNA sequencing technologies are now enabling detailed characterization of sequence variations in cancer genomes. With whole-genome sequencing, variations in coding and non-coding sequences can be discovered. But the cost associated with it is currently limiting its general use in research. Whole-exome sequencing is used to characterize sequence variations in coding regions, but the cost associated with capture reagents and biases in capture rate limit its full use in research. Additional limitations include uncertainty in assigning the functional significance of the mutations when these mutations are observed in the non-coding region or in genes that are not expressed in cancer tissue. We investigated the feasibility of uncovering mutations from expressed genes using RNA sequencing datasets with a method called Variant Detection in RNA(VaDiR) that integrates 3 variant callers, namely: SNPiR, RVBoost, and MuTect2. The combination of all 3 methods, which we called Tier 1 variants, produced the highest precision with true positive mutations from RNA-seq that could be validated at the DNA level. We also found that the integration of Tier 1 variants with those called by MuTect2 and SNPiR produced the highest recall with acceptable precision. Finally, we observed a higher rate of mutation discovery in genes that are expressed at higher levels. Our method, VaDiR, provides a possibility of uncovering mutations from RNA sequencing datasets that could be useful in further functional analysis. In addition, our approach allows orthogonal validation of DNA-based mutation discovery by providing complementary sequence variation analysis from paired RNA/DNA sequencing datasets.

  3. The value of new genome references.

    PubMed

    Worley, Kim C; Richards, Stephen; Rogers, Jeffrey

    2017-09-15

    Genomic information has become a ubiquitous and almost essential aspect of biological research. Over the last 10-15 years, the cost of generating sequence data from DNA or RNA samples has dramatically declined and our ability to interpret those data increased just as remarkably. Although it is still possible for biologists to conduct interesting and valuable research on species for which genomic data are not available, the impact of having access to a high quality whole genome reference assembly for a given species is nothing short of transformational. Research on a species for which we have no DNA or RNA sequence data is restricted in fundamental ways. In contrast, even access to an initial draft quality genome (see below for definitions) opens a wide range of opportunities that are simply not available without that reference genome assembly. Although a complete discussion of the impact of genome sequencing and assembly is beyond the scope of this short paper, the goal of this review is to summarize the most common and highest impact contributions that whole genome sequencing and assembly has had on comparative and evolutionary biology. Copyright © 2016. Published by Elsevier Inc.

  4. The Biofuel Feedstock Genomics Resource: a web-based portal and database to enable functional genomics of plant biofuel feedstock species.

    PubMed

    Childs, Kevin L; Konganti, Kranti; Buell, C Robin

    2012-01-01

    Major feedstock sources for future biofuel production are likely to be high biomass producing plant species such as poplar, pine, switchgrass, sorghum and maize. One active area of research in these species is genome-enabled improvement of lignocellulosic biofuel feedstock quality and yield. To facilitate genomic-based investigations in these species, we developed the Biofuel Feedstock Genomic Resource (BFGR), a database and web-portal that provides high-quality, uniform and integrated functional annotation of gene and transcript assembly sequences from species of interest to lignocellulosic biofuel feedstock researchers. The BFGR includes sequence data from 54 species and permits researchers to view, analyze and obtain annotation at the gene, transcript, protein and genome level. Annotation of biochemical pathways permits the identification of key genes and transcripts central to the improvement of lignocellulosic properties in these species. The integrated nature of the BFGR in terms of annotation methods, orthologous/paralogous relationships and linkage to seven species with complete genome sequences allows comparative analyses for biofuel feedstock species with limited sequence resources. Database URL: http://bfgr.plantbiology.msu.edu.

  5. The report of my death was an exaggeration: A review for researchers using microsatellites in the 21st century1

    PubMed Central

    Hodel, Richard G. J.; Segovia-Salcedo, M. Claudia; Landis, Jacob B.; Crowl, Andrew A.; Sun, Miao; Liu, Xiaoxian; Gitzendanner, Matthew A.; Douglas, Norman A.; Germain-Aubrey, Charlotte C.; Chen, Shichao; Soltis, Douglas E.; Soltis, Pamela S.

    2016-01-01

    Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers. PMID:27347456

  6. A public HTLV-1 molecular epidemiology database for sequence management and data mining.

    PubMed

    Araujo, Thessika Hialla Almeida; Souza-Brito, Leandro Inacio; Libin, Pieter; Deforche, Koen; Edwards, Dustin; de Albuquerque-Junior, Antonio Eduardo; Vandamme, Anne-Mieke; Galvao-Castro, Bernardo; Alcantara, Luiz Carlos Junior

    2012-01-01

    It is estimated that 15 to 20 million people are infected with the human T-cell lymphotropic virus type 1 (HTLV-1). At present, there are more than 2,000 unique HTLV-1 isolate sequences published. A central database to aggregate sequence information from a range of epidemiological aspects including HTLV-1 infections, pathogenesis, origins, and evolutionary dynamics would be useful to scientists and physicians worldwide. Described here, we have developed a database that collects and annotates sequence data and can be accessed through a user-friendly search interface. The HTLV-1 Molecular Epidemiology Database website is available at http://htlv1db.bahia.fiocruz.br/. All data was obtained from publications available at GenBank or through contact with the authors. The database was developed using Apache Webserver 2.1.6 and SGBD MySQL. The webpage interfaces were developed in HTML and sever-side scripting written in PHP. The HTLV-1 Molecular Epidemiology Database is hosted on the Gonçalo Moniz/FIOCRUZ Research Center server. There are currently 2,457 registered sequences with 2,024 (82.37%) of those sequences representing unique isolates. Of these sequences, 803 (39.67%) contain information about clinical status (TSP/HAM, 17.19%; ATL, 7.41%; asymptomatic, 12.89%; other diseases, 2.17%; and no information, 60.32%). Further, 7.26% of sequences contain information on patient gender while 5.23% of sequences provide the age of the patient. The HTLV-1 Molecular Epidemiology Database retrieves and stores annotated HTLV-1 proviral sequences from clinical, epidemiological, and geographical studies. The collected sequences and related information are now accessible on a publically available and user-friendly website. This open-access database will support clinical research and vaccine development related to viral genotype.

  7. A method for automatically extracting infectious disease-related primers and probes from the literature

    PubMed Central

    2010-01-01

    Background Primer and probe sequences are the main components of nucleic acid-based detection systems. Biologists use primers and probes for different tasks, some related to the diagnosis and prescription of infectious diseases. The biological literature is the main information source for empirically validated primer and probe sequences. Therefore, it is becoming increasingly important for researchers to navigate this important information. In this paper, we present a four-phase method for extracting and annotating primer/probe sequences from the literature. These phases are: (1) convert each document into a tree of paper sections, (2) detect the candidate sequences using a set of finite state machine-based recognizers, (3) refine problem sequences using a rule-based expert system, and (4) annotate the extracted sequences with their related organism/gene information. Results We tested our approach using a test set composed of 297 manuscripts. The extracted sequences and their organism/gene annotations were manually evaluated by a panel of molecular biologists. The results of the evaluation show that our approach is suitable for automatically extracting DNA sequences, achieving precision/recall rates of 97.98% and 95.77%, respectively. In addition, 76.66% of the detected sequences were correctly annotated with their organism name. The system also provided correct gene-related information for 46.18% of the sequences assigned a correct organism name. Conclusions We believe that the proposed method can facilitate routine tasks for biomedical researchers using molecular methods to diagnose and prescribe different infectious diseases. In addition, the proposed method can be expanded to detect and extract other biological sequences from the literature. The extracted information can also be used to readily update available primer/probe databases or to create new databases from scratch. PMID:20682041

  8. Genome Sequencing Technologies and Nursing: What Are the Roles of Nurses and Nurse Scientists?

    PubMed Central

    Taylor, Jacquelyn Y.; Wright, Michelle L.; Hickey, Kathleen T.; Housman, David

    2016-01-01

    Background Advances in DNA sequencing technology have resulted in an abundance of personalized data with challenging clinical utility and meaning for clinicians. This wealth of data has potential to dramatically impact the quality of healthcare. Nurses are at the focal point in educating patients regarding relevant healthcare needs; therefore, an understanding of sequencing technology and utilizing these data are critical. Aim The objective of this paper is to explicate the role of nurses and nurse scientists as integral members of healthcare teams in improving understanding of DNA sequencing data and translational genomics for patients. Approach A history of the nurse role in newborn screening is used as an exemplar. Discussion This paper serves as an exemplar on how genome sequencing has been utilized in nursing science and incorporates linkages of other omics approaches used by nurses that are included in this special issue. This special issue showcased nurse scientists conducting multi-omic research from various methods, including targeted candidate genes, pharmacogenomics, proteomics, epigenomics and the microbiome. From this vantage point, we provide an overview of the roles of nurse scientists in genome sequencing research and provide recommendations for the best utilization of nurses and nurse scientists related to genome sequencing. PMID:28252579

  9. A streamlined method for analysing genome-wide DNA methylation patterns from low amounts of FFPE DNA.

    PubMed

    Ludgate, Jackie L; Wright, James; Stockwell, Peter A; Morison, Ian M; Eccles, Michael R; Chatterjee, Aniruddha

    2017-08-31

    Formalin fixed paraffin embedded (FFPE) tumor samples are a major source of DNA from patients in cancer research. However, FFPE is a challenging material to work with due to macromolecular fragmentation and nucleic acid crosslinking. FFPE tissue particularly possesses challenges for methylation analysis and for preparing sequencing-based libraries relying on bisulfite conversion. Successful bisulfite conversion is a key requirement for sequencing-based methylation analysis. Here we describe a complete and streamlined workflow for preparing next generation sequencing libraries for methylation analysis from FFPE tissues. This includes, counting cells from FFPE blocks and extracting DNA from FFPE slides, testing bisulfite conversion efficiency with a polymerase chain reaction (PCR) based test, preparing reduced representation bisulfite sequencing libraries and massively parallel sequencing. The main features and advantages of this protocol are: An optimized method for extracting good quality DNA from FFPE tissues. An efficient bisulfite conversion and next generation sequencing library preparation protocol that uses 50 ng DNA from FFPE tissue. Incorporation of a PCR-based test to assess bisulfite conversion efficiency prior to sequencing. We provide a complete workflow and an integrated protocol for performing DNA methylation analysis at the genome-scale and we believe this will facilitate clinical epigenetic research that involves the use of FFPE tissue.

  10. The impact of visual sequencing of graphic symbols on the sentence construction output of children who have acquired language.

    PubMed

    Alant, Erna; du Plooy, Amelia; Dada, Shakila

    2007-01-01

    Although the sequence of graphic or pictorial symbols displayed on a communication board can have an impact on the language output of children, very little research has been conducted to describe this. Research in this area is particularly relevant for prioritising the importance of specific visual and graphic features in providing more effective and user-friendly access to communication boards. This study is concerned with understanding the impact ofspecific sequences of graphic symbol input on the graphic and spoken output of children who have acquired language. Forty participants were divided into two comparable groups. Each group was exposed to graphic symbol input with a certain word order sequence. The structure of input was either in typical English word order sequence Subject- Verb-Object (SVO) or in the word order sequence of Subject-Object-Verb (SOV). Both input groups had to answer six questions by using graphic output as well as speech. The findings indicated that there are significant differences in the PCS graphic output patterns of children who are exposed to graphic input in the SOV and SVO sequences. Furthermore, the output produced in the graphic mode differed considerably to the output produced in the spoken mode. Clinical implications of these findings are discussed

  11. How challenges in auditory fMRI led to general advancements for the field.

    PubMed

    Talavage, Thomas M; Hall, Deborah A

    2012-08-15

    In the early years of fMRI research, the auditory neuroscience community sought to expand its knowledge of the underlying physiology of hearing, while also seeking to come to grips with the inherent acoustic disadvantages of working in the fMRI environment. Early collaborative efforts between prominent auditory research laboratories and prominent fMRI centers led to development of a number of key technical advances that have subsequently been widely used to elucidate principles of auditory neurophysiology. Perhaps the key imaging advance was the simultaneous and parallel development of strategies to use pulse sequences in which the volume acquisitions were "clustered," providing gaps in which stimuli could be presented without direct masking. Such sequences have become widespread in fMRI studies using auditory stimuli and also in a range of translational research domains. This review presents the parallel stories of the people and the auditory neurophysiology research that led to these sequences. Copyright © 2011 Elsevier Inc. All rights reserved.

  12. [Advance on genome research of Yersinia pestis bacteriophage].

    PubMed

    Tan, H L; Wang, P; Li, W

    2017-04-10

    Completion of the genome sequences on Yersinia pestis bacteriophage offered unprecedented opportunity for researchers to carry out related genomic studies. This review was based on the genomic sequences and provided a genomic perspective in describing the essential features of genome on Yersinia pestis bacteriophage. Based on the comparative genomics, genetic evolutionary relationship was discussed. Description of functions from the gene prediction and protein annotation provided evidence for further related studies.

  13. Draft Genome Sequences of Two Janthinobacterium lividum Strains, Isolated from Pristine Groundwater Collected from the Oak Ridge Field Research Center

    PubMed Central

    Wu, Xiaoqin; Deutschbauer, Adam M.; Kazakov, Alexey E.; Wetmore, Kelly M.; Cwick, Bryson A.; Walker, Robert M.; Novichkov, Pavel S.; Arkin, Adam P.

    2017-01-01

    ABSTRACT We present here the draft genome sequences of two Janthinobacterium lividum strains, GW456P and GW458P, isolated from groundwater samples collected from a background site at the Oak Ridge Field Research Center. Production of a purple pigment by these two strains was observed when grown on diluted (1/10) LB agar plates. PMID:28663297

  14. Brassica ASTRA: an integrated database for Brassica genomic research.

    PubMed

    Love, Christopher G; Robinson, Andrew J; Lim, Geraldine A C; Hopkins, Clare J; Batley, Jacqueline; Barker, Gary; Spangenberg, German C; Edwards, David

    2005-01-01

    Brassica ASTRA is a public database for genomic information on Brassica species. The database incorporates expressed sequences with Swiss-Prot and GenBank comparative sequence annotation as well as secondary Gene Ontology (GO) annotation derived from the comparison with Arabidopsis TAIR GO annotations. Simple sequence repeat molecular markers are identified within resident sequences and mapped onto the closely related Arabidopsis genome sequence. Bacterial artificial chromosome (BAC) end sequences derived from the Multinational Brassica Genome Project are also mapped onto the Arabidopsis genome sequence enabling users to identify candidate Brassica BACs corresponding to syntenic regions of Arabidopsis. This information is maintained in a MySQL database with a web interface providing the primary means of interrogation. The database is accessible at http://hornbill.cspp.latrobe.edu.au.

  15. dictyExpress: a web-based platform for sequence data management and analytics in Dictyostelium and beyond.

    PubMed

    Stajdohar, Miha; Rosengarten, Rafael D; Kokosar, Janez; Jeran, Luka; Blenkus, Domen; Shaulsky, Gad; Zupan, Blaz

    2017-06-02

    Dictyostelium discoideum, a soil-dwelling social amoeba, is a model for the study of numerous biological processes. Research in the field has benefited mightily from the adoption of next-generation sequencing for genomics and transcriptomics. Dictyostelium biologists now face the widespread challenges of analyzing and exploring high dimensional data sets to generate hypotheses and discovering novel insights. We present dictyExpress (2.0), a web application designed for exploratory analysis of gene expression data, as well as data from related experiments such as Chromatin Immunoprecipitation sequencing (ChIP-Seq). The application features visualization modules that include time course expression profiles, clustering, gene ontology enrichment analysis, differential expression analysis and comparison of experiments. All visualizations are interactive and interconnected, such that the selection of genes in one module propagates instantly to visualizations in other modules. dictyExpress currently stores the data from over 800 Dictyostelium experiments and is embedded within a general-purpose software framework for management of next-generation sequencing data. dictyExpress allows users to explore their data in a broader context by reciprocal linking with dictyBase-a repository of Dictyostelium genomic data. In addition, we introduce a companion application called GenBoard, an intuitive graphic user interface for data management and bioinformatics analysis. dictyExpress and GenBoard enable broad adoption of next generation sequencing based inquiries by the Dictyostelium research community. Labs without the means to undertake deep sequencing projects can mine the data available to the public. The entire information flow, from raw sequence data to hypothesis testing, can be accomplished in an efficient workspace. The software framework is generalizable and represents a useful approach for any research community. To encourage more wide usage, the backend is open-source, available for extension and further development by bioinformaticians and data scientists.

  16. Two-Stage orders sequencing system for mixed-model assembly

    NASA Astrophysics Data System (ADS)

    Zemczak, M.; Skolud, B.; Krenczyk, D.

    2015-11-01

    In the paper, the authors focus on the NP-hard problem of orders sequencing, formulated similarly to Car Sequencing Problem (CSP). The object of the research is the assembly line in an automotive industry company, on which few different models of products, each in a certain number of versions, are assembled on the shared resources, set in a line. Such production type is usually determined as a mixed-model production, and arose from the necessity of manufacturing customized products on the basis of very specific orders from single clients. The producers are nowadays obliged to provide each client the possibility to determine a huge amount of the features of the product they are willing to buy, as the competition in the automotive market is large. Due to the previously mentioned nature of the problem (NP-hard), in the given time period only satisfactory solutions are sought, as the optimal solution method has not yet been found. Most of the researchers that implemented inaccurate methods (e.g. evolutionary algorithms) to solving sequencing problems dropped the research after testing phase, as they were not able to obtain reproducible results, and met problems while determining the quality of the received solutions. Therefore a new approach to solving the problem, presented in this paper as a sequencing system is being developed. The sequencing system consists of a set of determined rules, implemented into computer environment. The system itself works in two stages. First of them is connected with the determination of a place in the storage buffer to which certain production orders should be sent. In the second stage of functioning, precise sets of sequences are determined and evaluated for certain parts of the storage buffer under certain criteria.

  17. Microchannel DNA Sequencing by End-Labelled Free Solution Electrophoresis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barron, A.

    2005-09-29

    The further development of End-Labeled Free-Solution Electrophoresis will greatly simplify DNA separation and sequencing on microfluidic devices. The development and optimization of drag-tags is critical to the success of this research.

  18. Clinical genomics, big data, and electronic medical records: reconciling patient rights with research when privacy and science collide.

    PubMed

    Kulynych, Jennifer; Greely, Henry T

    2017-04-01

    Widespread use of medical records for research, without consent, attracts little scrutiny compared to biospecimen research, where concerns about genomic privacy prompted recent federal proposals to mandate consent. This paper explores an important consequence of the proliferation of electronic health records (EHRs) in this permissive atmosphere: with the advent of clinical gene sequencing, EHR-based secondary research poses genetic privacy risks akin to those of biospecimen research, yet regulators still permit researchers to call gene sequence data 'de-identified', removing such data from the protection of the federal Privacy Rule and federal human subjects regulations. Medical centers and other providers seeking to offer genomic 'personalized medicine' now confront the problem of governing the secondary use of clinical genomic data as privacy risks escalate. We argue that regulators should no longer permit HIPAA-covered entities to treat dense genomic data as de-identified health information. Even with this step, the Privacy Rule would still permit disclosure of clinical genomic data for research, without consent, under a data use agreement, so we also urge that providers give patients specific notice before disclosing clinical genomic data for research, permitting (where possible) some degree of choice and control. To aid providers who offer clinical gene sequencing, we suggest both general approaches and specific actions to reconcile patients' rights and interests with genomic research.

  19. Clinical genomics, big data, and electronic medical records: reconciling patient rights with research when privacy and science collide

    PubMed Central

    Greely, Henry T.

    2017-01-01

    Abstract Widespread use of medical records for research, without consent, attracts little scrutiny compared to biospecimen research, where concerns about genomic privacy prompted recent federal proposals to mandate consent. This paper explores an important consequence of the proliferation of electronic health records (EHRs) in this permissive atmosphere: with the advent of clinical gene sequencing, EHR-based secondary research poses genetic privacy risks akin to those of biospecimen research, yet regulators still permit researchers to call gene sequence data ‘de-identified’, removing such data from the protection of the federal Privacy Rule and federal human subjects regulations. Medical centers and other providers seeking to offer genomic ‘personalized medicine’ now confront the problem of governing the secondary use of clinical genomic data as privacy risks escalate. We argue that regulators should no longer permit HIPAA-covered entities to treat dense genomic data as de-identified health information. Even with this step, the Privacy Rule would still permit disclosure of clinical genomic data for research, without consent, under a data use agreement, so we also urge that providers give patients specific notice before disclosing clinical genomic data for research, permitting (where possible) some degree of choice and control. To aid providers who offer clinical gene sequencing, we suggest both general approaches and specific actions to reconcile patients’ rights and interests with genomic research. PMID:28852559

  20. Instances of erroneous DNA barcoding of metazoan invertebrates: Are universal cox1 gene primers too "universal"?

    PubMed

    Mioduchowska, Monika; Czyż, Michał Jan; Gołdyn, Bartłomiej; Kur, Jarosław; Sell, Jerzy

    2018-01-01

    The cytochrome c oxidase subunit I (cox1) gene is the main mitochondrial molecular marker playing a pivotal role in phylogenetic research and is a crucial barcode sequence. Folmer's "universal" primers designed to amplify this gene in metazoan invertebrates allowed quick and easy barcode and phylogenetic analysis. On the other hand, the increase in the number of studies on barcoding leads to more frequent publishing of incorrect sequences, due to amplification of non-target taxa, and insufficient analysis of the obtained sequences. Consequently, some sequences deposited in genetic databases are incorrectly described as obtained from invertebrates, while being in fact bacterial sequences. In our study, in which we used Folmer's primers to amplify COI sequences of the crustacean fairy shrimp Branchipus schaefferi (Fischer 1834), we also obtained COI sequences of microbial contaminants from Aeromonas sp. However, when we searched the GenBank database for sequences closely matching these contaminations we found entries described as representatives of Gastrotricha and Mollusca. When these entries were compared with other sequences bearing the same names in the database, the genetic distance between the incorrect and correct sequences amplified from the same species was c.a. 65%. Although the responsibility for the correct molecular identification of species rests on researchers, the errors found in already published sequences data have not been re-evaluated so far. On the basis of the standard sampling technique we have estimated with 95% probability that the chances of finding incorrectly described metazoan sequences in the GenBank depend on the systematic group, and variety from less than 1% (Mollusca and Arthropoda) up to 6.9% (Gastrotricha). Consequently, the increasing popularity of DNA barcoding and metabarcoding analysis may lead to overestimation of species diversity. Finally, the study also discusses the sources of the problems with amplification of non-target sequences.

  1. Whole-Genome Sequences of Thirteen Isolates of Borrelia burgdorferi

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schutzer S. E.; Dunn J.; Fraser-Liggett, C. M.

    2011-02-01

    Borrelia burgdorferi is a causative agent of Lyme disease in North America and Eurasia. The first complete genome sequence of B. burgdorferi strain 31, available for more than a decade, has assisted research on the pathogenesis of Lyme disease. Because a single genome sequence is not sufficient to understand the relationship between genotypic and geographic variation and disease phenotype, we determined the whole-genome sequences of 13 additional B. burgdorferi isolates that span the range of natural variation. These sequences should allow improved understanding of pathogenesis and provide a foundation for novel detection, diagnosis, and prevention strategies.

  2. Locomotive crashworthiness research

    DOT National Transportation Integrated Search

    2015-04-01

    conducts research on locomotive crashworthiness. The research approach includes four phases: : 1. Accident investigations to assemble sequences of events leading to injury and fatality. : 2. Locomotive performance is analyzed, and potential improveme...

  3. Draft Genome Sequence of Microbacterium sp. Strain UCD-TDU (Phylum Actinobacteria)

    PubMed Central

    Bendiks, Zachary A.; Lang, Jenna M.; Darling, Aaron E.; Coil, David A.

    2013-01-01

    Here, we present the draft genome sequence of Microbacterium sp. strain UCD-TDU, a member of the phylum Actinobacteria. The assembly contains 3,746,321 bp (in 8 scaffolds). This strain was isolated from a residential toilet as part of an undergraduate student research project to sequence reference genomes of microbes from the built environment. PMID:23516225

  4. Data Portal | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The CPTAC Data Portal is a centralized repository for the public dissemination of proteomic sequence datasets collected by CPTAC, along with corresponding genomic sequence datasets.  In addition, available are analyses of CPTAC's raw mass spectrometry-based data files (mapping of spectra to peptide sequences and protein identification) by individual investigators from CPTAC and by a Common Data Analysis Pipeline.

  5. Not All Order Memory Is Equal: Test Demands Reveal Dissociations in Memory for Sequence Information

    ERIC Educational Resources Information Center

    Jonker, Tanya R.; MacLeod, Colin M.

    2017-01-01

    Remembering the order of a sequence of events is a fundamental feature of episodic memory. Indeed, a number of formal models represent temporal context as part of the memory system, and memory for order has been researched extensively. Yet, the nature of the code(s) underlying sequence memory is still relatively unknown. Across 4 experiments that…

  6. Sequence Stratigraphy of the Dakota Sandstone, Eastern San Juan Basin, New Mexico, and its Relationship to Reservoir Compartmentalization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Varney, Peter J.

    2002-04-23

    This research established the Dakota-outcrop sequence stratigraphy in part of the eastern San Juan Basin, New Mexico, and relates reservoir quality lithologies in depositional sequences to structure and reservoir compartmentalization in the South Lindrith Field area. The result was a predictive tool that will help guide further exploration and development.

  7. Drinking from the Fire Hose: Why the Flight Management System Can Be Hard to Train and Difficult to Use

    NASA Technical Reports Server (NTRS)

    Sherry, Lance; Feary, Michael; Polson, Peter; Fennell, Karl

    2003-01-01

    The Flight Management Computer (FMC) and its interface, the Multi-function Control and Display Unit (MCDU) have been identified by researchers and airlines as difficult to train and use. Specifically, airline pilots have described the "drinking from the fire-hose" effect during training. Previous research has identified memorized action sequences as a major factor in a user s ability to learn and operate complex devices. This paper discusses the use of a method to examine the quantity of memorized action sequences required to perform a sample of 102 tasks, using features of the Boeing 777 Flight Management Computer Interface. The analysis identified a large number of memorized action sequences that must be learned during training and then recalled during line operations. Seventy-five percent of the tasks examined require recall of at least one memorized action sequence. Forty-five percent of the tasks require recall of a memorized action sequence and occur infrequently. The large number of memorized action sequences may provide an explanation for the difficulties in training and usage of the automation. Based on these findings, implications for training and the design of new user-interfaces are discussed.

  8. Generalized lessons about sequence learning from the study of the serial reaction time task

    PubMed Central

    Schwarb, Hillary; Schumacher, Eric H.

    2012-01-01

    Over the last 20 years researchers have used the serial reaction time (SRT) task to investigate the nature of spatial sequence learning. They have used the task to identify the locus of spatial sequence learning, identify situations that enhance and those that impair learning, and identify the important cognitive processes that facilitate this type of learning. Although controversies remain, the SRT task has been integral in enhancing our understanding of implicit sequence learning. It is important, however, to ask what, if anything, the discoveries made using the SRT task tell us about implicit learning more generally. This review analyzes the state of the current spatial SRT sequence learning literature highlighting the stimulus-response rule hypothesis of sequence learning which we believe provides a unifying account of discrepant SRT data. It also challenges researchers to use the vast body of knowledge acquired with the SRT task to understand other implicit learning literatures too often ignored in the context of this particular task. This broad perspective will make it possible to identify congruences among data acquired using various different tasks that will allow us to generalize about the nature of implicit learning. PMID:22723815

  9. VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research

    PubMed Central

    Lai, Zhongwu; Markovets, Aleksandra; Ahdesmaki, Miika; Chapman, Brad; Hofmann, Oliver; McEwen, Robert; Johnson, Justin; Dougherty, Brian; Barrett, J. Carl; Dry, Jonathan R.

    2016-01-01

    Abstract Accurate variant calling in next generation sequencing (NGS) is critical to understand cancer genomes better. Here we present VarDict, a novel and versatile variant caller for both DNA- and RNA-sequencing data. VarDict simultaneously calls SNV, MNV, InDels, complex and structural variants, expanding the detected genetic driver landscape of tumors. It performs local realignments on the fly for more accurate allele frequency estimation. VarDict performance scales linearly to sequencing depth, enabling ultra-deep sequencing used to explore tumor evolution or detect tumor DNA circulating in blood. In addition, VarDict performs amplicon aware variant calling for polymerase chain reaction (PCR)-based targeted sequencing often used in diagnostic settings, and is able to detect PCR artifacts. Finally, VarDict also detects differences in somatic and loss of heterozygosity variants between paired samples. VarDict reprocessing of The Cancer Genome Atlas (TCGA) Lung Adenocarcinoma dataset called known driver mutations in KRAS, EGFR, BRAF, PIK3CA and MET in 16% more patients than previously published variant calls. We believe VarDict will greatly facilitate application of NGS in clinical cancer research. PMID:27060149

  10. Attomole-level Genomics with Single-molecule Direct DNA, cDNA and RNA Sequencing Technologies.

    PubMed

    Ozsolak, Fatih

    2016-01-01

    With the introduction of next-generation sequencing (NGS) technologies in 2005, the domination of microarrays in genomics quickly came to an end due to NGS's superior technical performance and cost advantages. By enabling genetic analysis capabilities that were not possible previously, NGS technologies have started to play an integral role in all areas of biomedical research. This chapter outlines the low-quantity DNA and cDNA sequencing capabilities and applications developed with the Helicos single molecule DNA sequencing technology.

  11. Random sequences generation through optical measurements by phase-shifting interferometry

    NASA Astrophysics Data System (ADS)

    François, M.; Grosges, T.; Barchiesi, D.; Erra, R.; Cornet, A.

    2012-04-01

    The development of new techniques for producing random sequences with a high level of security is a challenging topic of research in modern cryptographics. The proposed method is based on the measurement by phase-shifting interferometry of the speckle signals of the interaction between light and structures. We show how the combination of amplitude and phase distributions (maps) under a numerical process can produce random sequences. The produced sequences satisfy all the statistical requirements of randomness and can be used in cryptographic schemes.

  12. [Sequencing technology in gene diagnosis and its application].

    PubMed

    Yibin, Guo

    2014-11-01

    The study of gene mutation is one of the hot topics in the field of life science nowadays, and the related detection methods and diagnostic technology have been developed rapidly. Sequencing technology plays an indispensable role in the definite diagnosis and classification of genetic diseases. In this review, we summarize the research progress in sequencing technology, evaluate the advantages and disadvantages of 1(st) ~3(rd) generation of sequencing technology, and describe its application in gene diagnosis. Also we made forecasts and prospects on its development trend.

  13. Next-Generation Sequencing Platforms

    NASA Astrophysics Data System (ADS)

    Mardis, Elaine R.

    2013-06-01

    Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.

  14. Children's discrimination of vowel sequences

    NASA Astrophysics Data System (ADS)

    Coady, Jeffry A.; Kluender, Keith R.; Evans, Julia

    2003-10-01

    Children's ability to discriminate sequences of steady-state vowels was investigated. Vowels (as in ``beet,'' ``bat,'' ``bought,'' and ``boot'') were synthesized at durations of 40, 80, 160, 320, 640, and 1280 ms. Four different vowel sequences were created by concatenating different orders of vowels for each duration, separated by 10-ms intervening silence. Thus, sequences differed in vowel order and duration (rate). Sequences were 12 s in duration, with amplitude ramped linearly over the first and last 2 s. Sequence pairs included both same (identical sequences) and different trials (sequences with vowels in different orders). Sequences with vowel of equal duration were presented on individual trials. Children aged 7;0 to 10;6 listened to pairs of sequences (with 100 ms between sequences) and responded whether sequences sounded the same or different. Results indicate that children are best able to discriminate sequences of intermediate-duration vowels, typical of conversational speaking rate. Children were less accurate with both shorter and longer vowels. Results are discussed in terms of auditory processing (shortest vowels) and memory (longest vowels). [Research supported by NIDCD DC-05263, DC-04072, and DC-005650.

  15. [Current applications of high-throughput DNA sequencing technology in antibody drug research].

    PubMed

    Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong

    2012-03-01

    Since the publication of a high-throughput DNA sequencing technology based on PCR reaction was carried out in oil emulsions in 2005, high-throughput DNA sequencing platforms have been evolved to a robust technology in sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation of discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we present a review of current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach of discovery and development of antibody drugs.

  16. Company profile: Complete Genomics Inc.

    PubMed

    Reid, Clifford

    2011-02-01

    Complete Genomics Inc. is a life sciences company that focuses on complete human genome sequencing. It is taking a completely different approach to DNA sequencing than other companies in the industry. Rather than building a general-purpose platform for sequencing all organisms and all applications, it has focused on a single application - complete human genome sequencing. The company's Complete Genomics Analysis Platform (CGA™ Platform) comprises an integrated package of biochemistry, instrumentation and software that sequences human genomes at the highest quality, lowest cost and largest scale available. Complete Genomics offers a turnkey service that enables customers to outsource their human genome sequencing to the company's genome sequencing center in Mountain View, CA, USA. Customers send in their DNA samples, the company does all the library preparation, DNA sequencing, assembly and variant analysis, and customers receive research-ready data that they can use for biological discovery.

  17. RefSeq microbial genomes database: new representation and annotation strategy.

    PubMed

    Tatusova, Tatiana; Ciufo, Stacy; Fedorov, Boris; O'Neill, Kathleen; Tolstoy, Igor

    2014-01-01

    The source of the microbial genomic sequences in the RefSeq collection is the set of primary sequence records submitted to the International Nucleotide Sequence Database public archives. These can be accessed through the Entrez search and retrieval system at http://www.ncbi.nlm.nih.gov/genome. Next-generation sequencing has enabled researchers to perform genomic sequencing at rates that were unimaginable in the past. Microbial genomes can now be sequenced in a matter of hours, which has led to a significant increase in the number of assembled genomes deposited in the public archives. This huge increase in DNA sequence data presents new challenges for the annotation, analysis and visualization bioinformatics tools. New strategies have been developed for the annotation and representation of reference genomes and sequence variations derived from population studies and clinical outbreaks.

  18. Whole Genome Sequencing

    MedlinePlus

    ... exons, the parts of DNA that code for proteins in the body. Researchers like this method because it is faster and cheaper. Learn More More still needs to be done before whole genome sequencing becomes a routine part of medical care. Many ...

  19. DOE Research and Development Accomplishments

    Science.gov Websites

    sector to explore the possibility of sequencing the human genome. This Workshop was sponsored by DOE and approach to sequence the human genome. The Human Genome Project (HGP) was formalized in mid-February 1990

  20. ATRF Houses the Latest DNA Sequencing Technologies | Poster

    Cancer.gov

    By Ashley DeVine, Staff Writer By the end of October, the Advanced Technology Research Facility (ATRF) will be one of the few facilities in the world to house all of the latest DNA sequencing technologies.

  1. Genome sequence of the exopolysaccharide-producing Salipiger mucosus type strain (DSM 16094(T)), a moderately halophilic member of the Roseobacter clade.

    PubMed

    Riedel, Thomas; Spring, Stefan; Fiebig, Anne; Petersen, Jörn; Kyrpides, Nikos C; Göker, Markus; Klenk, Hans-Peter

    2014-06-15

    Salipiger mucosus Martínez-Cànovas et al. 2004 is the type species of the genus Salipiger, a moderately halophilic and exopolysaccharide-producing representative of the Roseobacter lineage within the alphaproteobacterial family Rhodobacteraceae. Members of this family were shown to be the most abundant bacteria especially in coastal and polar waters, but were also found in microbial mats and sediments. Here we describe the features of the S. mucosus strain DSM 16094(T) together with its genome sequence and annotation. The 5,689,389-bp genome sequence consists of one chromosome and several extrachromosomal elements. It contains 5,650 protein-coding genes and 95 RNA genes. The genome of S. mucosus DSM 16094(T) was sequenced as part of the activities of the Transregional Collaborative Research Center 51 (TRR51) funded by the German Research Foundation (DFG).

  2. Genome-derived vaccines.

    PubMed

    De Groot, Anne S; Rappuoli, Rino

    2004-02-01

    Vaccine research entered a new era when the complete genome of a pathogenic bacterium was published in 1995. Since then, more than 97 bacterial pathogens have been sequenced and at least 110 additional projects are now in progress. Genome sequencing has also dramatically accelerated: high-throughput facilities can draft the sequence of an entire microbe (two to four megabases) in 1 to 2 days. Vaccine developers are using microarrays, immunoinformatics, proteomics and high-throughput immunology assays to reduce the truly unmanageable volume of information available in genome databases to a manageable size. Vaccines composed by novel antigens discovered from genome mining are already in clinical trials. Within 5 years we can expect to see a novel class of vaccines composed by genome-predicted, assembled and engineered T- and Bcell epitopes. This article addresses the convergence of three forces--microbial genome sequencing, computational immunology and new vaccine technologies--that are shifting genome mining for vaccines onto the forefront of immunology research.

  3. A Retrospective Examination of Feline Leukemia Subgroup Characterization: Viral Interference Assays to Deep Sequencing.

    PubMed

    Chiu, Elliott S; Hoover, Edward A; VandeWoude, Sue

    2018-01-10

    Feline leukemia virus (FeLV) was the first feline retrovirus discovered, and is associated with multiple fatal disease syndromes in cats, including lymphoma. The original research conducted on FeLV employed classical virological techniques. As methods have evolved to allow FeLV genetic characterization, investigators have continued to unravel the molecular pathology associated with this fascinating agent. In this review, we discuss how FeLV classification, transmission, and disease-inducing potential have been defined sequentially by viral interference assays, Sanger sequencing, PCR, and next-generation sequencing. In particular, we highlight the influences of endogenous FeLV and host genetics that represent FeLV research opportunities on the near horizon.

  4. Elliptic net and its cryptographic application

    NASA Astrophysics Data System (ADS)

    Muslim, Norliana; Said, Mohamad Rushdan Md

    2017-11-01

    Elliptic net is a generalization of elliptic divisibility sequence and in cryptography field, most cryptographic pairings that are based on elliptic curve such as Tate pairing can be improved by applying elliptic nets algorithm. The elliptic net is constructed by using n dimensional array of values in rational number satisfying nonlinear recurrence relations that arise from elliptic divisibility sequences. The two main properties hold in the recurrence relations are for all positive integers m>n, hm +nhm -n=hm +1hm -1hn2-hn +1hn -1hm2 and hn divides hm whenever n divides m. In this research, we discuss elliptic divisibility sequence associated with elliptic nets based on cryptographic perspective and its possible research direction.

  5. The A, C, G, and T of Genome Assembly.

    PubMed

    Wajid, Bilal; Sohail, Muhammad U; Ekti, Ali R; Serpedin, Erchin

    2016-01-01

    Genome assembly in its two decades of history has produced significant research, in terms of both biotechnology and computational biology. This contribution delineates sequencing platforms and their characteristics, examines key steps involved in filtering and processing raw data, explains assembly frameworks, and discusses quality statistics for the assessment of the assembled sequence. Furthermore, the paper explores recent Ubuntu-based software environments oriented towards genome assembly as well as some avenues for future research.

  6. Draft Genome Sequences of Two Janthinobacteriumlividum Strains, Isolated from Pristine Groundwater Collected from the Oak Ridge Field Research Center.

    PubMed

    Wu, Xiaoqin; Deutschbauer, Adam M; Kazakov, Alexey E; Wetmore, Kelly M; Cwick, Bryson A; Walker, Robert M; Novichkov, Pavel S; Arkin, Adam P; Chakraborty, Romy

    2017-06-29

    We present here the draft genome sequences of two Janthinobacterium lividum strains, GW456P and GW458P, isolated from groundwater samples collected from a background site at the Oak Ridge Field Research Center. Production of a purple pigment by these two strains was observed when grown on diluted (1/10) LB agar plates. Copyright © 2017 Wu et al.

  7. Return of Genomic Results to Research Participants: The Floor, the Ceiling, and the Choices In Between

    PubMed Central

    Jarvik, Gail P.; Amendola, Laura M.; Berg, Jonathan S.; Brothers, Kyle; Clayton, Ellen W.; Chung, Wendy; Evans, Barbara J.; Evans, James P.; Fullerton, Stephanie M.; Gallego, Carlos J.; Garrison, Nanibaa’ A.; Gray, Stacy W.; Holm, Ingrid A.; Kullo, Iftikhar J.; Lehmann, Lisa Soleymani; McCarty, Cathy; Prows, Cynthia A.; Rehm, Heidi L.; Sharp, Richard R.; Salama, Joseph; Sanderson, Saskia; Van Driest, Sara L.; Williams, Marc S.; Wolf, Susan M.; Wolf, Wendy A.; Harley, John; Myers, Melanie; Namjou, Bahram; Vinks, Sander; Connolly, John; Keating, Brendan; Gerhard, Glenn; Sundaresan, Agnes; Tromp, Gerard; Crosslin, David; Leppig, Kathy; Wicklund, Cathy; Chute, Christopher; Lynch, John; De Andrade, Mariza; Heit, John; McCormick, Jen; Brilliant, Murray; Kitchner, Terrie; Ritchie, Marylyn; Böttinger, Erwin; Peter, Inga; Persell, Stephen; Rasmussen-Torvik, Laura; McGregor, Tracy; Roden, Dan; Antommaria, Armand; Chiavacci, Rosetta; Faucett, Andy; Ledbetter, David; Williams, Janet; Hartzler, Andrea; Vitek, Carolyn R. Rohrer; Frost, Norm; Ferryman, Kadija; Horowitz, Carol; Rhodes, Rosamond; Zinberg, Randi; Aufox, Sharon; Pan, Vivian; Long, Rochelle; Ramos, Erin; Odgis, Jackie; Wise, Anastasia; Hull, Sara; Gitlin, Jonathan; Green, Robert; Metterville, Danielle; McGuire, Amy; Kong, Sek Won; Trinidad, Sue; Veenstra, David; Roche, Myra; Skinner, Debra; Raspberry, Kelly; O’Daniel, Julianne; Parsons, Will; Eng, Christine; Hilsenbeck, Susan; Karavite, Dean; Conlin, Laura; Spinner, Nancy; Krantz, Ian; Falk, Marni; Santani, Avni; Dechene, Elizabeth; Dulik, Matthew; Bernhardt, Barbara; Schuetze, Scott; Everett, Jessica; Gornick, Michele Caroline; Wilfond, Ben; Tabor, Holly; Lemke, Amy A.; Richards, Sue; Goddard, Katrina; Cooper, Greg; East, Kelly; Barsh, Greg; Koenig, Barbara; Van Allen, Eliezer; Garber, Judy; Garrett, Jeremy; Zawati, Ma’n; Lewis, Michelle; Savage, Sarah; Smith, Maureen; Roychowdhury, Sameek; Bailey, Alice; Berkman, Benjamin; Anan, Charlisse Caga; Hindorff, Lucia; Hutter, Carolyn; King, Rosalind; Li, Rongling; Lockhart, Nicole; McEwen, Jean; Scholes, Derek; Schully, Sheri; Sun, Kathie; Burke, Wylie

    2014-01-01

    As more research studies incorporate next-generation sequencing (including whole-genome or whole-exome sequencing), investigators and institutional review boards face difficult questions regarding which genomic results to return to research participants and how. An American College of Medical Genetics and Genomics 2013 policy paper suggesting that pathogenic mutations in 56 specified genes should be returned in the clinical setting has raised the question of whether comparable recommendations should be considered in research settings. The Clinical Sequencing Exploratory Research (CSER) Consortium and the Electronic Medical Records and Genomics (eMERGE) Network are multisite research programs that aim to develop practical strategies for addressing questions concerning the return of results in genomic research. CSER and eMERGE committees have identified areas of consensus regarding the return of genomic results to research participants. In most circumstances, if results meet an actionability threshold for return and the research participant has consented to return, genomic results, along with referral for appropriate clinical follow-up, should be offered to participants. However, participants have a right to decline the receipt of genomic results, even when doing so might be viewed as a threat to the participants’ health. Research investigators should be prepared to return research results and incidental findings discovered in the course of their research and meeting an actionability threshold, but they have no ethical obligation to actively search for such results. These positions are consistent with the recognition that clinical research is distinct from medical care in both its aims and its guiding moral principles. PMID:24814192

  8. Linking GPS and travel diary data using sequence alignment in a study of children's independent mobility

    PubMed Central

    2011-01-01

    Background Global positioning systems (GPS) are increasingly being used in health research to determine the location of study participants. Combining GPS data with data collected via travel/activity diaries allows researchers to assess where people travel in conjunction with data about trip purpose and accompaniment. However, linking GPS and diary data is problematic and to date the only method has been to match the two datasets manually, which is time consuming and unlikely to be practical for larger data sets. This paper assesses the feasibility of a new sequence alignment method of linking GPS and travel diary data in comparison with the manual matching method. Methods GPS and travel diary data obtained from a study of children's independent mobility were linked using sequence alignment algorithms to test the proof of concept. Travel diaries were assessed for quality by counting the number of errors and inconsistencies in each participant's set of diaries. The success of the sequence alignment method was compared for higher versus lower quality travel diaries, and for accompanied versus unaccompanied trips. Time taken and percentage of trips matched were compared for the sequence alignment method and the manual method. Results The sequence alignment method matched 61.9% of all trips. Higher quality travel diaries were associated with higher match rates in both the sequence alignment and manual matching methods. The sequence alignment method performed almost as well as the manual method and was an order of magnitude faster. However, the sequence alignment method was less successful at fully matching trips and at matching unaccompanied trips. Conclusions Sequence alignment is a promising method of linking GPS and travel diary data in large population datasets, especially if limitations in the trip detection algorithm are addressed. PMID:22142322

  9. From cheek swabs to consensus sequences: an A to Z protocol for high-throughput DNA sequencing of complete human mitochondrial genomes

    PubMed Central

    2014-01-01

    Background Next-generation DNA sequencing (NGS) technologies have made huge impacts in many fields of biological research, but especially in evolutionary biology. One area where NGS has shown potential is for high-throughput sequencing of complete mtDNA genomes (of humans and other animals). Despite the increasing use of NGS technologies and a better appreciation of their importance in answering biological questions, there remain significant obstacles to the successful implementation of NGS-based projects, especially for new users. Results Here we present an ‘A to Z’ protocol for obtaining complete human mitochondrial (mtDNA) genomes – from DNA extraction to consensus sequence. Although designed for use on humans, this protocol could also be used to sequence small, organellar genomes from other species, and also nuclear loci. This protocol includes DNA extraction, PCR amplification, fragmentation of PCR products, barcoding of fragments, sequencing using the 454 GS FLX platform, and a complete bioinformatics pipeline (primer removal, reference-based mapping, output of coverage plots and SNP calling). Conclusions All steps in this protocol are designed to be straightforward to implement, especially for researchers who are undertaking next-generation sequencing for the first time. The molecular steps are scalable to large numbers (hundreds) of individuals and all steps post-DNA extraction can be carried out in 96-well plate format. Also, the protocol has been assembled so that individual ‘modules’ can be swapped out to suit available resources. PMID:24460871

  10. EST-PAC a web package for EST annotation and protein sequence prediction

    PubMed Central

    Strahm, Yvan; Powell, David; Lefèvre, Christophe

    2006-01-01

    With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data and, 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics. PMID:17147782

  11. Heuristic reusable dynamic programming: efficient updates of local sequence alignment.

    PubMed

    Hong, Changjin; Tewfik, Ahmed H

    2009-01-01

    Recomputation of the previously evaluated similarity results between biological sequences becomes inevitable when researchers realize errors in their sequenced data or when the researchers have to compare nearly similar sequences, e.g., in a family of proteins. We present an efficient scheme for updating local sequence alignments with an affine gap model. In principle, using the previous matching result between two amino acid sequences, we perform a forward-backward alignment to generate heuristic searching bands which are bounded by a set of suboptimal paths. Given a correctly updated sequence, we initially predict a new score of the alignment path for each contour to select the best candidates among them. Then, we run the Smith-Waterman algorithm in this confined space. Furthermore, our heuristic alignment for an updated sequence shows that it can be further accelerated by using reusable dynamic programming (rDP), our prior work. In this study, we successfully validate "relative node tolerance bound" (RNTB) in the pruned searching space. Furthermore, we improve the computational performance by quantifying the successful RNTB tolerance probability and switch to rDP on perturbation-resilient columns only. In our searching space derived by a threshold value of 90 percent of the optimal alignment score, we find that 98.3 percent of contours contain correctly updated paths. We also find that our method consumes only 25.36 percent of the runtime cost of sparse dynamic programming (sDP) method, and to only 2.55 percent of that of a normal dynamic programming with the Smith-Waterman algorithm.

  12. Verona Coding Definitions of Emotional Sequences (VR-CoDES): Conceptual framework and future directions.

    PubMed

    Piccolo, Lidia Del; Finset, Arnstein; Mellblom, Anneli V; Figueiredo-Braga, Margarida; Korsvold, Live; Zhou, Yuefang; Zimmermann, Christa; Humphris, Gerald

    2017-12-01

    To discuss the theoretical and empirical framework of VR-CoDES and potential future direction in research based on the coding system. The paper is based on selective review of papers relevant to the construction and application of VR-CoDES. VR-CoDES system is rooted in patient-centered and biopsychosocial model of healthcare consultations and on a functional approach to emotion theory. According to the VR-CoDES, emotional interaction is studied in terms of sequences consisting of an eliciting event, an emotional expression by the patient and the immediate response by the clinician. The rationale for the emphasis on sequences, on detailed classification of cues and concerns, and on the choices of explicit vs. non-explicit responses and providing vs. reducing room for further disclosure, as basic categories of the clinician responses, is described. Results from research on VR-CoDES may help raise awareness of emotional sequences. Future directions in applying VR-CoDES in research may include studies on predicting patient and clinician behavior within the consultation, qualitative analyses of longer sequences including several VR-CoDES triads, and studies of effects of emotional communication on health outcomes. VR-CoDES may be applied to develop interventions to promote good handling of patients' emotions in healthcare encounters. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Identification of MHC class I sequences in Chinese-origin rhesus macaques

    PubMed Central

    Karl, Julie A.; Wiseman, Roger W.; Campbell, Kevin J.; Blasky, Alex J.; Hughes, Austin L.; Ferguson, Betsy; Read, Daniel S.

    2010-01-01

    The rhesus macaque (Macaca mulatta) is an excellent model for human disease and vaccine research. Two populations exhibiting distinctive morphological and physiological characteristics, Indian- and Chinese-origin rhesus macaques, are commonly used in research. Genetic analysis has focused on the Indian macaque population, but the accessibility of these animals for research is limited. Due to their greater availability, Chinese rhesus macaques are now being used more frequently, particularly in vaccine and biodefense studies, although relatively little is known about their immunogenetics. In this study, we discovered major histocompatibility complex (MHC) class I cDNAs in 12 Chinese rhesus macaques and detected 41 distinct Mamu-A and Mamu-B sequences. Twenty-seven of these class I cDNAs were novel, while six and eight of these sequences were previously reported in Chinese and Indian rhesus macaques, respectively. We then performed microsatellite analysis on DNA from these 12 animals, as well as an additional 18 animals, and developed sequence specific primer PCR (PCR-SSP) assays for eight cDNAs found in multiple animals. We also examined our cohort for potential admixture of Chinese and Indian origin animals using a recently developed panel of single nucleotide polymorphisms (SNPs). The discovery of 27 novel MHC class I sequences in this analysis underscores the genetic diversity of Chinese rhesus macaques and contributes reagents that will be valuable for studying cellular immunology in this population. PMID:18097659

  14. Lessons learnt on the analysis of large sequence data in animal genomics.

    PubMed

    Biscarini, F; Cozzi, P; Orozco-Ter Wengel, P

    2018-04-06

    The 'omics revolution has made a large amount of sequence data available to researchers and the industry. This has had a profound impact in the field of bioinformatics, stimulating unprecedented advancements in this discipline. Mostly, this is usually looked at from the perspective of human 'omics, in particular human genomics. Plant and animal genomics, however, have also been deeply influenced by next-generation sequencing technologies, with several genomics applications now popular among researchers and the breeding industry. Genomics tends to generate huge amounts of data, and genomic sequence data account for an increasing proportion of big data in biological sciences, due largely to decreasing sequencing and genotyping costs and to large-scale sequencing and resequencing projects. The analysis of big data poses a challenge to scientists, as data gathering currently takes place at a faster pace than does data processing and analysis, and the associated computational burden is increasingly taxing, making even simple manipulation, visualization and transferring of data a cumbersome operation. The time consumed by the processing and analysing of huge data sets may be at the expense of data quality assessment and critical interpretation. Additionally, when analysing lots of data, something is likely to go awry-the software may crash or stop-and it can be very frustrating to track the error. We herein review the most relevant issues related to tackling these challenges and problems, from the perspective of animal genomics, and provide researchers that lack extensive computing experience with guidelines that will help when processing large genomic data sets. © 2018 Stichting International Foundation for Animal Genetics.

  15. An Evaluation of a Progressive High-Probability Instructional Sequence Combined with Low-Probability Demand Fading in the Treatment of Food Selectivity

    ERIC Educational Resources Information Center

    Penrod, Becky; Gardella, Laura; Fernand, Jonathan

    2012-01-01

    Few studies have examined the effects of the high-probability instructional sequence in the treatment of food selectivity, and results of these studies have been mixed (e.g., Dawson et al., 2003; Patel et al., 2007). The present study extended previous research on the high-probability instructional sequence by combining this procedure with…

  16. Complete Genome Sequence of a Putative New Bacterial Strain, I507, Isolated from the Indian Ocean

    PubMed Central

    Wang, Shu-yan; Wei, Jia-qiang

    2018-01-01

    ABSTRACT Bacterial strain I507 was isolated from the central Indian Ocean and may be a potential novel species, according to the 16S rRNA gene sequence. Here, we present its complete genome sequence and expect that it will provide researchers with valuable information to further understand its classification and function in the future. PMID:29674539

  17. Towards a global cancer knowledge network: dissecting the current international cancer genomic sequencing landscape.

    PubMed

    Vis, D J; Lewin, J; Liao, R G; Mao, M; Andre, F; Ward, R L; Calvo, F; Teh, B T; Camargo, A A; Knoppers, B M; Sawyers, C L; Wessels, L F A; Lawler, M; Siu, L L; Voest, E

    2017-05-01

    While next generation sequencing has enhanced our understanding of the biological basis of malignancy, current knowledge on global practices for sequencing cancer samples is limited. To address this deficiency, we developed a survey to provide a snapshot of current sequencing activities globally, identify barriers to data sharing and use this information to develop sustainable solutions for the cancer research community. A multi-item survey was conducted assessing demographics, clinical data collection, genomic platforms, privacy/ethics concerns, funding sources and data sharing barriers for sequencing initiatives globally. Additionally, respondents were asked as to provide the primary intent of their initiative (clinical diagnostic, research or combination). Of 107 initiatives invited to participate, 59 responded (response rate = 55%). Whole exome sequencing (P = 0.03) and whole genome sequencing (P = 0.01) were utilized less frequently in clinical diagnostic than in research initiatives. Procedures to identify cancer-specific variants were heterogeneous, with bioinformatics pipelines employing different mutation calling/variant annotation algorithms. Measurement of treatment efficacy varied amongst initiatives, with time on treatment (57%) and RECIST (53%) being the most common; however, other parameters were also employed. Whilst 72% of initiatives indicated data sharing, its scope varied, with a number of restrictions in place (e.g. transfer of raw data). The largest perceived barriers to data harmonization were the lack of financial support (P < 0.01) and bioinformatics concerns (e.g. lack of interoperability) (P = 0.02). Capturing clinical data was more likely to be perceived as a barrier to data sharing by larger initiatives than by smaller initiatives (P = 0.01). These results identify the main barriers, as perceived by the cancer sequencing community, to effective sharing of cancer genomic and clinical data. They highlight the need for greater harmonization of technical, ethical and data capture processes in cancer sample sequencing worldwide, in order to support effective and responsible data sharing for the benefit of patients. © The Author 2017. Published by Oxford University Press on behalf of the European Society for Medical Oncology.

  18. De Novo Assembly and Characterization of Four Anthozoan (Phylum Cnidaria) Transcriptomes.

    PubMed

    Kitchen, Sheila A; Crowder, Camerron M; Poole, Angela Z; Weis, Virginia M; Meyer, Eli

    2015-09-17

    Many nonmodel species exemplify important biological questions but lack the sequence resources required to study the genes and genomic regions underlying traits of interest. Reef-building corals are famously sensitive to rising seawater temperatures, motivating ongoing research into their stress responses and long-term prospects in a changing climate. A comprehensive understanding of these processes will require extending beyond the sequenced coral genome (Acropora digitifera) to encompass diverse coral species and related anthozoans. Toward that end, we have assembled and annotated reference transcriptomes to develop catalogs of gene sequences for three scleractinian corals (Fungia scutaria, Montastraea cavernosa, Seriatopora hystrix) and a temperate anemone (Anthopleura elegantissima). High-throughput sequencing of cDNA libraries produced ~20-30 million reads per sample, and de novo assembly of these reads produced ~75,000-110,000 transcripts from each sample with size distributions (mean ~1.4 kb, N50 ~2 kb), comparable to the distribution of gene models from the coral genome (mean ~1.7 kb, N50 ~2.2 kb). Each assembly includes matches for more than half the gene models from A. digitifera (54-67%) and many reasonably complete transcripts (~5300-6700) spanning nearly the entire gene (ortholog hit ratios ≥0.75). The catalogs of gene sequences developed in this study made it possible to identify hundreds to thousands of orthologs across diverse scleractinian species and related taxa. We used these sequences for phylogenetic inference, recovering known relationships and demonstrating superior performance over phylogenetic trees constructed using single mitochondrial loci. The resources developed in this study provide gene sequences and genetic markers for several anthozoan species. To enhance the utility of these resources for the research community, we developed searchable databases enabling researchers to rapidly recover sequences for genes of interest. Our analysis of de novo assembly quality highlights metrics that we expect will be useful for evaluating the relative quality of other de novo transcriptome assemblies. The identification of orthologous sequences and phylogenetic reconstruction demonstrates the feasibility of these methods for clarifying the substantial uncertainties in the existing scleractinian phylogeny. Copyright © 2015 Kitchen et al.

  19. [GNU Pattern: open source pattern hunter for biological sequences based on SPLASH algorithm].

    PubMed

    Xu, Ying; Li, Yi-xue; Kong, Xiang-yin

    2005-06-01

    To construct a high performance open source software engine based on IBM SPLASH algorithm for later research on pattern discovery. Gpat, which is based on SPLASH algorithm, was developed by using open source software. GNU Pattern (Gpat) software was developped, which efficiently implemented the core part of SPLASH algorithm. Full source code of Gpat was also available for other researchers to modify the program under the GNU license. Gpat is a successful implementation of SPLASH algorithm and can be used as a basic framework for later research on pattern recognition in biological sequences.

  20. The quest for rare variants: pooled multiplexed next generation sequencing in plants.

    PubMed

    Marroni, Fabio; Pinosio, Sara; Morgante, Michele

    2012-01-01

    Next generation sequencing (NGS) instruments produce an unprecedented amount of sequence data at contained costs. This gives researchers the possibility of designing studies with adequate power to identify rare variants at a fraction of the economic and labor resources required by individual Sanger sequencing. As of today, few research groups working in plant sciences have exploited this potentiality, showing that pooled NGS provides results in excellent agreement with those obtained by individual Sanger sequencing. The aim of this review is to convey to the reader the general ideas underlying the use of pooled NGS for the identification of rare variants. To facilitate a thorough understanding of the possibilities of the method, we will explain in detail the possible experimental and analytical approaches and discuss their advantages and disadvantages. We will show that information on allele frequency obtained by pooled NGS can be used to accurately compute basic population genetics indexes such as allele frequency, nucleotide diversity, and Tajima's D. Finally, we will discuss applications and future perspectives of the multiplexed NGS approach.

  1. An Integrated Physical, Genetic and Cytogenetic Map of Brachypodium distachyon, a Model System for Grass Research

    PubMed Central

    Febrer, Melanie; Goicoechea, Jose Luis; Wright, Jonathan; McKenzie, Neil; Song, Xiang; Lin, Jinke; Collura, Kristi; Wissotski, Marina; Yu, Yeisoo; Ammiraju, Jetty S. S.; Wolny, Elzbieta; Idziak, Dominika; Betekhtin, Alexander; Kudrna, Dave; Hasterok, Robert; Wing, Rod A.; Bevan, Michael W.

    2010-01-01

    The pooid subfamily of grasses includes some of the most important crop, forage and turf species, such as wheat, barley and Lolium. Developing genomic resources, such as whole-genome physical maps, for analysing the large and complex genomes of these crops and for facilitating biological research in grasses is an important goal in plant biology. We describe a bacterial artificial chromosome (BAC)-based physical map of the wild pooid grass Brachypodium distachyon and integrate this with whole genome shotgun sequence (WGS) assemblies using BAC end sequences (BES). The resulting physical map contains 26 contigs spanning the 272 Mb genome. BES from the physical map were also used to integrate a genetic map. This provides an independent vaildation and confirmation of the published WGS assembly. Mapped BACs were used in Fluorescence In Situ Hybridisation (FISH) experiments to align the integrated physical map and sequence assemblies to chromosomes with high resolution. The physical, genetic and cytogenetic maps, integrated with whole genome shotgun sequence assemblies, enhance the accuracy and durability of this important genome sequence and will directly facilitate gene isolation. PMID:20976139

  2. Ancient genomics

    PubMed Central

    Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

    2015-01-01

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338

  3. India Allele Finder: a web-based annotation tool for identifying common alleles in next-generation sequencing data of Indian origin.

    PubMed

    Zhang, Jimmy F; James, Francis; Shukla, Anju; Girisha, Katta M; Paciorkowski, Alex R

    2017-06-27

    We built India Allele Finder, an online searchable database and command line tool, that gives researchers access to variant frequencies of Indian Telugu individuals, using publicly available fastq data from the 1000 Genomes Project. Access to appropriate population-based genomic variant annotation can accelerate the interpretation of genomic sequencing data. In particular, exome analysis of individuals of Indian descent will identify population variants not reflected in European exomes, complicating genomic analysis for such individuals. India Allele Finder offers improved ease-of-use to investigators seeking to identify and annotate sequencing data from Indian populations. We describe the use of India Allele Finder to identify common population variants in a disease quartet whole exome dataset, reducing the number of candidate single nucleotide variants from 84 to 7. India Allele Finder is freely available to investigators to annotate genomic sequencing data from Indian populations. Use of India Allele Finder allows efficient identification of population variants in genomic sequencing data, and is an example of a population-specific annotation tool that simplifies analysis and encourages international collaboration in genomics research.

  4. Ancient dna from pleistocene fossils: Preservation, recovery, and utility of ancient genetic information for quaternary research

    NASA Astrophysics Data System (ADS)

    Yang, Hong

    Until recently, recovery and analysis of genetic information encoded in ancient DNA sequences from Pleistocene fossils were impossible. Recent advances in molecular biology offered technical tools to obtain ancient DNA sequences from well-preserved Quaternary fossils and opened the possibilities to directly study genetic changes in fossil species to address various biological and paleontological questions. Ancient DNA studies involving Pleistocene fossil material and ancient DNA degradation and preservation in Quaternary deposits are reviewed. The molecular technology applied to isolate, amplify, and sequence ancient DNA is also presented. Authentication of ancient DNA sequences and technical problems associated with modern and ancient DNA contamination are discussed. As illustrated in recent studies on ancient DNA from proboscideans, it is apparent that fossil DNA sequence data can shed light on many aspects of Quaternary research such as systematics and phylogeny. conservation biology, evolutionary theory, molecular taphonomy, and forensic sciences. Improvement of molecular techniques and a better understanding of DNA degradation during fossilization are likely to build on current strengths and to overcome existing problems, making fossil DNA data a unique source of information for Quaternary scientists.

  5. Molecular-Sized DNA or RNA Sequencing Machine | NCI Technology Transfer Center | TTC

    Cancer.gov

    The National Cancer Institute's Gene Regulation and Chromosome Biology Laboratory is seeking statements of capability or interest from parties interested in collaborative research to co-develop a molecular-sized DNA or RNA sequencing machine.

  6. Project Apollo Flight Sequence

    NASA Image and Video Library

    1966-08-01

    Lunar Orbiter's "Typical Flight sequence of Events" turned out to be quite typical indeed, as all five spacecraft performed exactly as planned. -- Published in James R. Hansen, Spaceflight Revolution: NASA Langley Research Center From Sputnik to Apollo, (Washington: NASA, 1995), p. 340.

  7. Positive Evaluation of Student Answers in Classroom Instruction

    ERIC Educational Resources Information Center

    Margutti, Piera; Drew, Paul

    2014-01-01

    Within the context of teacher/whole-class instruction sequences, researchers have associated teacher evaluation of pupils' answers to forms of traditional pedagogic discourse, also referred to as "triadic dialogue", "monologic discourse", "recitation" and "Initiation-Response-Evaluation (IRE) sequences".…

  8. Sequence analysis of the genome of carnation (Dianthus caryophyllus L.).

    PubMed

    Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi

    2014-06-01

    The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼ 98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  9. Comparison of single cell sequencing data between two whole genome amplification methods on two sequencing platforms.

    PubMed

    Chen, DaYang; Zhen, HeFu; Qiu, Yong; Liu, Ping; Zeng, Peng; Xia, Jun; Shi, QianYu; Xie, Lin; Zhu, Zhu; Gao, Ya; Huang, GuoDong; Wang, Jian; Yang, HuanMing; Chen, Fang

    2018-03-21

    Research based on a strategy of single-cell low-coverage whole genome sequencing (SLWGS) has enabled better reproducibility and accuracy for detection of copy number variations (CNVs). The whole genome amplification (WGA) method and sequencing platform are critical factors for successful SLWGS (<0.1 × coverage). In this study, we compared single cell and multiple cells sequencing data produced by the HiSeq2000 and Ion Proton platforms using two WGA kits and then comprehensively evaluated the GC-bias, reproducibility, uniformity and CNV detection among different experimental combinations. Our analysis demonstrated that the PicoPLEX WGA Kit resulted in higher reproducibility, lower sequencing error frequency but more GC-bias than the GenomePlex Single Cell WGA Kit (WGA4 kit) independent of the cell number on the HiSeq2000 platform. While on the Ion Proton platform, the WGA4 kit (both single cell and multiple cells) had higher uniformity and less GC-bias but lower reproducibility than those of the PicoPLEX WGA Kit. Moreover, on these two sequencing platforms, depending on cell number, the performance of the two WGA kits was different for both sensitivity and specificity on CNV detection. The results can help researchers who plan to use SLWGS on single or multiple cells to select appropriate experimental conditions for their applications.

  10. Sequence verification of synthetic DNA by assembly of sequencing reads

    PubMed Central

    Wilson, Mandy L.; Cai, Yizhi; Hanlon, Regina; Taylor, Samantha; Chevreux, Bastien; Setubal, João C.; Tyler, Brett M.; Peccoud, Jean

    2013-01-01

    Gene synthesis attempts to assemble user-defined DNA sequences with base-level precision. Verifying the sequences of construction intermediates and the final product of a gene synthesis project is a critical part of the workflow, yet one that has received the least attention. Sequence validation is equally important for other kinds of curated clone collections. Ensuring that the physical sequence of a clone matches its published sequence is a common quality control step performed at least once over the course of a research project. GenoREAD is a web-based application that breaks the sequence verification process into two steps: the assembly of sequencing reads and the alignment of the resulting contig with a reference sequence. GenoREAD can determine if a clone matches its reference sequence. Its sophisticated reporting features help identify and troubleshoot problems that arise during the sequence verification process. GenoREAD has been experimentally validated on thousands of gene-sized constructs from an ORFeome project, and on longer sequences including whole plasmids and synthetic chromosomes. Comparing GenoREAD results with those from manual analysis of the sequencing data demonstrates that GenoREAD tends to be conservative in its diagnostic. GenoREAD is available at www.genoread.org. PMID:23042248

  11. Transcriptome analysis of Houttuynia cordata Thunb. by Illumina paired-end RNA sequencing and SSR marker discovery.

    PubMed

    Wei, Lin; Li, Shenghua; Liu, Shenggui; He, Anna; Wang, Dan; Wang, Jie; Tang, Yulian; Wu, Xianjin

    2014-01-01

    Houttuynia cordata Thunb. is an important traditional medical herb in China and other Asian countries, with high medicinal and economic value. However, a lack of available genomic information has become a limitation for research on this species. Thus, we carried out high-throughput transcriptomic sequencing of H. cordata to generate an enormous transcriptome sequence dataset for gene discovery and molecular marker development. Illumina paired-end sequencing technology produced over 56 million sequencing reads from H. cordata mRNA. Subsequent de novo assembly yielded 63,954 unigenes, 39,982 (62.52%) and 26,122 (40.84%) of which had significant similarity to proteins in the NCBI nonredundant protein and Swiss-Prot databases (E-value <10(-5)), respectively. Of these annotated unigenes, 30,131 and 15,363 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In addition, 24,434 (38.21%) unigenes were mapped onto 128 pathways using the KEGG pathway database and 17,964 (44.93%) unigenes showed homology to Vitis vinifera (Vitaceae) genes in BLASTx analysis. Furthermore, 4,800 cDNA SSRs were identified as potential molecular markers. Fifty primer pairs were randomly selected to detect polymorphism among 30 samples of H. cordata; 43 (86%) produced fragments of expected size, suggesting that the unigenes were suitable for specific primer design and of high quality, and the SSR marker could be widely used in marker-assisted selection and molecular breeding of H. cordata in the future. This is the first application of Illumina paired-end sequencing technology to investigate the whole transcriptome of H. cordata and to assemble RNA-seq reads without a reference genome. These data should help researchers investigating the evolution and biological processes of this species. The SSR markers developed can be used for construction of high-resolution genetic linkage maps and for gene-based association analyses in H. cordata. This work will enable future functional genomic research and research into the distinctive active constituents of this genus.

  12. A disruptive sequencer meets disruptive publishing.

    PubMed

    Loman, Nick; Goodwin, Sarah; Jansen, Hans; Loose, Matt

    2015-01-01

    Nanopore sequencing was recently made available to users in the form of the Oxford Nanopore MinION. Released to users through an early access programme, the MinION is made unique by its tiny form factor and ability to generate very long sequences from single DNA molecules. The platform is undergoing rapid evolution with three distinct nanopore types and five updates to library preparation chemistry in the last 18 months. To keep pace with the rapid evolution of this sequencing platform, and to provide a space where new analysis methods can be openly discussed, we present a new F1000Research channel devoted to updates to and analysis of nanopore sequence data.

  13. A Bayesian taxonomic classification method for 16S rRNA gene sequences with improved species-level accuracy.

    PubMed

    Gao, Xiang; Lin, Huaiying; Revanna, Kashi; Dong, Qunfeng

    2017-05-10

    Species-level classification for 16S rRNA gene sequences remains a serious challenge for microbiome researchers, because existing taxonomic classification tools for 16S rRNA gene sequences either do not provide species-level classification, or their classification results are unreliable. The unreliable results are due to the limitations in the existing methods which either lack solid probabilistic-based criteria to evaluate the confidence of their taxonomic assignments, or use nucleotide k-mer frequency as the proxy for sequence similarity measurement. We have developed a method that shows significantly improved species-level classification results over existing methods. Our method calculates true sequence similarity between query sequences and database hits using pairwise sequence alignment. Taxonomic classifications are assigned from the species to the phylum levels based on the lowest common ancestors of multiple database hits for each query sequence, and further classification reliabilities are evaluated by bootstrap confidence scores. The novelty of our method is that the contribution of each database hit to the taxonomic assignment of the query sequence is weighted by a Bayesian posterior probability based upon the degree of sequence similarity of the database hit to the query sequence. Our method does not need any training datasets specific for different taxonomic groups. Instead only a reference database is required for aligning to the query sequences, making our method easily applicable for different regions of the 16S rRNA gene or other phylogenetic marker genes. Reliable species-level classification for 16S rRNA or other phylogenetic marker genes is critical for microbiome research. Our software shows significantly higher classification accuracy than the existing tools and we provide probabilistic-based confidence scores to evaluate the reliability of our taxonomic classification assignments based on multiple database matches to query sequences. Despite its higher computational costs, our method is still suitable for analyzing large-scale microbiome datasets for practical purposes. Furthermore, our method can be applied for taxonomic classification of any phylogenetic marker gene sequences. Our software, called BLCA, is freely available at https://github.com/qunfengdong/BLCA .

  14. Getting to the root of plant biology: impact of the Arabidopsis genome sequence on root research

    PubMed Central

    Benfey, Philip N.; Bennett, Malcolm; Schiefelbein, John

    2010-01-01

    Summary Prior to the availability of the genome sequence, the root of Arabidopsis had attracted a small but ardent group of researchers drawn to its accessibility and developmental simplicity. Roots are easily observed when grown on the surface of nutrient agar media, facilitating analysis of responses to stimuli such as gravity and touch. Developmental biologists were attracted to the simple radial organization of primary root tissues, which form a series of concentric cylinders around the central vascular tissue. Equally attractive was the mode of propagation, with stem cells at the tip giving rise to progeny that were confined to cell files. These properties of root development reduced the normal four-dimensional problem of development (three spatial dimensions and time) to a two-dimensional problem, with cell type on the radial axis and developmental time along the longitudinal axis. The availability of the complete Arabidopsis genome sequence has dramatically accelerated traditional genetic research on root biology, and has also enabled entirely new experimental strategies to be applied. Here we review examples of the ways in which availability of the Arabidopsis genome sequence has enhanced progress in understanding root biology. PMID:20409273

  15. Normalization of Complete Genome Characteristics: Application to Evolution from Primitive Organisms to Homo sapiens.

    PubMed

    Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji

    2015-04-01

    Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.

  16. Biodiversity of air-borne microorganisms at Halley Station, Antarctica.

    PubMed

    Pearce, David A; Hughes, K A; Lachlan-Cope, T; Harangozo, S A; Jones, A E

    2010-03-01

    A study of air-borne microbial biodiversity over an isolated scientific research station on an ice-shelf in continental Antarctica was undertaken to establish the potential source of microbial colonists. The study aimed to assess: (1) whether microorganisms were likely to have a local (research station) or distant (marine or terrestrial) origin, (2) the effect of changes in sea ice extent on microbial biodiversity and (3) the potential human impact on the environment. Air samples were taken above Halley Research Station during the austral summer and austral winter over a 2-week period. Overall, a low microbial biodiversity was detected, which included many sequence replicates. No significant patterns were detected in the aerial biodiversity between the austral summer and the austral winter. In common with other environmental studies, particularly in the polar regions, many of the sequences obtained were from as yet uncultivated organisms. Very few marine sequences were detected irrespective of the distance to open water, and around one-third of sequences detected were similar to those identified in human studies, though both of these might reflect prevailing wind conditions. The detected aerial microorganisms were markedly different from those obtained in earlier studies over the Antarctic Peninsula in the maritime Antarctic.

  17. DMINDA: an integrated web server for DNA motif identification and analyses.

    PubMed

    Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

    2014-07-01

    DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Metagenomic ventures into outer sequence space.

    PubMed

    Dutilh, Bas E

    Sequencing DNA or RNA directly from the environment often results in many sequencing reads that have no homologs in the database. These are referred to as "unknowns," and reflect the vast unexplored microbial sequence space of our biosphere, also known as "biological dark matter." However, unknowns also exist because metagenomic datasets are not optimally mined. There is a pressure on researchers to publish and move on, and the unknown sequences are often left for what they are, and conclusions drawn based on reads with annotated homologs. This can cause abundant and widespread genomes to be overlooked, such as the recently discovered human gut bacteriophage crAssphage. The unknowns may be enriched for bacteriophage sequences, the most abundant and genetically diverse component of the biosphere and of sequence space. However, it remains an open question, what is the actual size of biological sequence space? The de novo assembly of shotgun metagenomes is the most powerful tool to address this question.

  19. Effects of Sequences of Cognitions on Group Performance Over Time

    PubMed Central

    Molenaar, Inge; Chiu, Ming Ming

    2017-01-01

    Extending past research showing that sequences of low cognitions (low-level processing of information) and high cognitions (high-level processing of information through questions and elaborations) influence the likelihoods of subsequent high and low cognitions, this study examines whether sequences of cognitions are related to group performance over time; 54 primary school students (18 triads) discussed and wrote an essay about living in another country (32,375 turns of talk). Content analysis and statistical discourse analysis showed that within each lesson, groups with more low cognitions or more sequences of low cognition followed by high cognition added more essay words. Groups with more high cognitions, sequences of low cognition followed by low cognition, or sequences of high cognition followed by an action followed by low cognition, showed different words and sequences, suggestive of new ideas. The links between cognition sequences and group performance over time can inform facilitation and assessment of student discussions. PMID:28490854

  20. Effects of Sequences of Cognitions on Group Performance Over Time.

    PubMed

    Molenaar, Inge; Chiu, Ming Ming

    2017-04-01

    Extending past research showing that sequences of low cognitions (low-level processing of information) and high cognitions (high-level processing of information through questions and elaborations) influence the likelihoods of subsequent high and low cognitions, this study examines whether sequences of cognitions are related to group performance over time; 54 primary school students (18 triads) discussed and wrote an essay about living in another country (32,375 turns of talk). Content analysis and statistical discourse analysis showed that within each lesson, groups with more low cognitions or more sequences of low cognition followed by high cognition added more essay words. Groups with more high cognitions, sequences of low cognition followed by low cognition, or sequences of high cognition followed by an action followed by low cognition, showed different words and sequences, suggestive of new ideas. The links between cognition sequences and group performance over time can inform facilitation and assessment of student discussions.

  1. SeqCompress: an algorithm for biological sequence compression.

    PubMed

    Sardaraz, Muhammad; Tahir, Muhammad; Ikram, Ataul Aziz; Bajwa, Hassan

    2014-10-01

    The growth of Next Generation Sequencing technologies presents significant research challenges, specifically to design bioinformatics tools that handle massive amount of data efficiently. Biological sequence data storage cost has become a noticeable proportion of total cost in the generation and analysis. Particularly increase in DNA sequencing rate is significantly outstripping the rate of increase in disk storage capacity, which may go beyond the limit of storage capacity. It is essential to develop algorithms that handle large data sets via better memory management. This article presents a DNA sequence compression algorithm SeqCompress that copes with the space complexity of biological sequences. The algorithm is based on lossless data compression and uses statistical model as well as arithmetic coding to compress DNA sequences. The proposed algorithm is compared with recent specialized compression tools for biological sequences. Experimental results show that proposed algorithm has better compression gain as compared to other existing algorithms. Copyright © 2014 Elsevier Inc. All rights reserved.

  2. NCBI-compliant genome submissions: tips and tricks to save time and money.

    PubMed

    Pirovano, Walter; Boetzer, Marten; Derks, Martijn F L; Smit, Sandra

    2017-03-01

    Genome sequences nowadays play a central role in molecular biology and bioinformatics. These sequences are shared with the scientific community through sequence databases. The sequence repositories of the International Nucleotide Sequence Database Collaboration (INSDC, comprising GenBank, ENA and DDBJ) are the largest in the world. Preparing an annotated sequence in such a way that it will be accepted by the database is challenging because many validation criteria apply. In our opinion, it is an undesirable situation that researchers who want to submit their sequence need either a lot of experience or help from partners to get the job done. To save valuable time and money, we list a number of recommendations for people who want to submit an annotated genome to a sequence database, as well as for tool developers, who could help to ease the process. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  3. Forensic Loci Allele Database (FLAD): Automatically generated, permanent identifiers for sequenced forensic alleles.

    PubMed

    Van Neste, Christophe; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip

    2016-01-01

    It is difficult to predict if and when massively parallel sequencing of forensic STR loci will replace capillary electrophoresis as the new standard technology in forensic genetics. The main benefits of sequencing are increased multiplexing scales and SNP detection. There is not yet a consensus on how sequenced profiles should be reported. We present the Forensic Loci Allele Database (FLAD) service, made freely available on http://forensic.ugent.be/FLAD/. It offers permanent identifiers for sequenced forensic alleles (STR or SNP) and their microvariants for use in forensic allele nomenclature. Analogous to Genbank, its aim is to provide permanent identifiers for forensically relevant allele sequences. Researchers that are developing forensic sequencing kits or are performing population studies, can register on http://forensic.ugent.be/FLAD/ and add loci and allele sequences with a short and simple application interface (API). Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  4. Conifer genomics and adaptation: at the crossroads of genetic diversity and genome function.

    PubMed

    Prunier, Julien; Verta, Jukka-Pekka; MacKay, John J

    2016-01-01

    Conifers have been understudied at the genomic level despite their worldwide ecological and economic importance but the situation is rapidly changing with the development of next generation sequencing (NGS) technologies. With NGS, genomics research has simultaneously gained in speed, magnitude and scope. In just a few years, genomes of 20-24 gigabases have been sequenced for several conifers, with several others expected in the near future. Biological insights have resulted from recent sequencing initiatives as well as genetic mapping, gene expression profiling and gene discovery research over nearly two decades. We review the knowledge arising from conifer genomics research emphasizing genome evolution and the genomic basis of adaptation, and outline emerging questions and knowledge gaps. We discuss future directions in three areas with potential inputs from NGS technologies: the evolutionary impacts of adaptation in conifers based on the adaptation-by-speciation model; the contributions of genetic variability of gene expression in adaptation; and the development of a broader understanding of genetic diversity and its impacts on genome function. These research directions promise to sustain research aimed at addressing the emerging challenges of adaptation that face conifer trees. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  5. The A, C, G, and T of Genome Assembly

    PubMed Central

    Wajid, Bilal; Sohail, Muhammad U.; Ekti, Ali R.; Serpedin, Erchin

    2016-01-01

    Genome assembly in its two decades of history has produced significant research, in terms of both biotechnology and computational biology. This contribution delineates sequencing platforms and their characteristics, examines key steps involved in filtering and processing raw data, explains assembly frameworks, and discusses quality statistics for the assessment of the assembled sequence. Furthermore, the paper explores recent Ubuntu-based software environments oriented towards genome assembly as well as some avenues for future research. PMID:27247941

  6. Snake Genome Sequencing: Results and Future Prospects

    PubMed Central

    Kerkkamp, Harald M. I.; Kini, R. Manjunatha; Pospelov, Alexey S.; Vonk, Freek J.; Henkel, Christiaan V.; Richardson, Michael K.

    2016-01-01

    Snake genome sequencing is in its infancy—very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression. PMID:27916957

  7. Snake Genome Sequencing: Results and Future Prospects.

    PubMed

    Kerkkamp, Harald M I; Kini, R Manjunatha; Pospelov, Alexey S; Vonk, Freek J; Henkel, Christiaan V; Richardson, Michael K

    2016-12-01

    Snake genome sequencing is in its infancy-very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression.

  8. Evaluating a robust contour tracker on echocardiographic sequences.

    PubMed

    Jacob, G; Noble, J A; Mulet-Parada, M; Blake, A

    1999-03-01

    In this paper we present an evaluation of a robust visual image tracker on echocardiographic image sequences. We show how the tracking framework can be customized to define an appropriate shape space that describes heart shape deformations that can be learnt from a training data set. We also investigate energy-based temporal boundary enhancement methods to improve image feature measurement. Results are presented demonstrating real-time tracking on real normal heart motion data sequences and abnormal synthesized and real heart motion data sequences. We conclude by discussing some of our current research efforts.

  9. Modern Computational Techniques for the HMMER Sequence Analysis

    PubMed Central

    2013-01-01

    This paper focuses on the latest research and critical reviews on modern computing architectures, software and hardware accelerated algorithms for bioinformatics data analysis with an emphasis on one of the most important sequence analysis applications—hidden Markov models (HMM). We show the detailed performance comparison of sequence analysis tools on various computing platforms recently developed in the bioinformatics society. The characteristics of the sequence analysis, such as data and compute-intensive natures, make it very attractive to optimize and parallelize by using both traditional software approach and innovated hardware acceleration technologies. PMID:25937944

  10. Data Analysis of Sequences and qPCR for Microbial Communities during Algal Blooms

    EPA Pesticide Factsheets

    A training opportunity is open to a highly microbial-research-motivated student to conduct sequence analysis, explore novel genes and metabolic pathways, validate resultant findings using qPCR/RT-qPCR and summarize the findings

  11. Using timed event sequential data in nursing research.

    PubMed

    Pecanac, Kristen E; Doherty-King, Barbara; Yoon, Ju Young; Brown, Roger; Schiefelbein, Tony

    2015-01-01

    Measuring behavior is important in nursing research, and innovative technologies are needed to capture the "real-life" complexity of behaviors and events. The purpose of this article is to describe the use of timed event sequential data in nursing research and to demonstrate the use of this data in a research study. Timed event sequencing allows the researcher to capture the frequency, duration, and sequence of behaviors as they occur in an observation period and to link the behaviors to contextual details. Timed event sequential data can easily be collected with handheld computers, loaded with a software program designed for capturing observations in real time. Timed event sequential data add considerable strength to analysis of any nursing behavior of interest, which can enhance understanding and lead to improvement in nursing practice.

  12. Social network analysis of a project-based introductory physics course

    NASA Astrophysics Data System (ADS)

    Oakley, Christopher

    2016-03-01

    Research suggests that students benefit from peer interaction and active engagement in the classroom. The quality, nature, effect of these interactions is currently being explored by Physics Education Researchers. Spelman College offers an introductory physics sequence that addresses content and research skills by engaging students in open-ended research projects, a form of Project-Based Learning. Students have been surveyed at regular intervals during the second semester of trigonometry-based course to determine the frequency of interactions in and out of class. These interactions can be with current or past students, tutors, and instructors. This line of inquiry focuses on metrics of Social Network analysis, such as centrality of participants as well as segmentation of groups. Further research will refine and highlight deeper questions regarding student performance in this pedagogy and course sequence.

  13. Research in Stochastic Processes.

    DTIC Science & Technology

    1982-12-01

    constant high level boundary. References 1. Jurg Husler , Extremie values of non-stationary sequ-ences ard the extr-rmal index, Center for Stochastic...A. Weron, Oct. 82. 20. "Extreme values of non-stationary sequences and the extremal index." Jurg Husler , Oct. 82. 21. "A finitely additive white noise...string model, Y. Miyahara, Carleton University and Nagoya University. Sept. 22 On extremfe values of non-stationary sequences, J. Husler , University of

  14. Complete genome sequence of Anaeromyxobacter sp. Fw109-5, an Anaerobic, Metal-Reducing Bacterium Isolated from a Contaminated Subsurface Environment

    DOE PAGES

    Hwang, C.; Copeland, A.; Lucas, Susan; ...

    2015-01-22

    We report the genome sequence of Anaeromyxobacter sp. Fw109-5, isolated from nitrate- and uranium-contaminated subsurface sediment of the Oak Ridge Integrated Field-Scale Subsurface Research Challenge (IFC) site, Oak Ridge Reservation, TN. The bacterium’s genome sequence will elucidate its physiological potential in subsurface sediments undergoing in situ uranium bioremediation and natural attenuation.

  15. Bioinformatic Workflows for Generating Complete Plastid Genome Sequences-An Example from Cabomba (Cabombaceae) in the Context of the Phylogenomic Analysis of the Water-Lily Clade.

    PubMed

    Gruenstaeudl, Michael; Gerschler, Nico; Borsch, Thomas

    2018-06-21

    The sequencing and comparison of plastid genomes are becoming a standard method in plant genomics, and many researchers are using this approach to infer plant phylogenetic relationships. Due to the widespread availability of next-generation sequencing, plastid genome sequences are being generated at breakneck pace. This trend towards massive sequencing of plastid genomes highlights the need for standardized bioinformatic workflows. In particular, documentation and dissemination of the details of genome assembly, annotation, alignment and phylogenetic tree inference are needed, as these processes are highly sensitive to the choice of software and the precise settings used. Here, we present the procedure and results of sequencing, assembling, annotating and quality-checking of three complete plastid genomes of the aquatic plant genus Cabomba as well as subsequent gene alignment and phylogenetic tree inference. We accompany our findings by a detailed description of the bioinformatic workflow employed. Importantly, we share a total of eleven software scripts for each of these bioinformatic processes, enabling other researchers to evaluate and replicate our analyses step by step. The results of our analyses illustrate that the plastid genomes of Cabomba are highly conserved in both structure and gene content.

  16. BreaKmer: detection of structural variation in targeted massively parallel sequencing data using kmers.

    PubMed

    Abo, Ryan P; Ducar, Matthew; Garcia, Elizabeth P; Thorner, Aaron R; Rojas-Rudilla, Vanesa; Lin, Ling; Sholl, Lynette M; Hahn, William C; Meyerson, Matthew; Lindeman, Neal I; Van Hummelen, Paul; MacConaill, Laura E

    2015-02-18

    Genomic structural variation (SV), a common hallmark of cancer, has important predictive and therapeutic implications. However, accurately detecting SV using high-throughput sequencing data remains challenging, especially for 'targeted' resequencing efforts. This is critically important in the clinical setting where targeted resequencing is frequently being applied to rapidly assess clinically actionable mutations in tumor biopsies in a cost-effective manner. We present BreaKmer, a novel approach that uses a 'kmer' strategy to assemble misaligned sequence reads for predicting insertions, deletions, inversions, tandem duplications and translocations at base-pair resolution in targeted resequencing data. Variants are predicted by realigning an assembled consensus sequence created from sequence reads that were abnormally aligned to the reference genome. Using targeted resequencing data from tumor specimens with orthogonally validated SV, non-tumor samples and whole-genome sequencing data, BreaKmer had a 97.4% overall sensitivity for known events and predicted 17 positively validated, novel variants. Relative to four publically available algorithms, BreaKmer detected SV with increased sensitivity and limited calls in non-tumor samples, key features for variant analysis of tumor specimens in both the clinical and research settings. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.

    PubMed

    Taheri, Sima; Lee Abdullah, Thohirah; Yusop, Mohd Rafii; Hanafi, Mohamed Musa; Sahebi, Mahbod; Azizi, Parisa; Shamshiri, Redmond Ramin

    2018-02-13

    Microsatellites, or simple sequence repeats (SSRs), are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial number of microsatellites at less cost and effort than traditional approaches. Illumina is a noteworthy transcriptome sequencing technology that is currently used in SSR marker development. Although 454 pyrosequencing datasets can be used for SSR development, this type of sequencing is no longer supported. This review aims to present an overview of the next generation sequencing, with a focus on the efficient use of de novo transcriptome sequencing (RNA-Seq) and related tools for mining and development of microsatellites in plants.

  18. Object tracking using plenoptic image sequences

    NASA Astrophysics Data System (ADS)

    Kim, Jae Woo; Bae, Seong-Joon; Park, Seongjin; Kim, Do Hyung

    2017-05-01

    Object tracking is a very important problem in computer vision research. Among the difficulties of object tracking, partial occlusion problem is one of the most serious and challenging problems. To address the problem, we proposed novel approaches to object tracking on plenoptic image sequences. Our approaches take advantage of the refocusing capability that plenoptic images provide. Our approaches input the sequences of focal stacks constructed from plenoptic image sequences. The proposed image selection algorithms select the sequence of optimal images that can maximize the tracking accuracy from the sequence of focal stacks. Focus measure approach and confidence measure approach were proposed for image selection and both of the approaches were validated by the experiments using thirteen plenoptic image sequences that include heavily occluded target objects. The experimental results showed that the proposed approaches were satisfactory comparing to the conventional 2D object tracking algorithms.

  19. Advanced Applications of Next-Generation Sequencing Technologies to Orchid Biology.

    PubMed

    Yeh, Chuan-Ming; Liu, Zhong-Jian; Tsai, Wen-Chieh

    2018-01-01

    Next-generation sequencing technologies are revolutionizing biology by permitting, transcriptome sequencing, whole-genome sequencing and resequencing, and genome-wide single nucleotide polymorphism profiling. Orchid research has benefited from this breakthrough, and a few orchid genomes are now available; new biological questions can be approached and new breeding strategies can be designed. The first part of this review describes the unique features of orchid biology. The second part provides an overview of the current next-generation sequencing platforms, many of which are already used in plant laboratories. The third part summarizes the state of orchid transcriptome and genome sequencing and illustrates current achievements. The genetic sequences currently obtained will not only provide a broad scope for the study of orchid biology, but also serves as a starting point for uncovering the mystery of orchid evolution.

  20. The power and promise of RNA-seq in ecology and evolution.

    PubMed

    Todd, Erica V; Black, Michael A; Gemmell, Neil J

    2016-03-01

    Reference is regularly made to the power of new genomic sequencing approaches. Using powerful technology, however, is not the same as having the necessary power to address a research question with statistical robustness. In the rush to adopt new and improved genomic research methods, limitations of technology and experimental design may be initially neglected. Here, we review these issues with regard to RNA sequencing (RNA-seq). RNA-seq adds large-scale transcriptomics to the toolkit of ecological and evolutionary biologists, enabling differential gene expression (DE) studies in nonmodel species without the need for prior genomic resources. High biological variance is typical of field-based gene expression studies and means that larger sample sizes are often needed to achieve the same degree of statistical power as clinical studies based on data from cell lines or inbred animal models. Sequencing costs have plummeted, yet RNA-seq studies still underutilize biological replication. Finite research budgets force a trade-off between sequencing effort and replication in RNA-seq experimental design. However, clear guidelines for negotiating this trade-off, while taking into account study-specific factors affecting power, are currently lacking. Study designs that prioritize sequencing depth over replication fail to capitalize on the power of RNA-seq technology for DE inference. Significant recent research effort has gone into developing statistical frameworks and software tools for power analysis and sample size calculation in the context of RNA-seq DE analysis. We synthesize progress in this area and derive an accessible rule-of-thumb guide for designing powerful RNA-seq experiments relevant in eco-evolutionary and clinical settings alike. © 2016 John Wiley & Sons Ltd.

Top