Science.gov

Sample records for acid sequence suggested

  1. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. Images PMID:9279371

  2. Uses of phage display in agriculture: sequence analysis and comparative modeling of late embryogenesis abundant client proteins suggest protein-nucleic acid binding functionality.

    PubMed

    Kushwaha, Rekha; Downie, A Bruce; Payne, Christina M

    2013-01-01

    A group of intrinsically disordered, hydrophilic proteins-Late Embryogenesis Abundant (LEA) proteins-has been linked to survival in plants and animals in periods of stress, putatively through safeguarding enzymatic function and prevention of aggregation in times of dehydration/heat. Yet despite decades of effort, the molecular-level mechanisms defining this protective function remain unknown. A recent effort to understand LEA functionality began with the unique application of phage display, wherein phage display and biopanning over recombinant Seed Maturation Protein homologs from Arabidopsis thaliana and Glycine max were used to retrieve client proteins at two different temperatures, with one intended to represent heat stress. From this previous study, we identified 21 client proteins for which clones were recovered, sometimes repeatedly. Here, we use sequence analysis and homology modeling of the client proteins to ascertain common sequence and structural properties that may contribute to binding affinity with the protective LEA protein. Our methods uncover what appears to be a predilection for protein-nucleic acid interactions among LEA client proteins, which is suggestive of subcellular residence. The results from this initial computational study will guide future efforts to uncover the protein protective mechanisms during heat stress, potentially leading to phage-display-directed evolution of synthetic LEA molecules.

  3. California foreshock sequences suggest aseismic triggering process

    NASA Astrophysics Data System (ADS)

    Chen, Xiaowei; Shearer, Peter M.

    2013-06-01

    Foreshocks are one of the few well-documented precursors to large earthquakes; therefore, understanding their nature is very important for earthquake prediction and hazard mitigation. However, the triggering role of foreshocks is not yet clear. It is possible that foreshocks are a self-triggering cascade of events that simply happen to trigger an unusually large aftershock; alternatively, foreshocks might originate from an external aseismic process that ultimately triggers the mainshock. In the former case, the foreshocks will have limited utility for forecasting. The latter case has been observed for several individual large earthquakes; however, it remains unclear how common it is and how to distinguish foreshock sequences from other seismicity clusters that do not lead to large earthquakes. Here we analyze foreshocks of three M>7 mainshocks in southern California. These foreshock sequences appear similar to earthquake swarms, in that they do not start with their largest events and they exhibit spatial migration of seismicity. Analysis of source spectra shows that all three foreshock sequences feature lower average stress drops and depletion of high-frequency energy compared with the aftershocks of their corresponding mainshocks. Using a longer-term stress-drop catalog, we find that the average stress drop of the Landers and Hector Mine foreshock sequences is comparable to nearby swarms. Our observations suggest that these foreshock sequences are manifestations of aseismic transients occurring close to the mainshock hypocenters, possibly related to localized fault zone complexity, which have promoted the occurrence of both the foreshocks and the eventual mainshock.

  4. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  5. High speed nucleic acid sequencing

    SciTech Connect

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  6. Ribosomal RNA sequence suggest microsporidia are extremely ancient eukaryotes

    NASA Technical Reports Server (NTRS)

    Vossbrinck, C. R.; Maddox, J. V.; Friedman, S.; Debrunner-Vossbrinck, B. A.; Woese, C. R.

    1987-01-01

    A comparative sequence analysis of the 18S small subunit ribosomal RNA (rRNA) of the microsporidium Vairimorpha necatrix is presented. The results show that this rRNA sequence is more unlike those of other eukaryotes than any known eukaryote rRNA sequence. It is concluded that the lineage leading to microsporidia branched very early from that leading to other eukaryotes.

  7. Mitochondrial sequence variation suggests an African influence in Portuguese cattle.

    PubMed Central

    Cymbron, T; Loftus, R T; Malheiro, M I; Bradley, D G

    1999-01-01

    A total of 49 samples from indigenous Portuguese cattle breeds were analysed for sequence variation in the hypervariable region of the mitochondrial DNA D-loop. Sequence comparison and phylogenetic analyses revealed that haplotypes fell into two distinct groups. These corresponded with two separate haplotype clusters into which, respectively, all African, or alternatively all sequences of European origin, have previously been shown to fall. Here, the majority of sequences of African type were encountered in three southern, as compared to three northern breeds. This pattern of African influence may reflect an intercontinental admixture in the initial origins of Iberian breeds, or it is perhaps an introgression dating from the long and influential Moorish occupation of the south of the Iberian peninsula. PMID:10212450

  8. Agouti sequence polymorphisms in coyotes, wolves and dogs suggest hybridization.

    PubMed

    Schmutz, Sheila M; Berryere, Thomas G; Barta, Jodi L; Reddick, Kimberley D; Schmutz, Josef K

    2007-01-01

    Domestic dogs have been shown to have multiple alleles of the Agouti Signal Peptide (ASIP) in exon 4 and we wished to determine the level of polymorphism in the common wild canids of Canada, wolves and coyotes, in comparison. All Canadian coyotes and most wolves have banded hairs. The ASIP coding sequence of the wolf did not vary from the domestic dog but one variant was detected in exon 4 of coyotes that did not alter the arginine at this position. Two other differences were found in the sequence flanking exon 4 of coyotes compared with the 45 dogs and 1 wolf. The coyotes also demonstrated a relatively common polymorphism in the 3' UTR sequence that could be used for population studies. One of the ASIP alleles (R96C) in domestic dogs causes a solid black coat color in homozygotes. Although some wolves are melanistic, this phenotype does not appear to be caused by this same mutation. However, one wolf, potentially a dog-wolf hybrid or descendant thereof, was heterozygous for this allele. Likewise 2 coyotes, potentially dog-coyote or wolf-coyote hybrid descendants, were heterozygous for the several polymorphisms in and flanking exon 4. We could conclude that these were coyote-dog hybrids because both were heterozygous for 2 mutations causing fawn coat color in dogs.

  9. Sequence of the bphD gene encoding 2-hydroxy-6-oxo-(phenyl/chlorophenyl)hexa-2,4-dienoic acid (HOP/cPDA) hydrolase involved in the biphenyl/polychlorinated biphenyl degradation pathway in Comamonas testosteroni: evidence suggesting involvement of Ser112 in catalytic activity.

    PubMed

    Ahmad, D; Fraser, J; Sylvestre, M; Larose, A; Khan, A; Bergeron, J; Juteau, J M; Sondossi, M

    1995-04-14

    The nucleotide sequence of bphD, encoding 2-hydroxy-6-oxo-(phenyl/chlorophenyl)hexa-2,4-dienoic acid hydrolase involved in the biphenyl/polychlorinated biphenyl degradation pathway of Comamonas testosteroni strain B-356, was determined. Comparison of the deduced amino-acid sequence with published sequences led to the identification of a 'lipase box', containing a consensus pentapeptide sequence GlyXaaSerXaaGly. This suggested that the mechanism of action of this enzyme may involve an Asp-Ser-His catalytic triad similar to that of classical lipases and serine hydrolases. Further biochemical and genetic evidence for the active-site involvement of Ser112 was obtained by showing that a semipurified enzyme was inhibited by PMSF, a classic inhibitor of serine hydrolases, and by site-directed Ser112-->Ala mutagenesis.

  10. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  11. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criteria for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  12. The complete amino acid sequence of prochymosin.

    PubMed Central

    Foltmann, B; Pedersen, V B; Jacobsen, H; Kauffman, D; Wybrandt, G

    1977-01-01

    The total sequence of 365 amino acid residues in bovine prochymosin is presented. Alignment with the amino acid sequence of porcine pepsinogen shows that 204 amino acid residues are common to the two zymogens. Further comparison and alignment with the amino acid sequence of penicillopepsin shows that 66 residues are located at identical positions in all three proteases. The three enzymes belong to a large group of proteases with two aspartate residues in the active center. This group forms a family derived from one common ancestor. PMID:329280

  13. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  14. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  15. Comparative genomic analysis of equilibrative nucleoside transporters suggests conserved protein structure despite limited sequence identity.

    PubMed

    Sankar, Narendra; Machado, Jerry; Abdulla, Parween; Hilliker, Arthur J; Coe, Imogen R

    2002-10-15

    Equilibrative nucleoside transporters (ENTs) are a recently characterized and poorly understood group of membrane proteins that are important in the uptake of endogenous nucleosides required for nucleic acid and nucleoside triphosphate synthesis. Despite their central importance in cellular metabolism and nucleoside analog chemotherapy, no human ENT gene has been described and nothing is known about gene structure and function. To gain insight into the ENT gene family, we used experimental and in silico comparative genomic approaches to identify ENT genes in three evolutionarily diverse organisms with completely (or almost completely) sequenced genomes, Homo sapiens, Caenorhabditis elegans and Drosophila melanogaster. We describe the chromosomal location, the predicted ENT gene structure and putative structural topologies of predicted ENT proteins derived from the open reading frames. Despite variations in genomic layout and limited ortholog protein sequence identity (< or =27.45%), predicted topologies of ENT proteins are strikingly similar, suggesting an evolutionary conservation of a prototypic structure. In addition, a similar distribution of protein domains on exons is apparent in all three taxa. These data demonstrate that comparative sequence analyses should be combined with other approaches (such as genomic and proteomic analyses) to fully understand structure, function and evolution of protein families.

  16. Amino acid sequence of mouse submaxillary gland renin.

    PubMed Central

    Misono, K S; Chang, J J; Inagami, T

    1982-01-01

    The complete amino acid sequences of the heavy chain and light chain of mouse submaxillary gland renin have been determined. The heavy chain consists of 288 amino acid residues having a Mr of 31,036 calculated from the sequence. The light chain contains 48 amino acid residues with a Mr of 5,458. The sequence of the heavy chain was determined by automated Edman degradations of the cyanogen bromide peptides and tryptic peptides generated after citraconylation, as well as other peptides generated therefrom. The sequence of the light chain was derived from sequence analyses of the peptides generated by cyanogen bromide cleavage or by digestion with Staphylococcus aureus protease. The sequences in the active site regions in renin containing two catalytically essential aspartyl residues 32 and 215 were found identical with those in pepsin, chymosin, and penicillopepsin. Comparison of the amino acid sequence of renin with that of porcine pepsin indicated a 42% sequence identity of the heavy chain with the amino-terminal and middle regions and a 46% identity of the light chain with the carboxyl-terminal region of the porcine pepsin sequence. Residues identical in renin and pepsin are distributed throughout the length of the molecules, suggesting a similarity in their overall structures. PMID:6812055

  17. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  18. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  19. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  20. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  1. Differential expression in normal-adenoma-carcinoma sequence suggests complex molecular carcinogenesis in colon.

    PubMed

    Lee, Seungkoo; Bang, Seunghyun; Song, Kyuyoung; Lee, Inchul

    2006-10-01

    The majority of colon cancers develop from pre-existing adenomas. We analyzed the expression profiles in the sequence of normal colon crypts, adenomas and early-stage carcinomas using microdissected cells from tubular adenomas with foci of malignant transformation. Differentially expressed genes were detected between normal-adenoma and adenoma-carcinoma, and were grouped according to the patterns of expression changes in the sequence. Down-regulated genes in the sequence included PLA2G2A, TSPAN1, PDCD4, FCGBP, AATK, EPLIN, FABP1, AGR2, MTUS1, TSC1, galectin 4 and MT1F. PLA2G2A has been shown to suppress colon tumorigenesis in mice, but the pathobiological role in humans has been controversial. Our data showed continuous down-regulation of PLA2G2A in the sequence supporting an implication in human colon cancer. Tumor suppressor and/ or proapoptotic activities have also been reported in other genes. Up-regulated genes included ribosomal proteins, IER3 and TPR. TGF-beta2 and matrix metalloproteinase 23B were up-regulated in carcinoma but not in adenoma, supporting the pathobiological roles in malignant transformation. Differentially expressed genes partly coincided with those in the adenoma-carcinoma sequence of the stomach, which was published previously, suggesting a partial overlap between the adenoma-carcinoma sequences of the colon and stomach.

  2. Extensive amino acid sequence homologies between animal lectins

    SciTech Connect

    Paroutaud, P.; Levi, G.; Teichberg, V.I.; Strosberg, A.D.

    1987-09-01

    The authors have established the amino acid sequence of the ..beta..-D-galactoside binding lectin from the electric eel and the sequences of several peptides from a similar lectin isolated from human placenta. These sequences were compared with the published sequences of peptides derived from the ..beta..-D-galactoside binding lectin from human lung and with sequences deduced from cDNAs assigned to the ..beta..-D-galactoside binding lectins from chicken embryo skin and human hepatomas. Significant homologies were observed. One of the highly conserved regions that contains a tryptophan residue and two glutamic acid resides is probably part of the ..beta..-D-galactoside binding site, which, on the basis of spectroscopic studies of the electric eel lectin, is expected to contain such residues. The similarity of the hydropathy profiles and the predicted secondary structure of the lectins from chicken skin and electric eel, in spite of differences in their amino acid sequences, strongly suggests that these proteins have maintained structural homologies during evolution and together with the other ..beta..-D-galactoside binding lectins were derived form a common ancestor gene.

  3. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  4. Boric acid inhibits embryonic histone deacetylases: a suggested mechanism to explain boric acid-related teratogenicity.

    PubMed

    Di Renzo, Francesca; Cappelletti, Graziella; Broccia, Maria L; Giavini, Erminio; Menegola, Elena

    2007-04-15

    Histone deacetylases (HDAC) control gene expression by changing histonic as well as non histonic protein conformation. HDAC inhibitors (HDACi) are considered to be among the most promising drugs for epigenetic treatment for cancer. Recently a strict relationship between histone hyperacetylation in specific tissues of mouse embryos exposed to two HDACi (valproic acid and trichostatin A) and specific axial skeleton malformations has been demonstrated. The aim of this study is to verify if boric acid (BA), that induces in rodents malformations similar to those valproic acid and trichostatin A-related, acts through similar mechanisms: HDAC inhibition and histone hyperacetylation. Pregnant mice were treated intraperitoneally with a teratogenic dose of BA (1000 mg/kg, day 8 of gestation). Western blot analysis and immunostaining were performed with anti hyperacetylated histone 4 (H4) antibody on embryos explanted 1, 3 or 4 h after treatment and revealed H4 hyperacetylation at the level of somites. HDAC enzyme assay was performed on embryonic nuclear extracts. A significant HDAC inhibition activity (compatible with a mixed type partial inhibition mechanism) was evident with BA. Kinetic analyses indicate that BA modifies substrate affinity by a factor alpha=0.51 and maximum velocity by a factor beta=0.70. This work provides the first evidence for HDAC inhibition by BA and suggests such a molecular mechanism for the induction of BA-related malformations.

  5. Boric acid inhibits embryonic histone deacetylases: A suggested mechanism to explain boric acid-related teratogenicity

    SciTech Connect

    Di Renzo, Francesca; Cappelletti, Graziella; Broccia, Maria L.; Giavini, Erminio; Menegola, Elena . E-mail: elena.menegola@unimi.it

    2007-04-15

    Histone deacetylases (HDAC) control gene expression by changing histonic as well as non histonic protein conformation. HDAC inhibitors (HDACi) are considered to be among the most promising drugs for epigenetic treatment for cancer. Recently a strict relationship between histone hyperacetylation in specific tissues of mouse embryos exposed to two HDACi (valproic acid and trichostatin A) and specific axial skeleton malformations has been demonstrated. The aim of this study is to verify if boric acid (BA), that induces in rodents malformations similar to those valproic acid and trichostatin A-related, acts through similar mechanisms: HDAC inhibition and histone hyperacetylation. Pregnant mice were treated intraperitoneally with a teratogenic dose of BA (1000 mg/kg, day 8 of gestation). Western blot analysis and immunostaining were performed with anti hyperacetylated histone 4 (H4) antibody on embryos explanted 1, 3 or 4 h after treatment and revealed H4 hyperacetylation at the level of somites. HDAC enzyme assay was performed on embryonic nuclear extracts. A significant HDAC inhibition activity (compatible with a mixed type partial inhibition mechanism) was evident with BA. Kinetic analyses indicate that BA modifies substrate affinity by a factor {alpha} = 0.51 and maximum velocity by a factor {beta} = 0.70. This work provides the first evidence for HDAC inhibition by BA and suggests such a molecular mechanism for the induction of BA-related malformations.

  6. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  7. Phylogenetic analyses of complete mitochondrial genome sequences suggest a basal divergence of the enigmatic rodent Anomalurus

    PubMed Central

    Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano

    2007-01-01

    Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612

  8. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately equal to 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  9. Los Alamos sequence analysis package for nucleic acids and proteins.

    PubMed Central

    Kanehisa, M I

    1982-01-01

    An interactive system for computer analysis of nucleic acid and protein sequences has been developed for the Los Alamos DNA Sequence Database. It provides a convenient way to search or verify various sequence features, e.g., restriction enzyme sites, protein coding frames, and properties of coded proteins. Further, the comprehensive analysis package on a large-scale database can be used for comparative studies on sequence and structural homologies in order to find unnoted information stored in nucleic acid sequences. PMID:6174934

  10. Exome sequence analysis suggests genetic burden contributes to phenotypic variability and complex neuropathy

    PubMed Central

    Gonzaga-Jauregui, Claudia; Harel, Tamar; Gambin, Tomasz; Kousi, Maria; Griffin, Laurie B.; Francescatto, Ludmila; Ozes, Burcak; Karaca, Ender; Jhangiani, Shalini; Bainbridge, Matthew N.; Lawson, Kim S.; Pehlivan, Davut; Okamoto, Yuji; Withers, Marjorie; Mancias, Pedro; Slavotinek, Anne; Reitnauer, Pamela J; Goksungur, Meryem T.; Shy, Michael; Crawford, Thomas O.; Koenig, Michel; Willer, Jason; Flores, Brittany N.; Pediaditrakis, Igor; Us, Onder; Wiszniewski, Wojciech; Parman, Yesim; Antonellis, Anthony; Muzny, Donna M.; Katsanis, Nicholas; Battaloglu, Esra; Boerwinkle, Eric; Gibbs, Richard A.; Lupski, James R.

    2015-01-01

    Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous distal symmetric polyneuropathy. Whole-exome sequencing (WES) of 40 individuals from 37 unrelated families with CMT-like peripheral neuropathy refractory to molecular diagnosis identified apparent causal mutations in ~45% (17/37) of families. Three candidate disease genes are proposed, supported by a combination of genetic and in vivo studies. Aggregate analysis of mutation data revealed a significantly increased number of rare variants across 58 neuropathy associated genes in subjects versus controls; confirmed in a second ethnically discrete neuropathy cohort, suggesting mutation burden potentially contributes to phenotypic variability. Neuropathy genes shown to have highly penetrant Mendelizing variants (HMPVs) and implicated by burden in families were shown to interact genetically in a zebrafish assay exacerbating the phenotype established by the suppression of single genes. Our findings suggest that the combinatorial effect of rare variants contributes to disease burden and variable expressivity. PMID:26257172

  11. Multiple pathways for steel regulation suggested by genomic and sequence analysis of the murine Steel gene

    SciTech Connect

    Bedell, M.A.; Copeland, N.G.; Jenkins, N.A.

    1996-03-01

    The Steel (Sl) locus encodes mast cell growth factor (Mgf) that is required for the development of germ cells, hematopoietic cells and melanocytes. Although the expression patterns of the Mgf gene are well characterized, little is known of the factors which regulate its expression. Here, we describe the cloning and sequence of the full-length transcription unit and the 5{prime} flanking region of the murine Mgf gene. The full-length Mgf mRNA consists of a short 5{prime} untranslated region (UTR), a 0.8-kb ORF and a long 3{prime} UTR. A single transcription initiation site is used in a number of mouse tissues and is located just downstream of binding sites for several known transcription factors. In the 5{prime} UTR, two ATGs were found upstream of the initiator methionine and are conserved among different species, suggesting that Mgf may be translationally regulated. At least two Mgf mRNAs are produced by alternative use of polyadenylation sites, but numerous other potential polyadenylation sites were found in the 3{prime} UTR. In addition, the 3{prime} UTR contains numerous sequence motifs that may regulate Mgf mRNA stability. These studies suggest multiple ways in which expression of Mgf may be regulated. 39 refs., 4 figs.

  12. Multiple Pathways for Steel Regulation Suggested by Genomic and Sequence Analysis of the Murine Steel Gene

    PubMed Central

    Bedell, M. A.; Copeland, N. G.; Jenkins, N. A.

    1996-01-01

    The Steel (Sl) locus encodes mast cell growth factor (Mgf) that is required for the development of germ cells, hematopoietic cells and melanocytes. Although the expression patterns of the Mgf gene are well characterized, little is known of the factors which regulate its expression. Here, we describe the cloning and sequence of the full-length transcription unit and the 5' flanking region of the murine Mgf gene. The full-length Mgf mRNA consists of a short 5' untranslated region (UTR), a 0.8-kb ORF and a long 3' UTR. A single transcription initiation site is used in a number of mouse tissues and is located just downstream of binding sites for several known transcription factors. In the 5' UTR, two ATGs were found upstream of the initiator methionine and are conserved among different species, suggesting that Mgf may be translationally regulated. At least two Mgf mRNAs are produced by alternative use of polyadenylation sites, but numerous other potential polyadenylation sites were found in the 3' UTR. In addition, the 3' UTR contains numerous sequence motifs that may regulate Mgf mRNA stability. These studies suggest multiple ways in which expression of Mgf may be regulated. PMID:8849898

  13. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  14. Sequence and domain conservation of the coelacanth Hsp40 and Hsp90 chaperones suggests conservation of function.

    PubMed

    Bishop, Özlem Tastan; Edkins, Adrienne Lesley; Blatch, Gregory Lloyd

    2014-09-01

    Molecular chaperones and their associated co-chaperones play an important role in preserving and regulating the active conformational state of cellular proteins. The chaperone complement of the Indonesian Coelacanth, Latimeria menadoensis, was elucidated using transcriptomic sequences. Heat shock protein 90 (Hsp90) and heat shock protein 40 (Hsp40) chaperones, and associated co-chaperones were focused on, and homologous human sequences were used to search the sequence databases. Coelacanth homologs of the cytosolic, mitochondrial and endoplasmic reticulum (ER) homologs of human Hsp90 were identified, as well as all of the major co-chaperones of the cytosolic isoform. Most of the human Hsp40s were found to have coelacanth homologs, and the data suggested that all of the chaperone machinery for protein folding at the ribosome, protein translocation to cellular compartments such as the ER and protein degradation were conserved. Some interesting similarities and differences were identified when interrogating human, mouse, and zebrafish homologs. For example, DnaJB13 is predicted to be a non-functional Hsp40 in humans, mouse, and zebrafish due to a corrupted histidine-proline-aspartic acid (HPD) motif, while the coelacanth homolog has an intact HPD. These and other comparisons enabled important functional and evolutionary questions to be posed for future experimental studies.

  15. Reconstruction of cyclooxygenase evolution in animals suggests variable, lineage-specific duplications, and homologs with low sequence identity.

    PubMed

    Havird, Justin C; Kocot, Kevin M; Brannock, Pamela M; Cannon, Johanna T; Waits, Damien S; Weese, David A; Santos, Scott R; Halanych, Kenneth M

    2015-04-01

    Cyclooxygenase (COX) enzymatically converts arachidonic acid into prostaglandin G/H in animals and has importance during pregnancy, digestion, and other physiological functions in mammals. COX genes have mainly been described from vertebrates, where gene duplications are common, but few studies have examined COX in invertebrates. Given the increasing ease in generating genomic data, as well as recent, although incomplete descriptions of potential COX sequences in Mollusca, Crustacea, and Insecta, assessing COX evolution across Metazoa is now possible. Here, we recover 40 putative COX orthologs by searching publicly available genomic resources as well as ~250 novel invertebrate transcriptomic datasets. Results suggest the common ancestor of Cnidaria and Bilateria possessed a COX homolog similar to those of vertebrates, although such homologs were not found in poriferan and ctenophore genomes. COX was found in most crustaceans and the majority of molluscs examined, but only specific taxa/lineages within Cnidaria and Annelida. For example, all octocorallians appear to have COX, while no COX homologs were found in hexacorallian datasets. Most species examined had a single homolog, although species-specific COX duplications were found in members of Annelida, Mollusca, and Cnidaria. Additionally, COX genes were not found in Hemichordata, Echinodermata, or Platyhelminthes, and the few previously described COX genes in Insecta lacked appreciable sequence homology (although structural analyses suggest these may still be functional COX enzymes). This analysis provides a benchmark for identifying COX homologs in future genomic and transcriptomic datasets, and identifies lineages for future studies of COX.

  16. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  17. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  18. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  19. Reduced representation genome sequencing suggests low diversity on the sex chromosomes of tonkean macaque monkeys.

    PubMed

    Evans, Ben J; Zeng, Kai; Esselstyn, Jacob A; Charlesworth, Brian; Melnick, Don J

    2014-09-01

    In species with separate sexes, social systems can differ in the relative variances of male versus female reproductive success. Papionin monkeys (macaques, mangabeys, mandrills, drills, baboons, and geladas) exhibit hallmarks of a high variance in male reproductive success, including a female-biased adult sex ratio and prominent sexual dimorphism. To explore the potential genomic consequences of such sex differences, we used a reduced representation genome sequencing approach to quantifying polymorphism at sites on autosomes and sex chromosomes of the tonkean macaque (Macaca tonkeana), a species endemic to the Indonesian island of Sulawesi. The ratio of nucleotide diversity of the X chromosome to that of the autosomes was less than the value (0.75) expected with a 1:1 sex ratio and no sex differences in the variance in reproductive success. However, the significance of this difference was dependent on which outgroup was used to standardize diversity levels. Using a new model that includes the effects of varying population size, sex differences in mutation rate between the autosomes and X chromosome, and GC-biased gene conversion (gBGC) or selection on GC content, we found that the maximum-likelihood estimate of the ratio of effective population size of the X chromosome to that of the autosomes was 0.68, which did not differ significantly from 0.75. We also found evidence for 1) a higher level of purifying selection on genic than nongenic regions, 2) gBGC or natural selection favoring increased GC content, 3) a dynamic demography characterized by population growth and contraction, 4) a higher mutation rate in males than females, and 5) a very low polymorphism level on the Y chromosome. These findings shed light on the population genomic consequences of sex differences in the variance in reproductive success, which appear to be modest in the tonkean macaque; they also suggest the occurrence of hitchhiking on the Y chromosome.

  20. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  1. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  2. Red Sea isolation history suggested by Plio-Pleistocene seismic reflection sequences

    NASA Astrophysics Data System (ADS)

    Mitchell, Neil C.; Ligi, Marco; Rohling, Eelco J.

    2015-11-01

    High evaporation rates in the desert climate of the Red Sea ensure that, during glacial sea level lowstands when water exchange with the Indian Ocean was more restricted, water salinity and δ18 O became unusually extreme. Modeling of the effect on Red Sea sedimentary δ18 O has been used previously to reconstruct relative sea level to 500 ka and now poses the question of whether that sea-level model could be extended if continuous core material of older sediment became available. We attempt to address this question here by examining seismic reflection data. The upper Pleistocene hemipelagic sediments in the Red Sea contain intervals of inorganic aragonite precipitated during supersaturated conditions of sea-level lowstands. Seismic impedance changes associated with boundaries to those aragonite-rich layers appear to explain seismic reflection sequences. A segment of Chirp sediment profiler data from the central Red Sea reveals prominent reflections at ∼1, ∼5, ∼23, ∼26 and ∼36 ms two-way travel time (TWT) from the seabed. Based on depths to the glacial marine isotope stages (MIS) in cores, we relate the upper three reflections to the tops of aragonite-rich layers and hence the sea level rises immediately following MIS 2, 6 and 12. The reflection at 26 ms is related to an unusually rapid fall into MIS 12 predicted by one sea level reconstruction, which may have created an abrupt lower boundary to the MIS 12 aragonite-rich layer. With the aid of seismogram modeling, we tentatively associate the ∼36 ms reflection with the top of an aragonite-rich layer formed during MIS 16. Furthermore, some segments of lower frequency (airgun and sparker) seismic data from the central and southern Red Sea show a lower (earlier) Plio-Pleistocene (PP) interval that is less reflective than the upper (late) PP interval. This implies less variability in sediment impedance and that extreme variability in water salinity did not develop; water exchange with the Indian Ocean

  3. Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

    NASA Astrophysics Data System (ADS)

    Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

    2000-02-01

    Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.

  4. Describing sequencing results of structural chromosome rearrangements with a suggested next-generation cytogenetic nomenclature.

    PubMed

    Ordulu, Zehra; Wong, Kristen E; Currall, Benjamin B; Ivanov, Andrew R; Pereira, Shahrin; Althari, Sara; Gusella, James F; Talkowski, Michael E; Morton, Cynthia C

    2014-05-01

    With recent rapid advances in genomic technologies, precise delineation of structural chromosome rearrangements at the nucleotide level is becoming increasingly feasible. In this era of "next-generation cytogenetics" (i.e., an integration of traditional cytogenetic techniques and next-generation sequencing), a consensus nomenclature is essential for accurate communication and data sharing. Currently, nomenclature for describing the sequencing data of these aberrations is lacking. Herein, we present a system called Next-Gen Cytogenetic Nomenclature, which is concordant with the International System for Human Cytogenetic Nomenclature (2013). This system starts with the alignment of rearrangement sequences by BLAT or BLAST (alignment tools) and arrives at a concise and detailed description of chromosomal changes. To facilitate usage and implementation of this nomenclature, we are developing a program designated BLA(S)T Output Sequence Tool of Nomenclature (BOSToN), a demonstrative version of which is accessible online. A standardized characterization of structural chromosomal rearrangements is essential both for research analyses and for application in the clinical setting.

  5. Protein Analysis of Sapienic Acid-Treated Porphyromonas gingivalis Suggests Differential Regulation of Multiple Metabolic Pathways

    PubMed Central

    Dawson, Deborah V.; Blanchette, Derek R.; Drake, David R.; Wertz, Philip W.; Brogden, Kim A.

    2015-01-01

    ABSTRACT Lipids endogenous to skin and mucosal surfaces exhibit potent antimicrobial activity against Porphyromonas gingivalis, an important colonizer of the oral cavity implicated in periodontitis. Our previous work demonstrated the antimicrobial activity of the fatty acid sapienic acid (C16:1Δ6) against P. gingivalis and found that sapienic acid treatment alters both protein and lipid composition from those in controls. In this study, we further examined whole-cell protein differences between sapienic acid-treated bacteria and untreated controls, and we utilized open-source functional association and annotation programs to explore potential mechanisms for the antimicrobial activity of sapienic acid. Our analyses indicated that sapienic acid treatment induces a unique stress response in P. gingivalis resulting in differential expression of proteins involved in a variety of metabolic pathways. This network of differentially regulated proteins was enriched in protein-protein interactions (P = 2.98 × 10−8), including six KEGG pathways (P value ranges, 2.30 × 10−5 to 0.05) and four Gene Ontology (GO) molecular functions (P value ranges, 0.02 to 0.04), with multiple suggestive enriched relationships in KEGG pathways and GO molecular functions. Upregulated metabolic pathways suggest increases in energy production, lipid metabolism, iron acquisition and processing, and respiration. Combined with a suggested preferential metabolism of serine, which is necessary for fatty acid biosynthesis, these data support our previous findings that the site of sapienic acid antimicrobial activity is likely at the bacterial membrane. IMPORTANCE P. gingivalis is an important opportunistic pathogen implicated in periodontitis. Affecting nearly 50% of the population, periodontitis is treatable, but the resulting damage is irreversible and eventually progresses to tooth loss. There is a great need for natural products that can be used to treat and/or prevent the overgrowth of

  6. Amino Acid Sequence of Human Cholinesterase

    DTIC Science & Technology

    1985-10-01

    liquid chromatography (HPLC). Activity testing of the aged, DFP-labeled cholinesterase showed that 99.8% of the active sites had been labeled, since...acids were quantitated by ninhydrin at the AAA Labs, or by derivatization with phenylisothiocyanate at the University of Michigan. The latter method

  7. Recovery of partial 16S rDNA sequences suggests the presence of Crenarchaeota in the human digestive ecosystem.

    PubMed

    Rieu-Lesme, Françoise; Delbès, Céline; Sollelis, Lauriane

    2005-11-01

    Human feces collected from 10 healthy teenagers was analyzed for the presence of Crenarchaeota. After a first polymerase chain reaction (PCR) with Archaea-specific primers, a nested real-time PCR was performed using Crenarchaeota-specific primers. Real-time Crenarchaeotal PCR products detected from four subjects were cloned and the sequencing revealed that most of the partial 16S rRNA gene sequences were highly similar (> or = 97% homology) to sequences affiliated to the Sulfolobus group of the Crenarchaeota phylum. Our findings suggest for the first time that Crenarchaeota might be present in the microbiota of the human digestive ecosystem in which this phylum has never been found yet.

  8. Monophyletic origin of Lake Victoria cichlid fishes suggested by mitochondrial DNA sequences.

    PubMed

    Meyer, A; Kocher, T D; Basasibwaki, P; Wilson, A C

    1990-10-11

    Lake Victoria, together with its satellite lakes, harbours roughly 200 endemic forms of cichlid fishes that are classified as 'haplochromines' and yet the lake system is less than a million years old. This 'flock' has attracted attention because of the possibility that it evolved within the lake from one ancestral species and that biologists are thus presented with a case of explosive evolution. Within the past decade, however, morphology has increasingly emphasized the view that the flock may be polyphyletic. We sequenced up to 803 base pairs of mitochondrial DNA from 14 representative Victorian species and 23 additional African species. The flock seems to be monophyletic, and is more akin to that from Lake Malawi than to species from Lake Tanganyika; in addition, it contains less genetic variation than does the human species, and there is virtually no sharing of mitochondrial DNA types among species. These results confirm that the founding event was recent.

  9. Cystatin. Amino acid sequence and possible secondary structure.

    PubMed Central

    Schwabe, C; Anastasi, A; Crow, H; McDonald, J K; Barrett, A J

    1984-01-01

    The amino acid sequence of cystatin, the protein from chicken egg-white that is a tight-binding inhibitor of many cysteine proteinases, is reported. Cystatin is composed of 116 amino acid residues, and the Mr is calculated to be 13 143. No striking similarity to any other known sequence has been detected. The results of computer analysis of the sequence and c.d. spectrometry indicate that the secondary structure includes relatively little alpha-helix (about 20%) and that the remainder is mainly beta-structure. PMID:6712597

  10. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.

  11. Integrated genome analysis suggests that most conserved non-coding sequences are regulatory factor binding sites

    PubMed Central

    Hemberg, Martin; Gray, Jesse M.; Cloonan, Nicole; Kuersten, Scott; Grimmond, Sean; Greenberg, Michael E.; Kreiman, Gabriel

    2012-01-01

    More than 98% of a typical vertebrate genome does not code for proteins. Although non-coding regions are sprinkled with short (<200 bp) islands of evolutionarily conserved sequences, the function of most of these unannotated conserved islands remains unknown. One possibility is that unannotated conserved islands could encode non-coding RNAs (ncRNAs); alternatively, unannotated conserved islands could serve as promoter-distal regulatory factor binding sites (RFBSs) like enhancers. Here we assess these possibilities by comparing unannotated conserved islands in the human and mouse genomes to transcribed regions and to RFBSs, relying on a detailed case study of one human and one mouse cell type. We define transcribed regions by applying a novel transcript-calling algorithm to RNA-Seq data obtained from total cellular RNA, and we define RFBSs using ChIP-Seq and DNAse-hypersensitivity assays. We find that unannotated conserved islands are four times more likely to coincide with RFBSs than with unannotated ncRNAs. Thousands of conserved RFBSs can be categorized as insulators based on the presence of CTCF or as enhancers based on the presence of p300/CBP and H3K4me1. While many unannotated conserved RFBSs are transcriptionally active to some extent, the transcripts produced tend to be unspliced, non-polyadenylated and expressed at levels 10 to 100-fold lower than annotated coding or ncRNAs. Extending these findings across multiple cell types and tissues, we propose that most conserved non-coding genomic DNA in vertebrate genomes corresponds to promoter-distal regulatory elements. PMID:22684627

  12. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens.

    PubMed

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated.

  13. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens

    PubMed Central

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131

  14. Large scale mitochondrial sequencing in Mexican Americans suggests a reappraisal of Native American origins

    PubMed Central

    2011-01-01

    Background The Asian origin of Native Americans is largely accepted. However uncertainties persist regarding the source population(s) within Asia, the divergence and arrival time(s) of the founder groups, the number of expansion events, and migration routes into the New World. mtDNA data, presented over the past two decades, have been used to suggest a single-migration model for which the Beringian land mass plays an important role. Results In our analysis of 568 mitochondrial genomes, the coalescent age estimates of shared roots between Native American and Siberian-Asian lineages, calculated using two different mutation rates, are A4 (27.5 ± 6.8 kya/22.7 ± 7.4 kya), C1 (21.4 ± 2.7 kya/16.4 ± 1.5 kya), C4 (21.0 ± 4.6 kya/20.0 ± 6.4 kya), and D4e1 (24.1 ± 9.0 kya/17.9 ± 10.0 kya). The coalescent age estimates of pan-American haplogroups calculated using the same two mutation rates (A2:19.5 ± 1.3 kya/16.1 ± 1.5 kya, B2:20.8 ± 2.0 kya/18.1 ± 2.4 kya, C1:21.4 ± 2.7 kya/16.4 ± 1.5 kya and D1:17.2 ± 2.0 kya/14.9 ± 2.2 kya) and estimates of population expansions within America (~21-16 kya), support the pre-Clovis occupation of the New World. The phylogeography of sublineages within American haplogroups A2, B2, D1 and the C1b, C1c andC1d subhaplogroups of C1 are complex and largely specific to geographical North, Central and South America. However some sub-branches (B2b, C1b, C1c, C1d and D1f) already existed in American founder haplogroups before expansion into the America. Conclusions Our results suggest that Native American founders diverged from their Siberian-Asian progenitors sometime during the last glacial maximum (LGM) and expanded into America soon after the LGM peak (~20-16 kya). The phylogeography of haplogroup C1 suggest that this American founder haplogroup differentiated in Siberia-Asia. The situation is less clear for haplogroup B2, however haplogroups A2 and D1 may have differentiated soon after the Native American founders divergence. A

  15. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences

    PubMed Central

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D.; Adir, Noam

    2016-01-01

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442

  16. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.

  17. Analysis of EST sequences suggests recent origin of allotetraploid colonial and creeping bentgrasses.

    PubMed

    Rotter, David; Bharti, Arvind K; Li, Huaijun Michael; Luo, Chongyuan; Bonos, Stacy A; Bughrara, Suleiman; Jung, Geunhwa; Messing, Joachim; Meyer, William A; Rudd, Stephen; Warnke, Scott E; Belanger, Faith C

    2007-08-01

    Advances in plant genomics have permitted the analysis of several members of the grass family, including the major domesticated species, and provided new insights into the evolution of the major crops on earth. Two members, colonial bentgrass (Agrostis capillaris L.) and creeping bentgrass (A. stolonifera L.) have only recently been domesticated and provide an interesting case of polyploidy and comparison to crops that have undergone human selection for thousands of years. As an initial step of characterizing these genomes, we have sampled roughly 10% of their gene content, thereby also serving as a starting point for the construction of their physical and genetic maps. Sampling mRNA from plants subjected to environmental stress showed a remarkable increase in transcription of transposable elements. Both colonial and creeping bentgrass are allotetraploids and are considered to have one genome in common, designated the A2 genome. Analysis of conserved genes present among the ESTs suggests the colonial and creeping bentgrass A2 genomes diverged from a common ancestor approximately 2.2 million years ago (MYA), thereby providing an enhanced evolutionary zoom in respect to the origin of maize, which formed 4.8 MYA, and tetraploid wheat, which formed only 0.5 MYA and is the progenitor of domesticated hexaploid wheat.

  18. Sequence divergence and diversity suggests ongoing functional diversification of vertebrate NAD metabolism.

    PubMed

    Gossmann, Toni I; Ziegler, Mathias

    2014-11-01

    NAD is not only an important cofactor in redox reactions but has also received attention in recent years because of its physiological importance in metabolic regulation, DNA repair and signaling. In contrast to the redox reactions, these regulatory processes involve degradation of NAD and therefore necessitate a constant replenishment of its cellular pool. NAD biosynthetic enzymes are common to almost all species in all clades, but the number of NAD degrading enzymes varies substantially across taxa. In particular, vertebrates, including humans, have a manifold of NAD degrading enzymes which require a high turnover of NAD. As there is currently a lack of a systematic study of how natural selection has shaped enzymes involved in NAD metabolism we conducted a comprehensive evolutionary analysis based on intraspecific variation and interspecific divergence. We compare NAD biosynthetic and degrading enzymes in four eukaryotic model species and subsequently focus on human NAD metabolic enzymes and their orthologs in other vertebrates. We find that the majority of enzymes involved in NAD metabolism are subject to varying levels of purifying selection. While NAD biosynthetic enzymes appear to experience a rather high level of evolutionary constraint, there is evidence for positive selection among enzymes mediating NAD-dependent signaling. This is particularly evident for members of the PARP family, a diverse protein family involved in DNA damage repair and programmed cell death. Based on haplotype information and substitution rate analysis we pinpoint sites that are potential targets of positive selection. We also link our findings to a three dimensional structure, which suggests that positive selection occurs in domains responsible for DNA binding and polymerization rather than the NAD catalytic domain. Taken together, our results indicate that vertebrate NAD metabolism is still undergoing functional diversification.

  19. Amino acid sequences of proteins from Leptospira serovar pomona.

    PubMed

    Alves, S F; Lefebvre, R B; Probert, W

    2000-01-01

    This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  20. Amino acid sequence of porcine spleen cathepsin D.

    PubMed Central

    Shewale, J G; Tang, J

    1984-01-01

    The amino acid sequence of porcine spleen cathepsin D heavy chain has been determined and, hence, the complete structure of this enzyme is now known. The sequence of heavy chain was constructed by aligning the structures of peptides generated by cyanogen bromide, trypsin, and endo-proteinase Lys C cleavages. The structure of the light chain has been published previously. The cathepsin D molecule contains 339 amino acid residues in two polypeptide chains: a 97-residue light chain and a 242-residue heavy chain, with a combined Mr of 36,779 (without carbohydrate). There are two carbohydrate units linked to asparagine residues 70 and 192. The disulfide bond arrangement in cathepsin D is probably similar to that of pepsin, because the positions of six half-cystine residues are conserved. The active site aspartyl residues, corresponding to aspartic acid-32 and -215 of pepsin, are located at residues 33 and 224 in the cathepsin D molecule. The amino acid sequence around these aspartyl residues is strongly conserved. Cathepsin D shows a strong homology with other acid proteases. When the sequence of cathepsin D, renin, and pepsin are aligned, 32.7% of the residues are identical. The homology is observed throughout the length of the molecules, indicating that three-dimensional structures of all three molecules are similar. PMID:6587385

  1. Stable isotope and signature fatty acid analyses suggest reef manta rays feed on demersal zooplankton.

    PubMed

    Couturier, Lydie I E; Rohner, Christoph A; Richardson, Anthony J; Marshall, Andrea D; Jaine, Fabrice R A; Bennett, Michael B; Townsend, Kathy A; Weeks, Scarla J; Nichols, Peter D

    2013-01-01

    Assessing the trophic role and interaction of an animal is key to understanding its general ecology and dynamics. Conventional techniques used to elucidate diet, such as stomach content analysis, are not suitable for large threatened marine species. Non-lethal sampling combined with biochemical methods provides a practical alternative for investigating the feeding ecology of these species. Stable isotope and signature fatty acid analyses of muscle tissue were used for the first time to examine assimilated diet of the reef manta ray Manta alfredi, and were compared with different zooplankton functional groups (i.e. near-surface zooplankton collected during manta ray feeding events and non-feeding periods, epipelagic zooplankton, demersal zooplankton and several different zooplankton taxa). Stable isotope δ(15)N values confirmed that the reef manta ray is a secondary consumer. This species had relatively high levels of docosahexaenoic acid (DHA) indicating a flagellate-based food source in the diet, which likely reflects feeding on DHA-rich near-surface and epipelagic zooplankton. However, high levels of ω6 polyunsaturated fatty acids and slightly enriched δ(13)C values in reef manta ray tissue suggest that they do not feed solely on pelagic zooplankton, but rather obtain part of their diet from another origin. The closest match was with demersal zooplankton, suggesting it is an important component of the reef manta ray diet. The ability to feed on demersal zooplankton is likely linked to the horizontal and vertical movement patterns of this giant planktivore. These new insights into the habitat use and feeding ecology of the reef manta ray will assist in the effective evaluation of its conservation needs.

  2. Stable Isotope and Signature Fatty Acid Analyses Suggest Reef Manta Rays Feed on Demersal Zooplankton

    PubMed Central

    Couturier, Lydie I. E.; Rohner, Christoph A.; Richardson, Anthony J.; Marshall, Andrea D.; Jaine, Fabrice R. A.; Bennett, Michael B.; Townsend, Kathy A.; Weeks, Scarla J.; Nichols, Peter D.

    2013-01-01

    Assessing the trophic role and interaction of an animal is key to understanding its general ecology and dynamics. Conventional techniques used to elucidate diet, such as stomach content analysis, are not suitable for large threatened marine species. Non-lethal sampling combined with biochemical methods provides a practical alternative for investigating the feeding ecology of these species. Stable isotope and signature fatty acid analyses of muscle tissue were used for the first time to examine assimilated diet of the reef manta ray Manta alfredi, and were compared with different zooplankton functional groups (i.e. near-surface zooplankton collected during manta ray feeding events and non-feeding periods, epipelagic zooplankton, demersal zooplankton and several different zooplankton taxa). Stable isotope δ15N values confirmed that the reef manta ray is a secondary consumer. This species had relatively high levels of docosahexaenoic acid (DHA) indicating a flagellate-based food source in the diet, which likely reflects feeding on DHA-rich near-surface and epipelagic zooplankton. However, high levels of ω6 polyunsaturated fatty acids and slightly enriched δ13C values in reef manta ray tissue suggest that they do not feed solely on pelagic zooplankton, but rather obtain part of their diet from another origin. The closest match was with demersal zooplankton, suggesting it is an important component of the reef manta ray diet. The ability to feed on demersal zooplankton is likely linked to the horizontal and vertical movement patterns of this giant planktivore. These new insights into the habitat use and feeding ecology of the reef manta ray will assist in the effective evaluation of its conservation needs. PMID:24167562

  3. Active site amino acid sequence of human factor D.

    PubMed

    Davis, A E

    1980-08-01

    Factor D was isolated from human plasma by chromatography on CM-Sephadex C50, Sephadex G-75, and hydroxylapatite. Digestion of reduced, S-carboxymethylated factor D with cyanogen bromide resulted in three peptides which were isolated by chromatography on Sephadex G-75 (superfine) equilibrated in 20% formic acid. NH2-Terminal sequences were determined by automated Edman degradation with a Beckman 890C sequencer using a 0.1 M Quadrol program. The smallest peptide (CNBr III) consisted of the NH2-terminal 14 amino acids. The other two peptides had molecular weights of 17,000 (CNBr I) and 7000 (CNBr II). Overlap of the NH2-terminal sequence of factor D with the NH2-terminal sequence of CNBr I established the order of the peptides. The NH2-terminal 53 residues of factor D are somewhat more homologous with the group-specific protease of rat intestine than with other serine proteases. The NH2-terminal sequence of CNBr II revealed the active site serine of factor D. The typical serine protease active site sequence (Gly-Asp-Ser-Gly-Gly-Pro was found at residues 12-17. The region surrounding the active site serine does not appear to be more highly homologous with any one of the other serine proteases. The structural data obtained point out the similarities between factor D and the other proteases. However, complete definition of the degree of relationship between factor D and other proteases will require determination of the remainder of the primary structure.

  4. The amino acid sequence of iguana (Iguana iguana) pancreatic ribonuclease.

    PubMed

    Zhao, W; Beintema, J J; Hofsteenge, J

    1994-01-15

    The pyrimidine-specific ribonuclease superfamily constitutes a group of homologous proteins so far found only in higher vertebrates. Four separate families are found in mammals, which have resulted from gene duplications in mammalian ancestors. To learn more about the evolutionary history of this superfamily, the primary structure and other characteristics of the pancreatic enzyme from iguana (Iguana iguana), a herbivorous lizard species belonging to the reptiles, have been determined. The polypeptide chain consists of 119 amino acid residues. The positions of insertions and deletions in the sequence are identical to those in the enzyme from snapping turtle. However, the two enzymes differ at 54% of the amino acid positions. Iguana ribonuclease contains no carbohydrate, although the enzyme possesses three recognition sites for carbohydrate attachment, and has a high number of acidic residues in a localized part of the sequence.

  5. Amino acid sequence and comparative antigenicity of chicken metallothionein.

    PubMed Central

    McCormick, C C; Fullmer, C S; Garvey, J S

    1988-01-01

    The complete amino acid sequence of metallothionein (MT) from chicken liver is reported. The primary structure was determined by automated sequence analysis of peptides produced by limited acid hydrolysis and by trypsin digestion. The comparative antigenicity of chicken MT was determined by radioimmunoassay using rabbit anti-rat MT polyclonal antibody. Chicken MT consists of 63 amino acids as compared to 61 found in MTs from mammals. One insertion (and two substitutions) occurs in the amino-terminal region, a region considered invariant among mammalian MTs. Eighteen of the 20 cysteines in chicken MT were aligned with cysteines from other mammalian sequences. Two cysteines near the carboxyl terminus are shifted by one residue due to the insertion of proline in that region. Overall, the chicken protein showed approximately equal to 68% sequence identity in a comparison with various mammalian MTs. The affinity of the polyclonal antibody for chicken MT was decreased by 2 orders of magnitude in comparison to that of a mammalian MT (rat MT isoforms). This reduced affinity is attributed to major substitutions in chicken MT in the regions of the principal determinants of mammalian MTs. Theoretical analysis of the primary structure predicted the secondary structure to consist of reverse turns and random coils with no stable beta or helix conformations. There is no evidence that chicken MT differs functionally from mammalian MTs. PMID:2448773

  6. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  7. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  8. Studies on naturally occurring infectious bursal disease viruses suggest that a single amino acid substitution at position 253 in VP2 increases pathogenicity.

    PubMed

    Jackwood, D J; Sreedevi, B; LeFever, L J; Sommer-Wagner, S E

    2008-07-20

    Three classic IBDV strains were previously isolated from commercial layer chicken flocks and shown to be phylogenetically related to vaccine strains but pathogenic in susceptible chickens. In this study, their viral genomes were sequenced and compared to sequences of vaccines being used in those flocks. The vaccine strains examined were sequenced directly from the manufacturer and had identical genome segment B sequences. Compared to these vaccines, the GA-1, H-30 and CS-2-35 isolates each had one silent mutation in the gene that encodes VP1. Compared to the two vaccines used at the time CS-2-35 was isolated, the segment A sequence of CS-2-35 contained numerous nucleotide and amino acid mutations suggesting the CS-2-35 virus was not closely related to these vaccines. This virus however did have amino acid mutations in VP2 that are reported to be necessary for replication in cell culture and lacked two of the three amino acid mutations previously shown to be necessary for virulence. These data suggest that CS-2-35 was a descendant from an attenuated strain of IBDV. When the segment A genomic sequences of the GA-1 and H-30 viruses were compared to the vaccines being used in those flocks they were most closely related to the attenuated D78 vaccine strain. In genome segment A, three nucleotide mutations in GA-1 and four in H-30 were observed compared to the D78 classic vaccine. These nucleotide mutations caused one amino acid (H253N) change in the GA-1 virus and two amino acids (H253Q and G259D) were different in the H-30 virus. In addition, both the GA-1 and H-30 viruses had the amino acid G76 in VP2 that appears to be unique to the vaccine D78. The data suggest that GA-1 and H-30 are genetically related and have a common ancestor even though they were isolated from geographically distant flocks. The evidence also suggests that GA-1, H-30 and CS-2-35 could be reversions from attenuated vaccine viruses or by coincidence genetically resemble classic IBDV vaccines. It

  9. Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

    PubMed Central

    Sinclair, Robert M.; Ravantti, Janne J.

    2017-01-01

    ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids

  10. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  11. Trichomonas vaginalis acidic phospholipase A2: isolation and partial amino acid sequence.

    PubMed

    Escobedo-Guajardo, Brenda L; González-Salazar, Francisco; Palacios-Corona, Rebeca; Torres de la Cruz, Víctor M; Morales-Vallarta, Mario; Mata-Cárdenas, Benito D; Garza-González, Jesús N; Rivera-Silva, Gerardo; Vargas-Villarreal, Javier

    2013-12-01

    Sexually transmitted diseases are a major cause of acute disease worldwide, and trichomoniasis is the most common and curable disease, generating more than 170 million cases annually worldwide. Trichomonas vaginalis is the causal agent of trichomoniasis and has the ability to destroy in vitro cell monolayers of the vaginal mucosa, where the phospholipases A2 (PLA2) have been reported as potential virulence factors. These enzymes have been partially characterized from the subcellular fraction S30 of pathogenic T. vaginalis strains. The main objective of this study was to purify a phospholipase A2 from T. vaginalis, make a partial characterization, obtain a partial amino acid sequence, and determine its enzymatic participation as hemolytic factor causing lysis of erythrocytes. Trichomonas S30, RF30 and UFF30 sub-fractions from GT-15 strain have the capacity to hydrolyze [2-(14)C-PA]-PC at pH 6.0. Proteins from the UFF30 sub-fraction were separated by affinity chromatography into two eluted fractions with detectable PLA A2 activity. The EDTA-eluted fraction was analyzed by HPLC using on-line HPLC-tandem mass spectrometry and two protein peaks were observed at 8.2 and 13 kDa. Peptide sequences were identified from the proteins present in the eluted EDTA UFF30 fraction; bioinformatic analysis using Protein Link Global Server charged with T. vaginalis protein database suggests that eluted peptides correspond a putative ubiquitin protein in the 8.2 kDa fraction and a phospholipase preserved in the 13 kDa fraction. The EDTA-eluted fraction hydrolyzed [2-(14)C-PA]-PC lyses erythrocytes from Sprague-Dawley in a time and dose-dependent manner. The acidic hemolytic activity decreased by 84% with the addition of 100 μM of Rosenthal's inhibitor.

  12. The complementary deoxyribonucleic acid sequence of guinea pig endometrial prorelaxin.

    PubMed

    Lee, Y A; Bryant-Greenwood, G D; Mandel, M; Greenwood, F C

    1992-03-01

    The nucleotide sequence of the relaxin gene transcript in the endometrium of the late pregnant guinea pig has been determined. The strategy used was a combination of polymerase chain reaction (PCR) with primers designed from the mRNA sequence of porcine preprorelaxin, rapid amplification of cDNA ends-PCR, and blunt end cloning in M13 mp18. With heterologous primers, a 226-basepair (bp) segment of the guinea pig relaxin gene sequence was obtained and was used to design a guinea pig-specific primer for use with the rapid amplification of cDNA ends-PCR method. The latter allowed completion of the sequence of 336 bp, with a 96-bp overlap. The sequence obtained shows greater homology at both the nucleotide and amino acid levels with porcine and human relaxins H1 and H2 than with rat relaxin, supporting the thesis that the guinea pig is not a rodent. The transcription of the guinea pig endometrial relaxin gene during pregnancy was confirmed by Northern analysis of guinea pig endometrial tissues with a species-specific cDNA probe. The endometrial relaxin gene is transcribed during pregnancy, but not in lactation, consistent with the observed immunostaining for relaxin.

  13. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  14. A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly

    PubMed Central

    2013-01-01

    Background The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. Results We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. Conclusions These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies. PMID:23496952

  15. First complete genome sequence of European turkey coronavirus suggests complex recombination history related with US turkey and guinea fowl coronaviruses.

    PubMed

    Brown, P A; Touzain, F; Briand, F X; Gouilh, A M; Courtillon, C; Allée, C; Lemaitre, E; De Boisséson, C; Blanchard, Y; Eterradossi, N

    2016-01-01

    A full-length genome sequence of 27,739  nt was determined for the only known European turkey coronavirus (TCoV) isolate. In general, the order, number and size of ORFs were consistent with other gammacoronaviruses. Three points of recombination were predicted, one towards the end of 1a, a second in 1b just upstream of S and a third in 3b. Phylogenetic analysis of the four regions defined by these three points supported the previous notion that European and American viruses do indeed have different evolutionary pathways. Very close relationships were revealed between the European TCoV and the European guinea fowl coronavirus in all regions except one, and both were shown to be closely related to the European infectious bronchitis virus (IBV) Italy 2005. None of these regions of sequence grouped European and American TCoVs. The region of sequence containing the S gene was unique in grouping all turkey and guinea fowl coronaviruses together, separating them from IBVs. Interestingly the French guinea fowl virus was more closely related to the North American viruses. These data demonstrate that European turkey and guinea fowl coronaviruses share a common genetic backbone (most likely an ancestor of IBV Italy 2005) and suggest that this recombined in two separate events with different, yet related, unknown avian coronaviruses, acquiring their S-3a genes. The data also showed that the North American viruses do not share a common backbone with European turkey and guinea fowl viruses; however, they do share similar S-3a genes with guinea fowl virus.

  16. Analysis of DNA haplotypes suggests a genetic predisposition to trisomy 21 associated with DNA sequences on chromosome 21.

    PubMed Central

    Antonarakis, S E; Kittur, S D; Metaxotou, C; Watkins, P C; Patel, A S

    1985-01-01

    To test the hypothesis that there is a genetic predisposition to nondisjunction and trisomy 21 associated with DNA sequences on chromosome 21, we used DNA polymorphism haplotypes for chromosomes 21 to examine the distribution of different chromosomes 21 in Down syndrome and control families from the same ethnic group. The chromosomes 21 from 20 Greek families with a Down syndrome child and 27 control Greek families have been examined for DNA polymorphism haplotypes by using four common polymorphic sites adjacent to two closely linked single-copy DNA sequences (namely pW228C and pW236B), which map somewhere near the proximal long arm of chromosome 21. Three haplotypes, +, +---, and - with respective frequencies of 43/108, 24/108, and 23/108, account for the majority of chromosomes 21 in the control families. However, haplotype - was found to be much more commonly associated with chromosomes 21 that underwent nondisjunction in the Down syndrome families (frequency of 21/50; X2 for the two distributions is 9.550; P = 0.023; degrees of freedom, 3). The two populations (control and trisomic families) did not differ in the distribution of haplotypes for two DNA polymorphisms on chromosome 17. The data from this initial study suggest that the chromosome 21, which is marked in Greeks with haplotype - for the four above described polymorphic sites, is found more commonly in chromosomes that participate in nondisjunction than in controls. We propose an increased tendency for nondisjunction due to DNA sequences associated with a subset of chromosomes 21 bearing this haplotype. Images PMID:2987923

  17. Intron sequences of arginine kinase in an intertidal snail suggest an ecotype-specific selective sweep and a gene duplication

    PubMed Central

    Kemppainen, P; Lindskog, T; Butlin, R; Johannesson, K

    2011-01-01

    Many species with restricted gene flow repeatedly respond similarly to local selection pressures. To fully understand the genetic mechanisms behind this process, the phylogeographic history of the species (inferred from neutral markers) as well as the loci under selection need to be known. Here we sequenced an intron in the arginine kinase gene (Ark), which shows strong clinal variation between two locally adapted ecotypes of the flat periwinkle, Littorina fabalis. The ‘small-sheltered' ecotype was almost fixed for one haplotype, H1, in populations on both sides of the North Sea, unlike the ‘large-moderately exposed ecotype', which segregated for ten different haplotypes. This contrasts with neutral markers, where the two ecotypes are equally variable. H1 could have been driven to high frequency in an ancestral population and then repeatedly spread to sheltered habitats due to local selection pressures with the colonization of both sides of the North Sea, after the last glacial maximum (∼18 000 years ago). An alternative explanation is that a positively selected mutation, in or linked to Ark, arose after the range expansion and secondarily spread through sheltered populations throughout the distribution range, causing this ecotype to evolve in a concerted fashion. Also, we were able to sequence up to four haplotypes consistently from some individuals, suggesting a gene duplication in Ark. PMID:20877396

  18. What Is Peromyscus? Evidence from nuclear and mitochondrial DNA sequences suggests the need for a new classification

    PubMed Central

    Platt, Roy N.; Amman, Brian R.; Keith, Megan S.; Thompson, Cody W.; Bradley, Robert D.

    2015-01-01

    The evolutionary relationships between Peromyscus, Habromys, Isthmomys, Megadontomys, Neotomodon, Osgoodomys, and Podomys are poorly understood. In order to further explore the evolutionary boundaries of Peromyscus and compare potential taxonomic solutions for this diverse group and its relatives, we conducted phylogenetic analyses of DNA sequence data from alcohol dehydrogenase (Adh1-I2), beta fibrinogen (Fgb-I7), interphotoreceptor retinoid-binding protein (Rbp3), and cytochrome-b (Cytb). Phylogenetic analyses of mitochondrial and nuclear genes produced similar topologies although levels of nodal support varied. The best-supported topology was obtained by combining nuclear and mitochondrial sequences. No monophyletic Peromyscus clade was supported. Instead, support was found for a clade containing Habromys, Megadontomys, Neotomodon, Osgoodomys, Podomys, and Peromyscus suggesting paraphyly of Peromyscus and confirming previous observations. Our analyses indicated an early divergence of Isthmomys from Peromyscus (approximately 8 million years ago), whereas most other peromyscine taxa emerged within the last 6 million years. To recover a monophyletic taxonomy from Peromyscus and affiliated lineages, we detail 3 taxonomic options in which Habromys, Megadontomys, Neotomodon, Osgoodomys, and Podomys are retained as genera, subsumed as subgenera, or subsumed as species groups within Peromyscus. Each option presents distinct taxonomic challenges, and the appropriate taxonomy must reflect the substantial levels of morphological divergence that characterize this group while maintaining the monophyletic relationships obtained from genetic data. PMID:26937047

  19. What Is Peromyscus? Evidence from nuclear and mitochondrial DNA sequences suggests the need for a new classification.

    PubMed

    Platt, Roy N; Amman, Brian R; Keith, Megan S; Thompson, Cody W; Bradley, Robert D

    2015-08-03

    The evolutionary relationships between Peromyscus, Habromys, Isthmomys, Megadontomys, Neotomodon, Osgoodomys, and Podomys are poorly understood. In order to further explore the evolutionary boundaries of Peromyscus and compare potential taxonomic solutions for this diverse group and its relatives, we conducted phylogenetic analyses of DNA sequence data from alcohol dehydrogenase (Adh1-I2), beta fibrinogen (Fgb-I7), interphotoreceptor retinoid-binding protein (Rbp3), and cytochrome-b (Cytb). Phylogenetic analyses of mitochondrial and nuclear genes produced similar topologies although levels of nodal support varied. The best-supported topology was obtained by combining nuclear and mitochondrial sequences. No monophyletic Peromyscus clade was supported. Instead, support was found for a clade containing Habromys, Megadontomys, Neotomodon, Osgoodomys, Podomys, and Peromyscus suggesting paraphyly of Peromyscus and confirming previous observations. Our analyses indicated an early divergence of Isthmomys from Peromyscus (approximately 8 million years ago), whereas most other peromyscine taxa emerged within the last 6 million years. To recover a monophyletic taxonomy from Peromyscus and affiliated lineages, we detail 3 taxonomic options in which Habromys, Megadontomys, Neotomodon, Osgoodomys, and Podomys are retained as genera, subsumed as subgenera, or subsumed as species groups within Peromyscus. Each option presents distinct taxonomic challenges, and the appropriate taxonomy must reflect the substantial levels of morphological divergence that characterize this group while maintaining the monophyletic relationships obtained from genetic data.

  20. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca/sup 2 +/- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B/sub 4/. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the /sup 32/P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A/sub 4/ hydrolase or Ca/sup 2 +/-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approx. = 2700 nucleotides in leukocytes, lung, and placenta.

  1. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  2. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  3. The amino acid sequence of rabbit cardiac troponin I.

    PubMed Central

    Grand, R J; Wilkinson, J M

    1976-01-01

    The complete amino acid sequence of troponin I from rabbit cardiac muscle was determined by the isolation of four unique CNBr fragments, together with overlapping tryptic peptides containing radioactive methionine residues. Overlap data for residues 35-36, 93-94 and 140-145 are incomplete, the sequence at these positions being based on homology with the sequence of the fast-skeletal-muscle protein. Cardiac troponin I is a single polypeptide chain of 206 residues with mol.wt. 23550 and an extinction coefficient, E 1%,1cm/280, of 4.37. The protein has a net positive charge of 14 and is thus somewhat more basic than troponin I from fast-skeletal muscle. Comparison of the sequences of troponin I from cardiac and fast skeletal muscle show that the cardiac protein has 26 extra residues at the N-terminus which account for the larger size of the protein. In the remainder of sequence there is a considerable degree of homology, this being greater in the C-terminal two-thirds of the molecule. The region in the cardiac protein corresponding to the peptide with inhibitory activity from the fast-skeletal-muscle protein is very similar and it seems unlikely that this is the cause of the difference in inhibitory activity between the two proteins. The region responsible for binding troponin C, however, possesses a lower degree of homology. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50072 (20 pages), at the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1976) 153, 5. PMID:1008822

  4. Amino acid sequence of a mouse immunoglobulin mu chain.

    PubMed Central

    Kehry, M; Sibley, C; Fuhrman, J; Schilling, J; Hood, L E

    1979-01-01

    The complete amino acid sequence of the mouse mu chain from the BALB/c myeloma tumor MOPC 104E is reported. The C mu region contains four consecutive homology regions of approximately 110 residues and a COOH-terminal region of 19 residues. A comparison of this mu chain from mouse with a complete mu sequence from human (Ou) and a partial mu chain sequence from dog (Moo) reveals a striking gradient of increasing homology from the NH2-terminal to the COOH-terminal portion of these mu chains, with the former being the least and the latter the most highly conserved. Four of the five sites of carbohydrate attachment appear to be at identical residue positions when the constant regions of the mouse and human mu chains are compared. The mu chain of MOPC 104E has a carbohydrate moiety attached in the second hypervariable region. This is particularly interesting in view of the fact that MOPC 104E binds alpha-(1 leads to 3)-dextran, a simple carbohydrate. The structural and functional constraints imposed by these comparative sequence analyses are discussed. PMID:111247

  5. Mathematical Models Suggest Facilitated Fatty Acids Crossing of the Luminal Membrane in the Cardiac Muscle.

    PubMed

    Barta, Efrath

    2017-02-01

    Long-chain fatty acids cross a few membranes on their way from the capillary blood to the cardiomyocyte cytosol, where they are utilized as an essential source of energy. Details of the transport mechanism across those membranes remained elusive despite decades of laboratory and theoretical work. Here we inspect several optional scenarios for the crossing of the luminal membrane of the endothelial cell, the first barrier that should be crossed: a passive diffusion, facilitation by receptors for albumin and facilitation by fatty acids transporters. Related measured rate constants are incorporated in a theoretical simulation that is based on reaction-diffusion equations. Asymptotic analytical solutions for the resulting stiff boundary value problems are formulated based on singular perturbations theory. We conclude that a passive diffusion has to be supplemented with facilitation mechanisms in order to meet energy requirements. Binding sites for albumin, scattered on the membrane face, might enhance the flux provided that they internalize the captured fatty acids and speed up the dissociation of the albumin-fatty acids complex. As such enhancement is moderate, another mechanism seems to be essential for an adequate supply of fatty acids. Lack of experimental data prohibits us from computing the quantitative effect of membrane fatty acids transporters but their involvement in the membrane crossing is inferred.

  6. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein

    PubMed Central

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer. PMID:24147211

  7. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    PubMed

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  8. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE`s Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  9. Docking simulations suggest that all- trans retinoic acid could bind to retinoid X receptors

    NASA Astrophysics Data System (ADS)

    Tsuji, Motonori; Shudo, Koichi; Kagechika, Hiroyuki

    2015-10-01

    Retinoid X receptors (RXRs) are ligand-controlled transcription factors which heterodimerize with other nuclear receptors to regulate gene transcriptions associated with crucial biological events. 9- cis retinoic acid (9cRA), which transactivates RXRs, is believed to be an endogenous RXR ligand. All- trans retinoic acid (ATRA) is a natural ligand for retinoic acid receptors (RARs), which heterodimerize with RXRs. Although the concentration of 9cRA in tissues is very low, ATRA is relatively abundant and some reports show that ATRA activates RXRs. We computationally studied the possibility of ATRA binding to RXRs using two different docking methods with our developed programs to assess the binding affinities of naturally occurring retinoids. The simulations showed good correlations to the reported binding affinities of these molecules for RXRs and RARs.

  10. SUBGROUPS OF AMINO ACID SEQUENCES IN THE VARIABLE REGIONS OF IMMUNOGLOBULIN HEAVY CHAINS*

    PubMed Central

    Cunningham, Bruce A.; Pflumm, Mollie N.; User, Urs Rutisha; Edelman, Gerald M.

    1969-01-01

    The amino acid sequence of the first 133 residues of the heavy (γ) chain from a human γG immunoglobulin (He) has been determined. This γ-chain is identical in Gm type to that of protein Eu, the complete sequence of which has been reported. Comparison of the two sequences substantiates the previous suggestion that there are subgroups of variable regions of heavy chains. The variable region of Eu has been assigned to subgroup I and that of He to subgroup II; on the other hand, the constant regions of the two proteins appear to be identical. Comparison of the sequence of the heavy chain of He with the heavy chain sequences determined in other laboratories suggests that the variable region of subgroup II is at least 118 residues long. The nature and distribution of amino acid variations in this heavy chain subgroup resemble those observed in light chain subgroups. These studies provide evidence that the translocation hypothesis applies to heavy as well as to light chains, viz., genes for variable regions (V) are somatically translocated to genes for constant regions (C) to form complete VC structural genes. Images PMID:5264153

  11. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    PubMed Central

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. Images PMID:6589619

  12. Structural gene and complete amino acid sequence of Vibrio alginolyticus collagenase.

    PubMed Central

    Takeuchi, H; Shibano, Y; Morihara, K; Fukushima, J; Inami, S; Keil, B; Gilles, A M; Kawamoto, S; Okuda, K

    1992-01-01

    The DNA encoding the collagenase of Vibrio alginolyticus was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited both collagenase antigen and collagenase activity. The open reading frame from the ATG initiation codon was 2442 bp in length for the collagenase structural gene. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature collagenase consists of 739 amino acids with an Mr of 81875. The amino acid sequences of 20 polypeptide fragments were completely identical with the deduced amino acid sequences of the collagenase gene. The amino acid composition predicted from the DNA sequence was similar to the chemically determined composition of purified collagenase reported previously. The analyses of both the DNA and amino acid sequences of the collagenase gene were rigorously performed, but we could not detect any significant sequence similarity to other collagenases. Images Fig. 2. PMID:1311172

  13. Binding of [alpha, alpha]-Disubstituted Amino Acids to Arginase Suggests New Avenues for Inhibitor Design

    SciTech Connect

    Ilies, Monica; Di Costanzo, Luigi; Dowling, Daniel P.; Thorn, Katherine J.; Christianson, David W.

    2011-10-21

    Arginase is a binuclear manganese metalloenzyme that hydrolyzes L-arginine to form L-ornithine and urea, and aberrant arginase activity is implicated in various diseases such as erectile dysfunction, asthma, atherosclerosis, and cerebral malaria. Accordingly, arginase inhibitors may be therapeutically useful. Continuing our efforts to expand the chemical space of arginase inhibitor design and inspired by the binding of 2-(difluoromethyl)-L-ornithine to human arginase I, we now report the first study of the binding of {alpha},{alpha}-disubstituted amino acids to arginase. Specifically, we report the design, synthesis, and assay of racemic 2-amino-6-borono-2-methylhexanoic acid and racemic 2-amino-6-borono-2-(difluoromethyl)hexanoic acid. X-ray crystal structures of human arginase I and Plasmodium falciparum arginase complexed with these inhibitors reveal the exclusive binding of the L-stereoisomer; the additional {alpha}-substituent of each inhibitor is readily accommodated and makes new intermolecular interactions in the outer active site of each enzyme. Therefore, this work highlights a new region of the protein surface that can be targeted for additional affinity interactions, as well as the first comparative structural insights on inhibitor discrimination between a human and a parasitic arginase.

  14. Binding of α,α-disubstituted amino acids to arginase suggests new avenues for inhibitor design.

    PubMed

    Ilies, Monica; Di Costanzo, Luigi; Dowling, Daniel P; Thorn, Katherine J; Christianson, David W

    2011-08-11

    Arginase is a binuclear manganese metalloenzyme that hydrolyzes L-arginine to form L-ornithine and urea, and aberrant arginase activity is implicated in various diseases such as erectile dysfunction, asthma, atherosclerosis, and cerebral malaria. Accordingly, arginase inhibitors may be therapeutically useful. Continuing our efforts to expand the chemical space of arginase inhibitor design and inspired by the binding of 2-(difluoromethyl)-L-ornithine to human arginase I, we now report the first study of the binding of α,α-disubstituted amino acids to arginase. Specifically, we report the design, synthesis, and assay of racemic 2-amino-6-borono-2-methylhexanoic acid and racemic 2-amino-6-borono-2-(difluoromethyl)hexanoic acid. X-ray crystal structures of human arginase I and Plasmodium falciparum arginase complexed with these inhibitors reveal the exclusive binding of the L-stereoisomer; the additional α-substituent of each inhibitor is readily accommodated and makes new intermolecular interactions in the outer active site of each enzyme. Therefore, this work highlights a new region of the protein surface that can be targeted for additional affinity interactions, as well as the first comparative structural insights on inhibitor discrimination between a human and a parasitic arginase.

  15. Nucleotide sequences of the Pseudomonas savastanoi indoleacetic acid genes show homology with Agrobacterium tumefaciens T-DNA

    PubMed Central

    Yamada, Tetsuji; Palm, Curtis J.; Brooks, Bob; Kosuge, Tsune

    1985-01-01

    We report the nucleotide sequences of iaaM and iaaH, the genetic determinants for, respectively, tryptophan 2-monooxygenase and indoleacetamide hydrolase, the enzymes that catalyze the conversion of L-tryptophan to indoleacetic acid in the tumor-forming bacterium Pseudomonas syringae pv. savastanoi. The sequence analysis indicates that the iaaM locus contains an open reading frame encoding 557 amino acids that would comprise a protein with a molecular weight of 61,783; the iaaH locus contains an open reading frame of 455 amino acids that would comprise a protein with a molecular weight of 48,515. Significant amino acid sequence homology was found between the predicted sequence of the tryptophan monooxygenase of P. savastanoi and the deduced product of the T-DNA tms-1 gene of the octopine-type plasmid pTiA6NC from Agrobacterium tumefaciens. Strong homology was found in the 25 amino acid sequence in the putative FAD-binding region of tryptophan monooxygenase. Homology was also found in the amino acid sequences representing the central regions of the putative products of iaaH and tms-2 T-DNA. The results suggest a strong similarity in the pathways for indoleacetic acid synthesis encoded by genes in P. savastanoi and in A. tumefaciens T-DNA. Images PMID:16593610

  16. Discontinuous Occurrence of the hsp70 (dnaK) Gene among Archaea and Sequence Features of HSP70 Suggest a Novel Outlook on Phylogenies Inferred from This Protein

    PubMed Central

    Gribaldo, Simonetta; Lumia, Valentina; Creti, Roberta; Conway de Macario, Everly; Sanangelantoni, Annamaria; Cammarano, Piero

    1999-01-01

    Occurrence of the hsp70 (dnaK) gene was investigated in various members of the domain Archaea comprising both euryarchaeotes and crenarchaeotes and in the hyperthermophilic bacteria Aquifex pyrophilus and Thermotoga maritima representing the deepest offshoots in phylogenetic trees of bacterial 16S rRNA sequences. The gene was not detected in 8 of 10 archaea examined but was found in A. pyrophilus and T. maritima, from which it was cloned and sequenced. Comparative analyses of the HSP70 amino acid sequences encoded in these genes, and others in the databases, showed that (i) in accordance with the vicinities seen in rRNA-based trees, the proteins from A. pyrophilus and T. maritima form a thermophilic cluster with that from the green nonsulfur bacterium Thermomicrobium roseum and are unrelated to their counterparts from gram-positive bacteria, proteobacteria/mitochondria, chlamydiae/spirochetes, deinococci, and cyanobacteria/chloroplasts; (ii) the T. maritima HSP70 clusters with the homologues from the archaea Methanobacterium thermoautotrophicum and Thermoplasma acidophilum, in contrast to the postulated unique kinship between archaea and gram-positive bacteria; and (iii) there are exceptions to the reported association between an insert in HSP70 and gram negativity, or vice versa, absence of insert and gram positivity. Notably, the HSP70 from T. maritima lacks the insert, although T. maritima is phylogenetically unrelated to the gram-positive bacteria. These results, along with the absence of hsp70 (dnaK) in various archaea and its presence in others, suggest that (i) different taxa retained either one or the other of two hsp70 (dnaK) versions (with or without insert), regardless of phylogenetic position; and (ii) archaea are aboriginally devoid of hsp70 (dnaK), and those that have it must have received it from phylogenetically diverse bacteria via lateral gene transfer events that did not involve replacement of an endogenous hsp70 (dnaK) gene. PMID:9882656

  17. n→π* interactions in poly(lactic acid) suggest a role in protein folding.

    PubMed

    Newberry, Robert W; Raines, Ronald T

    2013-09-11

    Poly(lactic acid) (PLA) is a versatile synthetic polyester. We noted that this depsipeptide analog of polyalanine has a helical structure that resembles a polyproline II helix. Using natural bond orbital analysis, we find that n→π* interactions between sequential ester carbonyl groups contribute 0.44 kcal mol(-1) per monomer to the conformational stability of PLA helices. We conclude that analogous n→π* interactions could direct the folding of a polypeptide chain into a polyproline II helix prior to the formation of hydrogen bonds between backbone amides.

  18. Analyses of HTLV-1 sequences suggest interaction between ORF-I mutations and HAM/TSP outcome.

    PubMed

    Barreto, Fernanda Khouri; Khouri, Ricardo; Rego, Filipe Ferreira de Almeida; Santos, Luciane Amorim; Castro-Amarante, Maria Fernanda de; Bialuk, Izabela; Pise-Masison, Cynthia A; Galvão-Castro, Bernardo; Gessain, Antoine; Jacobson, Steven; Franchini, Genoveffa; Alcantara, Luiz Carlos

    2016-11-01

    The region known as pX in the 3' end of the human T-cell lymphotropic virus type 1 (HTLV-1) genome contains four overlapping open reading frames (ORF) that encode regulatory proteins. HTLV-1 ORF-I produces the protein p12 and its cleavage product p8. The functions of these proteins have been linked to immune evasion and viral infectivity and persistence. It is known that the HTLV-1 infection does not necessarily imply the development of pathological processes and here we evaluated whether natural mutations in HTLV-1 ORF-I can influence the proviral load and clinical manifestation of HTLV-I-associated myelopathy/tropical spastic paraparesis (HAM/TSP). For that, we performed molecular characterization, datamining and phylogenetic analysis with HTLV-1 ORF-I sequences from 156 patients with negative or positive diagnosis for HAM/TSP. Our analyses demonstrated that some mutations may be associated with the outcome of HAM/TSP (C39R, L40F, P45L, S69G and R88K) or with proviral load (P34L and F61L). We further examined the presence of mutations in motifs of HBZ and observed that P45L mutation is located within the HBZ nuclear localization signal and was found more frequently between patients with HAM/TSP and high proviral load. These results indicate that some natural mutations are located in functional domains of ORF-I and suggests a potential association between these mutations and the proviral loads and development of HAM/TSP. Therefore it is necessary to conduct functional studies aimed at evaluating the impact of these mutations on the virus persistence and immune evasion.

  19. Multiple site-selective insertions of non-canonical amino acids into sequence-repetitive polypeptides

    PubMed Central

    Wu, I-Lin; Patterson, Melissa A.; Carpenter Desai, Holly E.; Mehl, Ryan A.; Giorgi, Gianluca

    2013-01-01

    A simple and efficient method is described for introduction of non-canonical amino acids at multiple, structurally defined sites within recombinant polypeptide sequences. E. coli MRA30, a bacterial host strain with attenuated activity for release factor 1 (RF1), is assessed for its ability to support the incorporation of a diverse range of non-canonical amino acids in response to multiple encoded amber (TAG) codons within genetic templates derived from superfolder GFP and an elastin-mimetic protein polymer. Suppression efficiency and isolated protein yield were observed to depend on the identity of the orthogonal aminoacyl-tRNA synthetase/tRNACUA pair and the non-canonical amino acid substrate. This approach afforded elastin-mimetic protein polymers containing non-canonical amino acid derivatives at up to twenty-two positions within the repeat sequence with high levels of substitution. The identity and position of the variant residues was confirmed by mass spectrometric analysis of the full-length polypeptides and proteolytic cleavage fragments resulting from thermolysin digestion. The accumulated data suggest that this multi-site suppression approach permits the preparation of protein-based materials in which novel chemical functionality can be introduced at precisely defined positions within the polypeptide sequence. PMID:23625817

  20. Analysis of 18S rRNA gene sequences suggests significant molecular differences between Macrodasyida and Chaetonotida (Gastrotricha).

    PubMed

    Manylov, Oleg G; Vladychenskaya, Natalia S; Milyutina, Irina A; Kedrova, Olga S; Korokhov, Nikolai P; Dvoryanchikov, Gennady A; Aleshin, Vladimir V; Petrov, Nikolai B

    2004-03-01

    Partial 18S rRNA gene sequences of four macrodasyid and one chaetonotid gastrotrichs were obtained and compared with the available sequences of other gastrotrich species and representatives of various metazoan phyla. Contrary to the earlier molecular data, the gastrotrich sequences did not comprise a monophyletic group but formed two distinct clades, corresponding to the Macrodasyida and Chaetonotida, with the basal position occupied by the sequences of Tetranchyroderma sp. and Xenotrichula sp., respectively. Depending on the taxon sampling and methods of analysis, the two clades were separated by various combinations of clades Rotifera, Gnathostomulida, and Platyhelminthes, and never formed a clade with Nematoda. Thus, monophyly of the Gastrotricha is not confirmed by analysis of the presently available molecular data.

  1. The genetics of colored sequence synesthesia: Suggestive evidence of linkage to 16q and genetic heterogeneity for the condition

    PubMed Central

    Tomson, Steffie N.; Avidan, Nili; Lee, Kwanghyuk; Sarma, Anand K.; Tushe, Rejnal; Milewicz, Dianna M.; Bray, Molly; Leal, Suzanne M.; Eagleman, David M.

    2014-01-01

    Synesthesia is a perceptual condition in which sensory stimulation triggers anomalous sensory experiences. In colored sequence synesthesia (CSS), color experiences are triggered by sequences such as letters or numbers. We performed a family based linkage analysis to identify genetic loci responsible for the increased neural crosstalk underlying CSS. Our results implicate a 23 MB region at 16q12.2-23.1, providing the first step in understanding the molecular basis of CSS. PMID:21504763

  2. Nucleic acid (cDNA) and amino acid sequences of the maize endosperm protein glutelin-2.

    PubMed Central

    Prat, S; Cortadas, J; Puigdomènech, P; Palau, J

    1985-01-01

    The cDNA coding for a glutelin-2 protein from maize endosperm has been cloned and the complete amino acid sequence of the protein derived for the first time. An immature maize endosperm cDNA bank was screened for the expression of a beta-lactamase:glutelin-2 (G2) fusion polypeptide by using antibodies against the purified 28 kd G2 protein. A clone corresponding to the 28 kd G2 protein was sequenced and the primary structure of this protein was derived. Five regions can be defined in the protein sequence: an 11 residue N-terminal part, a repeated region formed by eight units of the sequence Pro-Pro-Pro-Val-His-Leu, an alternating Pro-X stretch 21 residues long, a Cys rich domain and a C-terminal part rich in Gln. The protein sequence is preceded by 19 residues which have the characteristics of the signal peptide found in secreted proteins. Unlike zeins, the main maize storage proteins, 28 kd glutelin-2 has several homologous sequences in common with other cereal storage proteins. Images PMID:3839076

  3. Partial amino acid sequence of human pancreatic stone protein, a novel pancreatic secretory protein.

    PubMed Central

    Montalto, G; Bonicel, J; Multigner, L; Rovery, M; Sarles, H; De Caro, A

    1986-01-01

    Pancreatic stone protein (PSP) is the major organic component of human pancreatic stones. With the use of monoclonal antibody immunoadsorbents, five immunoreactive forms (PSP-S) with close Mr values (14,000-19,000) were isolated from normal pancreatic juice. By CM-Trisacryl M chromatography the lowest-Mr form (PSP-S1) was separated from the others and some of its molecular characteristics were investigated. The Mr of the PSP-S1 polypeptide chain calculated from the amino acid composition was about 16,100. The N-terminal sequences (40 residues) of PSP and PSP-S1 are identical, which suggests that the peptide backbone is the same for both of these polypeptides. The PSP-S1 sequence was determined up to residue 65 and was found to be different from all other known protein sequences. Images Fig. 1. PMID:3541906

  4. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  5. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  6. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  7. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  8. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  9. Human liver apolipoprotein B-100 cDNA: complete nucleic acid and derived amino acid sequence.

    PubMed Central

    Law, S W; Grant, S M; Higuchi, K; Hospattankar, A; Lackner, K; Lee, N; Brewer, H B

    1986-01-01

    Human apolipoprotein B-100 (apoB-100), the ligand on low density lipoproteins that interacts with the low density lipoprotein receptor and initiates receptor-mediated endocytosis and low density lipoprotein catabolism, has been cloned, and the complete nucleic acid and derived amino acid sequences have been determined. ApoB-100 cDNAs were isolated from normal human liver cDNA libraries utilizing immunoscreening as well as filter hybridization with radiolabeled apoB-100 oligodeoxynucleotides. The apoB-100 mRNA is 14.1 kilobases long encoding a mature apoB-100 protein of 4536 amino acids with a calculated amino acid molecular weight of 512,723. ApoB-100 contains 20 potential glycosylation sites, and 12 of a total of 25 cysteine residues are located in the amino-terminal region of the apolipoprotein providing a potential globular structure of the amino terminus of the protein. ApoB-100 contains relatively few regions of amphipathic helices, but compared to other human apolipoproteins it is enriched in beta-structure. The delineation of the entire human apoB-100 sequence will now permit a detailed analysis of the conformation of the protein, the low density lipoprotein receptor binding domain(s), and the structural relationship between apoB-100 and apoB-48 and will provide the basis for the study of genetic defects in apoB-100 in patients with dyslipoproteinemias. PMID:3464946

  10. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    PubMed

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.

  11. Deletion of conserved sequences in IG-DMR at Dlk1-Gtl2 locus suggests their involvement in expression of paternally expressed genes in mice

    PubMed Central

    SAITO, Takeshi; HARA, Satoshi; TAMANO, Moe; ASAHARA, Hiroshi; TAKADA, Shuji

    2016-01-01

    Expression regulation of the Dlk1-Dio3 imprinted domain by the intergenic differentially methylated region (IG-DMR) is essential for normal embryonic development in mammals. In this study, we investigated conserved IG-DMR genomic sequences in eutherians to elucidate their role in genomic imprinting of the Dlk1-Dio3 domain. Using a comparative genomics approach, we identified three highly conserved sequences in IG-DMR. To elucidate the functions of these sequences in vivo, we generated mutant mice lacking each of the identified highly conserved sequences using the CRISPR/Cas9 system. Although mutant mice did not exhibit the gross phenotype, deletions of the conserved sequences altered the expression levels of paternally expressed imprinted genes in the mutant embryos without skewing imprinting status. These results suggest that the conserved sequences in IG-DMR are involved in the expression regulation of some of the imprinted genes in the Dlk1-Dio3 domain. PMID:27904015

  12. Deletion of conserved sequences in IG-DMR at Dlk1-Gtl2 locus suggests their involvement in expression of paternally expressed genes in mice.

    PubMed

    Saito, Takeshi; Hara, Satoshi; Tamano, Moe; Asahara, Hiroshi; Takada, Shuji

    2017-02-16

    Expression regulation of the Dlk1-Dio3 imprinted domain by the intergenic differentially methylated region (IG-DMR) is essential for normal embryonic development in mammals. In this study, we investigated conserved IG-DMR genomic sequences in eutherians to elucidate their role in genomic imprinting of the Dlk1-Dio3 domain. Using a comparative genomics approach, we identified three highly conserved sequences in IG-DMR. To elucidate the functions of these sequences in vivo, we generated mutant mice lacking each of the identified highly conserved sequences using the CRISPR/Cas9 system. Although mutant mice did not exhibit the gross phenotype, deletions of the conserved sequences altered the expression levels of paternally expressed imprinted genes in the mutant embryos without skewing imprinting status. These results suggest that the conserved sequences in IG-DMR are involved in the expression regulation of some of the imprinted genes in the Dlk1-Dio3 domain.

  13. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  14. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  15. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  16. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  17. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  18. Site-Directed Mutagenesis and Structural Studies Suggest that the Germination Protease, GPR, in Spores of Bacillus Species Is an Atypical Aspartic Acid Protease

    PubMed Central

    Carroll, Thomas M.; Setlow, Peter

    2005-01-01

    Germination protease (GPR) initiates the degradation of small, acid-soluble spore proteins (SASP) during germination of spores of Bacillus and Clostridium species. The GPR amino acid sequence is not homologous to members of the major protease families, and previous work has not identified residues involved in GPR catalysis. The current work has focused on identifying catalytically essential amino acids by mutagenesis of Bacillus megaterium gpr. A residue was selected for alteration if it (i) was conserved among spore-forming bacteria, (ii) was a potential nucleophile, and (iii) had not been ruled out as inessential for catalysis. GPR variants were overexpressed in Escherichia coli, and the active form (P41) was assayed for activity against SASP and the zymogen form (P46) was assayed for the ability to autoprocess to P41. Variants inactive against SASP and unable to autoprocess were analyzed by circular dichroism spectroscopy and multiangle laser light scattering to determine whether the variant's inactivity was due to loss of secondary or quaternary structure, respectively. Variation of D127 and D193, but no other residues, resulted in inactive P46 and P41, while variants of each form were well structured and tetrameric, suggesting that D127 and D193 are essential for activity and autoprocessing. Mapping these two aspartate residues and a highly conserved lysine onto the B. megaterium P46 crystal structure revealed a striking similarity to the catalytic residues and propeptide lysine of aspartic acid proteases. These data indicate that GPR is an atypical aspartic acid protease. PMID:16199582

  19. Analysis of amino acid sequence variations and immunoglobulin E-binding epitopes of German cockroach tropomyosin.

    PubMed

    Jeong, Kyoung Yong; Lee, Jongweon; Lee, In-Yong; Ree, Han-Il; Hong, Chein-Soo; Yong, Tai-Soon

    2004-09-01

    The allergenicities of tropomyosins from different organisms have been reported to vary. The cDNA encoding German cockroach tropomyosin (Bla g 7) was isolated, expressed, and characterized previously. In the present study, the amino acid sequence variations in German cockroach tropomyosin were analyzed in order to investigate its influence on allergenicity. We also undertook the identification of immunodominant peptides containing immunoglobulin E (IgE) epitopes which may facilitate the development of diagnostic and immunotherapeutic strategies based on the recombinant proteins. Two-dimensional gel electrophoresis and immunoblot analysis with mouse anti-recombinant German cockroach tropomyosin serum was performed to investigate the isoforms at the protein level. Reverse transcriptase PCR (RT-PCR) was applied to examine the sequence diversity. Eleven different variants of the deduced amino acid sequences were identified by RT-PCR. German cockroach tropomyosin has only minor sequence variations that did not seem to affect its allergenicity significantly. These results support the molecular basis underlying the cross-reactivities of arthropod tropomyosins. Recombinant fragments were also generated by PCR, and IgE-binding epitopes were assessed by enzyme-linked immunosorbent assay. Sera from seven patients revealed heterogeneous IgE-binding responses. This study demonstrates multiple IgE-binding epitope regions in a single molecule, suggesting that full-length tropomyosin should be used for the development of diagnostic and therapeutic reagents.

  20. Complete sequence of RNA3 of Cucumber mosaic virus isolates infecting Gerbera jamesonii suggests its grouping under IB subgroup.

    PubMed

    Gautum, K K; Raj, R; Kumar, S; Raj, S K; Roy, R K; Katiyar, R

    2014-01-01

    The complete RNA3 genome of Cucumber mosaic virus (CMV) was amplified by RT-PCR from three infected gerbera (Gerbera jamesonii) leaf samples exhibiting severe chlorotic mosaic and flower deformation symptoms. The amplicons obtained were cloned sequenced and deposited in GenBank under the accessions JN692495, JX913531 (from cv. Zingaro) and JX888093 (from cv. Silvester). These sequences shared 98-99 % identities to each other and with a strain of CMV-Banana reported from India, and 90-95 % identities with various strains of CMV reported worldwide. Phylogenetic analysis revealed their closest affinity with CMV-Banana strain, and close relationships with several other strains of CMV of subgroup IB. This study provides evidence of subgroup IB CMV causing severe chlorosis and flower deformation in two cultivars (Zingaro and Silvester) of G. jamesonii in India.

  1. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment.

  2. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  3. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication.

  4. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  5. Relationships in the Caryophyllales as suggested by phylogenetic analyses of partial chloroplast DNA ORF2280 homolog sequences.

    PubMed

    Downie, S; Katz-Downie, D; Cho, K

    1997-02-01

    Phylogenetic relationships within the angiosperm order Caryophyllales were investigated by comparative sequencing of two portions of the highly conserved inverted repeat (totaling some 1100 base pairs) coinciding with the region occupied by ORF2280 in Nicotiana, the largest gene in the plastid genomes of most land plants. Data were obtained for 33 species in 11 families within the order and for one species each of Plumbaginaceae, Polygonaceae, and Nepenthaceae. These data, when analyzed along with previously published ORF (open reading frame) sequences from Nicotiana. Spinacia. Epifagus, and Pelargonium using parsimony, neighbor-joining, and maximum likelihood methods, reveal that: (1) Amaranthus, Celosia, and Froelichia (all Amaranthaceae) do not comprise a monophyletic group; (2) Amaranthus may be nested within a paraphyletic Chenopodiaceae; (3) Sarcobatus (Chenopodiaceae) is allied with Nyctaginaceae + Phytolaccaceae (the latter family excluding Stegnosperma but including Petiveria); and (4) Caryophyllaceae (with Corrigiola basal within the clade) are sister group to Chenopodiaceae + Amaranthaceae. Basal relations within the order remain obscure. Sequence divergence values in pairwise comparisons across all Caryophyllales taxa ranged from 0.1 to 5% of nucleotides. However, despite these low values, 23 insertion and deletion events were apparent, of which five were informative phylogenetically and bolstered several of the relationships listed above. A polymerase chain reaction (PCR) survey for ORF homolog length variants in representatives from 70 additional angiosperm families revealed major deletions, of 100 to 1400 base pairs, in 19 of these families. Although the ORF is located within the mutationally retarded inverted repeat region of most angiosperm chloroplast DNAs, this gene appears particularly prone to length mutation.

  6. Human retroviruses and aids, 1992. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Korber, B.; Berzofsky, J.A.; Pavlakis, G.N.; Smith, R.F.

    1992-10-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) HIV and SIV Nucleotide Sequences; (H) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions below of the parts of the compendium, the user should read the individual introductions for each part.

  7. High resolution structural evidence suggests the Sarcoplasmic Reticulum forms microdomains with Acidic Stores (lysosomes) in the heart

    PubMed Central

    Aston, Daniel; Capel, Rebecca A.; Ford, Kerrie L.; Christian, Helen C.; Mirams, Gary R.; Rog-Zielinska, Eva A.; Kohl, Peter; Galione, Antony; Burton, Rebecca A. B.; Terrar, Derek A.

    2017-01-01

    Nicotinic Acid Adenine Dinucleotide Phosphate (NAADP) stimulates calcium release from acidic stores such as lysosomes and is a highly potent calcium-mobilising second messenger. NAADP plays an important role in calcium signalling in the heart under basal conditions and following β-adrenergic stress. Nevertheless, the spatial interaction of acidic stores with other parts of the calcium signalling apparatus in cardiac myocytes is unknown. We present evidence that lysosomes are intimately associated with the sarcoplasmic reticulum (SR) in ventricular myocytes; a median separation of 20 nm in 2D electron microscopy and 3.3 nm in 3D electron tomography indicates a genuine signalling microdomain between these organelles. Fourier analysis of immunolabelled lysosomes suggests a sarcomeric pattern (dominant wavelength 1.80 μm). Furthermore, we show that lysosomes form close associations with mitochondria (median separation 6.2 nm in 3D studies) which may provide a basis for the recently-discovered role of NAADP in reperfusion-induced cell death. The trigger hypothesis for NAADP action proposes that calcium release from acidic stores subsequently acts to enhance calcium release from the SR. This work provides structural evidence in cardiac myocytes to indicate the formation of microdomains between acidic and SR calcium stores, supporting emerging interpretations of NAADP physiology and pharmacology in heart. PMID:28094777

  8. Mitochondrial DNA D-loop sequences suggest a Southeast Asian and Indian origin of Zimbabwean village chickens.

    PubMed

    Muchadeyi, F C; Eding, H; Simianer, H; Wollny, C B A; Groeneveld, E; Weigend, S

    2008-12-01

    This study sought to assess mitochondrial DNA (mtDNA) diversity and phylogeographic structure of chickens from five agro-ecological zones of Zimbabwe. Furthermore, chickens from Zimbabwe were compared with populations from other geographical regions (Malawi, Sudan and Germany) and other management systems (broiler and layer purebred lines). Finally, haplotypes of these animals were aligned to chicken sequences, taken from GenBank, that reflected populations of presumed centres of domestication. A 455-bp fragment of the mtDNA D-loop region was sequenced in 283 chickens of 14 populations. Thirty-two variable sites that defined 34 haplotypes were observed. In Zimbabwean chickens, diversity within ecotypes accounted for 96.8% of the variation, indicating little differentiation between ecotypes. The 34 haplotypes clustered into three clades that corresponded to (i) Zimbabwean and Malawian chickens, (ii) broiler and layer purebred lines and Northwest European chickens, and (iii) a mixture of chickens from Zimbabwe, Sudan, Northwest Europe and the purebred lines. Diversity among clades explained more than 80% of the total variation. Results indicated the existence of two distinct maternal lineages evenly distributed among the five Zimbabwean chicken ecotypes. For one of these lineages, chickens from Zimbabwe and Malawi shared major haplotypes with chicken populations that have a Southeast Asian background. The second maternal lineage, probably from the Indian subcontinent, was common to the five Zimbabwean chicken ecotypes, Sudanese and Northwest European chickens as well as purebred broiler and layer chicken lines. A third maternal lineage excluded Zimbabwean and other African chickens and clustered with haplotypes presumably originating from South China.

  9. A 25-Amino Acid Sequence of the Arabidopsis TGD2 Protein Is Sufficient for Specific Binding of Phosphatidic Acid*

    PubMed Central

    Lu, Binbin; Benning, Christoph

    2009-01-01

    Genetic analysis suggests that the TGD2 protein of Arabidopsis is required for the biosynthesis of endoplasmic reticulum derived thylakoid lipids. TGD2 is proposed to be the substrate-binding protein of a presumed lipid transporter consisting of the TGD1 (permease) and TGD3 (ATPase) proteins. The TGD1, -2, and -3 proteins are localized in the inner chloroplast envelope membrane. TGD2 appears to be anchored with an N-terminal membrane-spanning domain into the inner envelope membrane, whereas the C-terminal domain faces the intermembrane space. It was previously shown that the C-terminal domain of TGD2 binds phosphatidic acid (PtdOH). To investigate the PtdOH binding site of TGD2 in detail, the C-terminal domain of the TGD2 sequence lacking the transit peptide and transmembrane sequences was fused to the C terminus of the Discosoma sp. red fluorescent protein (DR). This greatly improved the solubility of the resulting DR-TGD2C fusion protein following production in Escherichia coli. The DR-TGD2C protein bound PtdOH with high specificity, as demonstrated by membrane lipid-protein overlay and liposome association assays. Internal deletion and truncation mutagenesis identified a previously undescribed minimal 25-amino acid fragment in the C-terminal domain of TGD2 that is sufficient for PtdOH binding. Binding characteristics of this 25-mer were distinctly different from those of TGD2C, suggesting that additional sequences of TGD2 providing the proper context for this 25-mer are needed for wild type-like PtdOH binding. PMID:19416982

  10. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  11. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  12. Completion of the amino acid sequence of the alpha 1 chain from type I calf skin collagen. Amino acid sequence of alpha 1(I)B8.

    PubMed Central

    Glanville, R W; Breitkreutz, D; Meitinger, M; Fietzek, P P

    1983-01-01

    The complete amino acid sequence of the 279-residue CNBr peptide CB8 from the alpha 1 chain of type I calf skin collagen is presented. It was determined by sequencing overlapping fragments of CB8 produced by Staphylococcus aureus V8 proteinase, trypsin, Endoproteinase Arg-C and hydroxylamine. Tryptic cleavages were also made specific for lysine by blocking arginine residues with cyclohexane-1,2-dione. This completes the amino acid sequence analysis of the 1054-residues-long alpha (I) chain of calf skin collagen. PMID:6354180

  13. Microbial community dynamics in bioaugmented sequencing batch reactors for bromoamine acid removal.

    PubMed

    Qu, Yuanyuan; Zhou, Jiti; Wang, Jing; Fu, Xiang; Xing, Linlin

    2005-05-01

    Sphingomonas xenophaga QYY with the ability to degrade bromoamine acid (BAA) was previously isolated from sludge samples. The enhancement of BAA removal by strain QYY in sequencing batch reactors (SBRs) was investigated in this study. The results showed that augmented SBRs exhibited stronger abilities to degrade BAA than the non-augmented control one. In order to estimate the relationship between community dynamics and function of augmented SBRs, a combined method based on fingerprints (ribosomal intergenic spacer analysis, RISA) and 16S rRNA gene sequencing was used. The results indicated that the microbial community dynamics were substantially changed, and the introduced strain QYY was persistent in the augmented systems. This study suggests that it is feasible and potentially useful to enhance BAA removal using BAA-degrading bacteria, such as S. xenophaga QYY.

  14. Alignment of 700 globin sequences: extent of amino acid substitution and its correlation with variation in volume.

    PubMed Central

    Kapp, O. H.; Moens, L.; Vanfleteren, J.; Trotman, C. N.; Suzuki, T.; Vinogradov, S. N.

    1995-01-01

    Seven-hundred globin sequences, including 146 nonvertebrate sequences, were aligned on the basis of conservation of secondary structure and the avoidance of gap penalties. Of the 182 positions needed to accommodate all the globin sequences, only 84 are common to all, including the absolutely conserved PheCD1 and HisF8. The mean number of amino acid substitutions per position ranges from 8 to 13 for all globins and 5 to 9 for internal positions. Although the total sequence volumes have a variation approximately 2-3%, the variation in volume per position ranges from approximately 13% for the internal to approximately 21% for the surface positions. Plausible correlations exist between amino acid substitution and the variation in volume per position for the 84 common and the internal but not the surface positions. The amino acid substitution matrix derived from the 84 common positions was used to evaluate sequence similarity within the globins and between the globins and phycocyanins C and colicins A, via calculation of pairwise similarity scores. The scores for globin-globin comparisons over the 84 common positions overlap the globin-phycocyanin and globin-colicin scores, with the former being intermediate. For the subset of internal positions, overlap is minimal between the three groups of scores. These results imply a continuum of amino acid sequences able to assume the common three-on-three alpha-helical structure and suggest that the determinants of the latter include sites other than those inaccessible to solvent. PMID:8535255

  15. An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data.

    PubMed Central

    Adzhubei, I A; Adzhubei, A A; Neidle, S

    1998-01-01

    We have constructed a non-homologous database, termed the Integrated Sequence-Structure Database (ISSD) which comprises the coding sequences of genes, amino acid sequences of the corresponding proteins, their secondary structure and straight phi,psi angles assignments, and polypeptide backbone coordinates. Each protein entry in the database holds the alignment of nucleotide sequence, amino acid sequence and the PDB three-dimensional structure data. The nucleotide and amino acid sequences for each entry are selected on the basis of exact matches of the source organism and cell environment. The current version 1.0 of ISSD is available on the WWW at http://www.protein.bio.msu.su/issd/ and includes 107 non-homologous mammalian proteins, of which 80 are human proteins. The database has been used by us for the analysis of synonymous codon usage patterns in mRNA sequences showing their correlation with the three-dimensional structure features in the encoded proteins. Possible ISSD applications include optimisation of protein expression, improvement of the protein structure prediction accuracy, and analysis of evolutionary aspects of the nucleotide sequence-protein structure relationship. PMID:9399866

  16. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor.

  17. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  18. The complete genome sequences of a Peruvian and a Colombian isolate of Andean potato latent virus and partial sequences of further isolates suggest the existence of two distinct potato-infecting tymovirus species.

    PubMed

    Kreuze, Jan; Koenig, Renate; De Souza, Joao; Vetten, Heinrich Josef; Muller, Giovanna; Flores, Betty; Ziebell, Heiko; Cuellar, Wilmer

    2013-05-01

    The complete genomic RNA sequences of the tymovirus isolates Hu and Col from potato which originally had been considered to be strains of the same virus species, i.e. Andean potato latent virus (APLV), were determined by siRNA sequencing and assembly, and found to share only c. 65% nt sequence identity. This result together with those of serological tests and comparisons of the coat protein gene sequences of additional tymovirus isolates from potato suggest that the species Andean potato latent virus should be subdivided into two species, i.e. APLV and Andean potato mild mosaic virus (APMMV). Primers were designed for the broad specificity detection of both viruses.

  19. Individual sequence variability and functional activities of fibrinogen-related proteins (FREPs) in the Mediterranean mussel (Mytilus galloprovincialis) suggest ancient and complex immune recognition models in invertebrates.

    PubMed

    Romero, Alejandro; Dios, Sonia; Poisa-Beiro, Laura; Costa, Maria M; Posada, David; Figueras, Antonio; Novoa, Beatriz

    2011-03-01

    In this paper, we describe sequences of fibrinogen-related proteins (FREPs) in the Mediterranean mussel Mytilus galloprovincialis (MuFREPs) with the fibrinogen domain probably involved in the antigen recognition, but without the additional collagen-like domain of ficolins, molecules responsible for complement activation by the lectin pathway. Although they do not seem to be true or primive ficolins since the phylogenetic analysis are not conclusive enough, their expression is increased after bacterial infection or PAMPs treatment and they present opsonic activities similar to mammalian ficolins. The most remarkable aspect of these sequences was the existence of a very diverse set of FREP sequences among and within individuals (different mussels do not share any identical sequence) which parallels the extraordinary complexity of the immune system, suggesting the existence of a primitive system with a potential capacity to recognize and eliminate different kind of pathogens.

  20. Solving the woolly mammoth conundrum: amino acid ¹⁵N-enrichment suggests a distinct forage or habitat.

    PubMed

    Schwartz-Narbonne, Rachel; Longstaffe, Fred J; Metcalfe, Jessica Z; Zazula, Grant

    2015-06-09

    Understanding woolly mammoth ecology is key to understanding Pleistocene community dynamics and evaluating the roles of human hunting and climate change in late Quaternary megafaunal extinctions. Previous isotopic studies of mammoths' diet and physiology have been hampered by the 'mammoth conundrum': woolly mammoths have anomalously high collagen δ(15)N values, which are more similar to coeval carnivores than herbivores, and which could imply a distinct diet and (or) habitat, or a physiological adaptation. We analyzed individual amino acids from collagen of adult woolly mammoths and coeval species, and discovered greater  (15)N enrichment in source amino acids of woolly mammoths than in most other herbivores or carnivores. Woolly mammoths consumed an isotopically distinct food source, reflective of extreme aridity, dung fertilization, and (or) plant selection. This dietary signal suggests that woolly mammoths occupied a distinct habitat or forage niche relative to other Pleistocene herbivores.

  1. Solving the woolly mammoth conundrum: amino acid 15N-enrichment suggests a distinct forage or habitat

    NASA Astrophysics Data System (ADS)

    Schwartz-Narbonne, Rachel; Longstaffe, Fred J.; Metcalfe, Jessica Z.; Zazula, Grant

    2015-06-01

    Understanding woolly mammoth ecology is key to understanding Pleistocene community dynamics and evaluating the roles of human hunting and climate change in late Quaternary megafaunal extinctions. Previous isotopic studies of mammoths’ diet and physiology have been hampered by the ‘mammoth conundrum’: woolly mammoths have anomalously high collagen δ15N values, which are more similar to coeval carnivores than herbivores, and which could imply a distinct diet and (or) habitat, or a physiological adaptation. We analyzed individual amino acids from collagen of adult woolly mammoths and coeval species, and discovered greater  15N enrichment in source amino acids of woolly mammoths than in most other herbivores or carnivores. Woolly mammoths consumed an isotopically distinct food source, reflective of extreme aridity, dung fertilization, and (or) plant selection. This dietary signal suggests that woolly mammoths occupied a distinct habitat or forage niche relative to other Pleistocene herbivores.

  2. Solving the woolly mammoth conundrum: amino acid 15N-enrichment suggests a distinct forage or habitat

    PubMed Central

    Schwartz-Narbonne, Rachel; Longstaffe, Fred J.; Metcalfe, Jessica Z.; Zazula, Grant

    2015-01-01

    Understanding woolly mammoth ecology is key to understanding Pleistocene community dynamics and evaluating the roles of human hunting and climate change in late Quaternary megafaunal extinctions. Previous isotopic studies of mammoths’ diet and physiology have been hampered by the ‘mammoth conundrum’: woolly mammoths have anomalously high collagen δ15N values, which are more similar to coeval carnivores than herbivores, and which could imply a distinct diet and (or) habitat, or a physiological adaptation. We analyzed individual amino acids from collagen of adult woolly mammoths and coeval species, and discovered greater  15N enrichment in source amino acids of woolly mammoths than in most other herbivores or carnivores. Woolly mammoths consumed an isotopically distinct food source, reflective of extreme aridity, dung fertilization, and (or) plant selection. This dietary signal suggests that woolly mammoths occupied a distinct habitat or forage niche relative to other Pleistocene herbivores. PMID:26056037

  3. Cretaceous stratigraphic sequences of north-central California suggest a discontinuity in the Late Cretaceous forearc basin

    SciTech Connect

    Haggart, J.W.

    1986-10-01

    The Cretaceous sedimentary succession preserved east of Redding, at the northern end of California's Great Valley, indicates that marine deposition was widespread in the region for only two periods during the Late Cretaceous. If it is assumed that there was minimal Cenozoic offset between the northern Sierra Nevada and eastern Klamath Mountains terranes, Cretaceous sedimentation in this region was most likely restricted to a narrow trough and was not a continuation of the wide, Cretaceous forearc basin of central California. The dissimilar depositional histories of the Redding basin and the Hornbrook basin of north-central California suggest that the basins were not linked continuously during the Late Cretaceous. A thick section of Cretaceous strata beneath the southwestern Modoc Plateau is considered unlikely.

  4. Co-induction of methyltransferase Rv0560c by naphthoquinones and fibric acids suggests attenuation of isoprenoid quinone action in Mycobacterium tuberculosis.

    PubMed

    Garbe, Thomas R

    2004-10-01

    The superoxide generator menadione was previously demonstrated as an inducer of growth stage dependent protein patterns in Mycobacterium tuberculosis. The present study refines this observation by characterizing a novel 27-kDa protein that had not been observed in previous studies relying on younger cultures. A very similar response, based on two-dimensional gel electrophoretic analyses, was induced by the closely related naphthoquinone plumbagin. The 27-kDa protein was also induced by the pro-oxidant peroxisome proliferator gemfibrozil and to a lesser extent by the structurally related compounds fenofibrate and clofibrate. N-terminal sequence data of proteolytic fragments from the 27-kDa protein demonstrated its identity with protein Rv0560c, previously demonstrated to be inducible by salicylate, which also possesses peroxisome proliferating properties. Protein Rv0560c bears three conserved motifs characteristic of S-adenosylmethionine-dependent methyltransferases. Further sequence similarities suggest a function in the bio syn thesis of isoprenoid compounds, e.g., tocopherol, ubiquinone, and sterols. Such involvement is supported by the recognized yet unexplained widespread interference of menadione, salicylate, and fibrates with the isoprenoid quinones ubiquinone, menaquinone, and vitamin K. Induction of Rv0560c by fibrates, salicylate, and naphthoquinones is thus suggested to be caused by action on the plasma membrane, reminiscent of cytochrome P450BM-3 induction by fibrates in Bacillus megaterium, which catalyzes the hydroxylation of fatty acids and thus modulates membrane properties.

  5. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  6. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  7. Overlapping Patterns of Rapid Evolution in the Nucleic Acid Sensors cGAS and OAS1 Suggest a Common Mechanism of Pathogen Antagonism and Escape

    PubMed Central

    Hancks, Dustin C.; Hartley, Melissa K.; Hagan, Celia; Clark, Nathan L.; Elde, Nels C.

    2015-01-01

    A diverse subset of pattern recognition receptors (PRRs) detects pathogen-associated nucleic acids to initiate crucial innate immune responses in host organisms. Reflecting their importance for host defense, pathogens encode various countermeasures to evade or inhibit these immune effectors. PRRs directly engaged by pathogen inhibitors often evolve under recurrent bouts of positive selection that have been described as molecular ‘arms races.’ Cyclic GMP-AMP synthase (cGAS) was recently identified as a key PRR. Upon binding cytoplasmic double-stranded DNA (dsDNA) from various viruses, cGAS generates the small nucleotide secondary messenger cGAMP to signal activation of innate defenses. Here we report an evolutionary history of cGAS with recurrent positive selection in the primate lineage. Recent studies indicate a high degree of structural similarity between cGAS and 2’-5’-oligoadenylate synthase 1 (OAS1), a PRR that detects double-stranded RNA (dsRNA), despite low sequence identity between the respective genes. We present comprehensive comparative evolutionary analysis of cGAS and OAS1 primate sequences and observe positive selection at nucleic acid binding interfaces and distributed throughout both genes. Our data revealed homologous regions with strong signatures of positive selection, suggesting common mechanisms employed by unknown pathogen encoded inhibitors and similar modes of evasion from antagonism. Our analysis of cGAS diversification also identified alternately spliced forms missing multiple sites under positive selection. Further analysis of selection on the OAS family in primates, which comprises OAS1, OAS2, OAS3 and OASL, suggests a hypothesis where gene duplications and domain fusion events result in paralogs that provide another means of escaping pathogen inhibitors. Together our comparative evolutionary analysis of cGAS and OAS provides new insights into distinct mechanisms by which key molecular sentinels of the innate immune system have

  8. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  9. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  10. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have important role in environmental issues not only on the Earth but also other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. We can see various iron- bearing alteration products in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrence including alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of core and the crust. Reddish core is characterized by occurrence of crystalline hematite by XRD. Another hand specimen has light gray core, which represents reduced conditions, and white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material shows iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. Variations of alteration

  11. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities.

  12. The amino acid sequences of the Fd fragments of two human γ heavy chains

    PubMed Central

    Press, E. M.; Hogg, N. M.

    1970-01-01

    The amino acid sequences of the Fd fragments of two human pathological immunoglobulins of the immunoglobulin G1 class are reported. Comparison of the two sequences shows that the heavy-chain variable regions are similar in length to those of the light chains. The existence of heavy chain variable region subgroups is also deduced, from a comparison of these two sequences with those of another γ 1 chain, Eu, a μ chain, Ou, and the partial sequence of a fourth γ 1 chain, Ste. Carbohydrate has been found to be linked to an aspartic acid residue in the variable region of one of the γ 1 chains, Cor. PMID:5449120

  13. Complete nucleotide sequence and construction of an infectious clone of Chinese yam necrotic mosaic virus suggest that macluraviruses have the smallest genome among members of the family Potyviridae.

    PubMed

    Kondo, Toru; Fujita, Takashi

    2012-12-01

    The complete nucleotide sequence of Chinese yam necrotic mosaic virus (CYNMV) was determined from cloned virus cDNA. The CYNMV genomic RNA is 8224 nucleotides in length, excluding the poly(A) tail, and contains one long open reading frame encoding a large polyprotein of 2620 amino acids. CYNMV has no counterpart to the P1 cistron and a short HC-Pro cistron located at the 5' side of the potyvirus genome. A full-length cDNA clone, pCYNMV, was assembled under the control of the cauliflower mosaic virus 35S promoter and the nopaline synthase terminator. Biolistic inoculation of Nagaimo plants with cDNA resulted in systemic necrotic mosaic symptoms typical of CYNMV infection. To our knowledge, this is the first report of the complete nucleotide sequence and construction of an infectious cDNA clone of a member of the genus Macluravirus.

  14. Complete sequencing of the bla(NDM-1)-positive IncA/C plasmid from Escherichia coli ST38 isolate suggests a possible origin from plant pathogens.

    PubMed

    Sekizuka, Tsuyoshi; Matsui, Mari; Yamane, Kunikazu; Takeuchi, Fumihiko; Ohnishi, Makoto; Hishinuma, Akira; Arakawa, Yoshichika; Kuroda, Makoto

    2011-01-01

    The complete sequence of the plasmid pNDM-1_Dok01 carrying New Delhi metallo-β-lactamase (NDM-1) was determined by whole genome shotgun sequencing using Escherichia coli strain NDM-1_Dok01 (multilocus sequence typing type: ST38) and the transconjugant E. coli DH10B. The plasmid is an IncA/C incompatibility type composed of 225 predicted coding sequences in 195.5 kb and partially shares a sequence with bla(CMY-2)-positive IncA/C plasmids such as E. coli AR060302 pAR060302 (166.5 kb) and Salmonella enterica serovar Newport pSN254 (176.4 kb). The bla(NDM-1) gene in pNDM-1_Dok01 is terminally flanked by two IS903 elements that are distinct from those of the other characterized NDM-1 plasmids, suggesting that the bla(NDM-1) gene has been broadly transposed, together with various mobile elements, as a cassette gene. The chaperonin groES and groEL genes were identified in the bla(NDM-1)-related composite transposon, and phylogenetic analysis and guanine-cytosine content (GC) percentage showed similarities to the homologs of plant pathogens such as Pseudoxanthomonas and Xanthomonas spp., implying that plant pathogens are the potential source of the bla(NDM-1) gene. The complete sequence of pNDM-1_Dok01 suggests that the bla(NDM-1) gene was acquired by a novel composite transposon on an extensively disseminated IncA/C plasmid and transferred to the E. coli ST38 isolate.

  15. Effects of Acidic Peptide Size and Sequence on Trivalent Praseodymium Adduction and Electron Transfer Dissociation Mass Spectrometry.

    PubMed

    Commodore, Juliette J; Cassady, Carolyn J

    2017-02-07

    Using the lanthanide ion praseodymium, Pr(III), metallated ion formation and electron transfer dissociation (ETD) were studied for 25 biological and model acidic peptides. For chain lengths of seven or more residues, even highly acidic peptides that can be difficult to protonate by electrospray ionization will metallate and undergo abundant ETD fragmentation. Peptides composed of predominantly acidic residues form only the deprotonated ion, [M + Pr - H](2+) ; this ion yields near complete ETD sequence coverage for larger peptides. Peptides with a mixture of acidic and neutral residues, generate [M + Pr](3+) , which cleaves between every residue for many peptides. Acidic peptides that contain at least one residue with a basic side chain also produce the protonated ion, [M + Pr + H](4+) ; this ion undergoes the most extensive sequence coverage by ETD. Primarily metallated and non-metallated c- and z-ions form for all peptides investigated. Metal adducted product ions are only present when at least half of the peptide sequence can be incorporated into the ion; this suggests that the metal ion simultaneously attaches to more than one acidic site. The only site consistently lacking dissociation is at the N-terminal side of a proline residue. Increasing peptide chain length generates more backbone cleavage for metal-peptide complexes with the same charge state. For acidic peptides with the same length, increasing the precursor ion charge state from 2+ to 3+ also leads to more cleavage. The results of this study indicate that highly acidic peptides can be sequenced by ETD of complexes formed with Pr(III).

  16. Amino acid sequences of alpha-helical segments from S-carbosymethylkerateine-A. Complete sequence of a type-I segment.

    PubMed Central

    Gough, K H; Inglis, A S; Crewther, W G

    1978-01-01

    The amino acid sequence of a type-I helical segment from the low-sulphur protein (S-carboxymethylkerateine-A) of wool was determined by combining automatic and manual-sequencing data. Whereas in the type-II helical segment most of the cationic groups occur in pairs, 11 of the 22 anionic residues in the sequence of the type-I segment were situated next to a second anionic residue. This suggests possible interactions between type-I and type-II helical segments in alpha-keratin. As observed with the sequence of a type-II helical segment a model constructed on 3.6 residues per turn of helix shows a line of hydrophobic residues along the helix, thereby supporting the physicochemical evidence that the molecule is predominantly helical and forms part of a coiled-coil structure. Examination of the sequence data by predictive methods indicates the possibilty of extensive sections of alpha-helix interspersed with discontinuities. The molecule contains a number of regions with peptide sequences identical with those found by other workers after enzymic digestion of fractions from oxidized wool. Images Fig. 1. PMID:697725

  17. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua ); Eng, J.; Yalow, R.S. City Univ. of New York, NY )

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled park insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  18. The amino acid sequence of goat beta-lactoglobulin.

    PubMed

    Préaux, G; Braunitzer, G; Schrank, B; Stangl, A

    1979-11-01

    The isolation of beta-lactoglobulin from milk of the goat is described. The purified protein was checked for purity and has been characterized by its gross composition and end groups. The native or the modified protein was then degraded by tryptic and cyanogen bromide cleavage. The cleavage products were isolated and sequenced in the sequenator using a Quadrol and propyne program. These data provide the complete sequence of beta-lactoglobulin of the goat. The results are discussed and compared particularly with bovine beta-lactoglobulin components AB. Some biological aspects are described.

  19. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    PubMed

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tamdem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  20. Synthesis of gamma,delta-unsaturated glycolic acids via sequenced brook and Ireland--claisen rearrangements.

    PubMed

    Schmitt, Daniel C; Johnson, Jeffrey S

    2010-03-05

    Organozinc, -magnesium, and -lithium nucleophiles initiate a Brook/Ireland-Claisen rearrangement sequence of allylic silyl glyoxylates resulting in the formation of gamma,delta-unsaturated alpha-silyloxy acids.

  1. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  2. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    PubMed

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  3. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    PubMed

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin.

  4. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) comparing with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly).

  5. Molecular cloning and sequencing of a cDNA encoding the thioesterase domain of the rat fatty acid synthetase.

    PubMed

    Naggert, J; Witkowski, A; Mikkelsen, J; Smith, S

    1988-01-25

    A cloned cDNA containing the entire coding sequence for the long-chain S-acyl fatty acid synthetase thioester hydrolase (thioesterase I) component as well as the 3'-noncoding region of the fatty acid synthetase has been isolated using an expression vector and domain-specific antibodies. The coding region was assigned to the thioesterase I domain by identification of sequences coding for characterized peptide fragments, amino-terminal analysis of the isolated thioesterase I domain and the presence of the serine esterase active-site sequence motif. The thioesterase I domain is 306 amino acids long with a calculated molecular mass of 33,476 daltons; its DNA is flanked at the 5'-end by a region coding for the acyl carrier protein domain and at the 3'-end by a 1,537-base pairs-long noncoding sequence with a poly(A) tail. The thioesterase I domain exhibits a low, albeit discernible, homology with the discrete medium-chain S-acyl fatty acid synthetase thioester hydrolases (thioesterase II) from rat mammary gland and duck uropygial gland, suggesting a distant but common evolutionary ancestry for these proteins.

  6. Single-chain structure of human ceruloplasmin: the complete amino acid sequence of the whole molecule.

    PubMed Central

    Takahashi, N; Ortel, T L; Putnam, F W

    1984-01-01

    We have determined the amino acid sequence of the amino-terminal 67,000-dalton (67-kDa) fragment of human ceruloplasmin and have established overlapping sequences between the 67-kDa and 50-kDa fragments and between the 50-kDa and 19-kDa fragments. The 67-kDa fragment contains 480 amino acid residues and three glucosamine oligosaccharides. These results together with our previous sequence data for the 50-kDa and 19-kDa fragments complete the amino acid sequence of human ceruloplasmin. The polypeptide chain has a total of 1,046 amino acid residues (Mr 120,085) and has attachment sites for four glucosamine oligosaccharides; together these account for the total molecular mass of human ceruloplasmin (132 kDa). The sequence analysis of the peptides overlapping the fragments showed that one additional amino acid, arginine, is present between the 67-kDa and 50-kDa fragments, and another, lysine, is between the 50-kDa and 19-kDa fragments. Only two apparent sites of amino acid interchange have been identified in the polypeptide chain. Both involve a single-point interchange of glycine and lysine that would result in a difference in charge. The results of the complete sequence analysis verified that human ceruloplasmin is composed of a single polypeptide chain and that the subunit-like fragments are produced by proteolytic cleavage during purification (and possibly also in vivo). PMID:6582496

  7. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    PubMed Central

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  8. The complete genome sequence of the Alphaentomopoxvirus Anomala cuprea entomopoxvirus, including its terminal hairpin loop sequences, suggests a potentially unique mode of apoptosis inhibition and mode of DNA replication.

    PubMed

    Mitsuhashi, Wataru; Miyamoto, Kazuhisa; Wada, Sanae

    2014-03-01

    Complete genome sequence of Anomala cuprea entomopoxvirus, which belongs to the genus Alphaentomopoxvirus, including its terminal hairpin loop sequences, is reported. This is the first genome sequence of Alphaentomopoxvirus reported, and hairpin loops in entomopoxviruses have not previously been sequenced. The genome is 245,717 bp, which is smaller than had previously been estimated for Alphaentomopoxvirus. The inverted terminal repeats are quite long, and experimental results suggest that one genome molecule has one type of hairpin at one end and another type at the other end. The genome contains unexpected ORFs, e.g., that for the ubiquitin-conjugating enzyme E2 of eukaryotes. The BIR and RING domains found in a single ORF for an inhibitor of apoptosis in baculoviruses and entomopoxviruses occurred in two different, widely separated ORFs. Furthermore, an ORF in the genome contains a serpin domain that was previously found in vertebrate poxviruses for apoptosis inhibition but not in insect viruses.

  9. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    PubMed Central

    2010-01-01

    Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C. sticklandii genome and

  10. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    PubMed

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  11. Evidence to suggest that gonadotropin-releasing hormone inhibits its own secretion by affecting hypothalamic amino acid neurotransmitter release.

    PubMed

    Feleder, C; Jarry, H; Leonhardt, S; Moguilevsky, J A; Wuttke, W

    1996-10-01

    The mediobasal hypothalamus of rats contains gonadotropin-releasing hormone (GnRH) receptors. These hypothalamic neurons also express the GnRH corresponding gene. Under these circumstances, the possibility exists that these GnRH receptors could be localized in other neurons, which are GnRH-receptive, unknowing the neurotransmitter quality. Therefore, we studied the in vitro effects of the GnRH agonist buserelin on GnRH, glutamate, gamma-amino-butyric acid (GABA) and taurine release from explanted superfused hypothalami of untreated and buserelin-pretreated (down-regulated) male rats. When buserelin was added to the superfusion medium it inhibited promptly the release of GnRH and the excitatory amino acid neurotransmitter glutamate, but stimulated the release of the inhibitory neurotransmitters, GABA and taurine. Hypothalamic release of GnRH from hypothalami collected from buserelin-treated (30 micrograms/100 g b.w. twice daily for 4 days) male rats released significantly less GnRH, glutamate and more GABA and taurine. The inhibitory effect of buserelin was maintained when the superfusion medium continuously contained the GnRH analog. When superfusion of hypothalami from buserelin-pretreated animals was performed in the absence of buserelin, GnRH and glutamate release increased significantly within 45-60 min, whereas GABA and taurine release decreased at this time point. When buserelin was added to the superfusion medium 2 h after buserelin-free superfusion, GnRH and glutamate release decreased whereas GABA and taurine release increased instantaneously. Buserelin-treated rats showed significantly low values of LH and testosterone than the untreated rats. These results suggest that GnRH receptors may not only be present in GnRH axon terminals in the median eminence, but also on glutamatergic, GABAergic and taurinergic neurons by which GnRH may exert an autoinhibitory ultrashort loop feedback on its own secretion. This effect appears to be connected with glutamatergic

  12. Fragmentation Characteristics of Deprotonated N-linked Glycopeptides: Influences of Amino Acid Composition and Sequence

    NASA Astrophysics Data System (ADS)

    Nishikaze, Takashi; Kawabata, Shin-ichirou; Tanaka, Koichi

    2014-06-01

    Glycopeptide structural analysis using tandem mass spectrometry is becoming a common approach for elucidating site-specific N-glycosylation. The analysis is generally performed in positive-ion mode. Therefore, fragmentation of protonated glycopeptides has been extensively investigated; however, few studies are available on deprotonated glycopeptides, despite the usefulness of negative-ion mode analysis in detecting glycopeptide signals. Here, large sets of glycopeptides derived from well-characterized glycoproteins were investigated to understand the fragmentation behavior of deprotonated N-linked glycopeptides under low-energy collision-induced dissociation (CID) conditions. The fragment ion species were found to be significantly variable depending on their amino acid sequence and could be classified into three types: (i) glycan fragment ions, (ii) glycan-lost fragment ions and their secondary cleavage products, and (iii) fragment ions with intact glycan moiety. The CID spectra of glycopeptides having a short peptide sequence were dominated by type (i) glycan fragments (e.g., 2,4AR, 2,4AR-1, D, and E ions). These fragments define detailed structural features of the glycan moiety such as branching. For glycopeptides with medium or long peptide sequences, the major fragments were type (ii) ions (e.g., [peptide + 0,2X0-H]- and [peptide-NH3-H]-). The appearance of type (iii) ions strongly depended on the peptide sequence, and especially on the presence of Asp, Asn, and Glu. When a glycosylated Asn is located on the C-terminus, an interesting fragment having an Asn residue with intact glycan moiety, [glycan + Asn-36]-, was abundantly formed. Observed fragments are reasonably explained by a combination of existing fragmentation rules suggested for N-glycans and peptides.

  13. SETG: Nucleic Acid Extraction and Sequencing for In Situ Life Detection on Mars

    NASA Astrophysics Data System (ADS)

    Mojarro, A.; Hachey, J.; Tani, J.; Smith, A.; Bhattaru, S. A.; Pontefract, A.; Doebler, R.; Brown, M.; Ruvkun, G.; Zuber, M. T.; Carr, C. E.

    2016-10-01

    We are developing an integrated nucleic acid extraction and sequencing instrument: the Search for Extra-Terrestrial Genomes (SETG) for in situ life detection on Mars. Our goals are to identify related or unrelated nucleic acid-based life on Mars.

  14. Draft Genome Sequence of Cyanobacterium sp. Strain IPPAS B-1200 with a Unique Fatty Acid Composition

    PubMed Central

    Starikov, Alexander Y.; Usserbaeva, Aizhan A.; Sinetova, Maria A.; Sarsekeyeva, Fariza K.; Zayadan, Bolatkhan K.; Ustinova, Vera V.; Kupriyanova, Elena V.; Los, Dmitry A.

    2016-01-01

    Here, we report the draft genome of Cyanobacterium sp. IPPAS strain B-1200, isolated from Lake Balkhash, Kazakhstan, and characterized by the unique fatty acid composition of its membrane lipids, which are enriched with myristic and myristoleic acids. The approximate genome size is 3.4 Mb, and the predicted number of coding sequences is 3,119. PMID:27856596

  15. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    PubMed

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  16. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    PubMed

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  17. Analysis of cloned cDNA and genomic sequences for phytochrome: complete amino acid sequences for two gene products expressed in etiolated Avena.

    PubMed Central

    Hershey, H P; Barker, R F; Idler, K B; Lissemore, J L; Quail, P H

    1985-01-01

    Cloned cDNA and genomic sequences have been analyzed to deduce the amino acid sequence of phytochrome from etiolated Avena. Restriction endonuclease site polymorphism between clones indicates that at least four phytochrome genes are expressed in this tissue. Sequence analysis of two complete and one partial coding region shows approximately 98% homology at both the nucleotide and amino acid levels, with the majority of amino acid changes being conservative. High sequence homology is also found in the 5'-untranslated region but significant divergence occurs in the 3'-untranslated region. The phytochrome polypeptides are 1128 amino acid residues long corresponding to a molecular mass of 125 kdaltons. The known protein sequence at the chromophore attachment site occurs only once in the polypeptide, establishing that phytochrome has a single chromophore per monomer covalently linked to Cys-321. Computer analyses of the amino acid sequences have provided predictions regarding a number of structural features of the phytochrome molecule. PMID:3001642

  18. DNA sequence analysis suggests that cytb-nd1 PCR-RFLP may not be applicable to sandfly species identification throughout the Mediterranean region.

    PubMed

    Llanes-Acevedo, Ivonne Pamela; Arcones, Carolina; Gálvez, Rosa; Martin, Oihane; Checa, Rocío; Montoya, Ana; Chicharro, Carmen; Cruz, Susana; Miró, Guadalupe; Cruz, Israel

    2016-03-01

    Molecular methods are increasingly used for both species identification of sandflies and assessment of their population structure. In general, they are based on DNA sequence analysis of targets previously amplified by PCR. However, this approach requires access to DNA sequence facilities, and in some circumstances, it is time-consuming. Though DNA sequencing provides the most reliable information, other downstream PCR applications are explored to assist in species identification. Thus, it has been recently proposed that the amplification of a DNA region encompassing partially both the cytochrome-B (cytb) and the NADH dehydrogenase 1 (nd1) genes followed by RFLP analysis with the restriction enzyme Ase I allows the rapid identification of the most prevalent species of phlebotomine sandflies in the Mediterranean region. In order to confirm the suitability of this method, we collected, processed, and molecularly analyzed a total of 155 sandflies belonging to four species including Phlebotomus ariasi, P. papatasi, P. perniciosus, and Sergentomyia minuta from different regions in Spain. This data set was completed with DNA sequences available at the GenBank for species prevalent in the Mediterranean basin and the Middle East. Additionally, DNA sequences from 13 different phlebotomine species (P. ariasi, P. balcanicus, P. caucasicus, P. chabaudi, P. chadlii, P. longicuspis, P. neglectus, P. papatasi, P. perfiliewi, P. perniciosus, P. riouxi, P. sergenti, and S. minuta), from 19 countries, were added to the data set. Overall, our molecular data revealed that this PCR-RFLP method does not provide a unique and specific profile for each phlebotomine species tested. Intraspecific variability and similar RFLP patterns were frequently observed among the species tested. Our data suggest that this method may not be applicable throughout the Mediterranean region as previously proposed. Other molecular approaches like DNA barcoding or phylogenetic analyses would allow a more

  19. Purification, characterization and partial amino acid sequence of glycogen synthase from Saccharomyces cerevisiae.

    PubMed Central

    Carabaza, A; Arino, J; Fox, J W; Villar-Palasi, C; Guinovart, J J

    1990-01-01

    Glycogen synthase from Saccharomyces cerevisiae was purified to homogeneity. The enzyme showed a subunit molecular mass of 80 kDa. The holoenzyme appears to be a tetramer. Antibodies developed against purified yeast glycogen synthase inactivated the enzyme in yeast extracts and allowed the detection of the protein in Western blots. Amino acid analysis showed that the enzyme is very rich in glutamate and/or glutamine residues. The N-terminal sequence (11 amino acid residues) was determined. In addition, selected tryptic-digest peptides were purified by reverse-phase h.p.l.c. and submitted to gas-phase sequencing. Up to eight sequences (79 amino acid residues) could be aligned with the human muscle enzyme sequence. Levels of identity range between 37 and 100%, indicating that, although human and yeast glycogen synthases probably share some conserved regions, significant differences in their primary structure should be expected. Images Fig. 1. Fig. 2. Fig. 3. PMID:2114092

  20. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    PubMed

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  1. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations.

    PubMed

    Abascal, Federico; Zardoya, Rafael; Telford, Maximilian J

    2010-07-01

    We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.

  2. A 1.9 Å Crystal Structure of the HDV Ribozyme Precleavage Suggests both Lewis Acid and General Acid Mechanisms Contribute to Phosphodiester Cleavage

    SciTech Connect

    Chen, Jui-Hui; Yajima, Rieko; Chadalavada, Durga M.; Chase, Elaine; Bevilacqua, Philip C.; Golden, Barbara L.

    2010-11-01

    The hepatitis delta virus (HDV) ribozyme and HDV-like ribozymes are self-cleaving RNAs found throughout all kingdoms of life. These RNAs fold into a double-nested pseudoknot structure and cleave RNA, yielding 2{prime},3{prime}-cyclic phosphate and 5{prime}-hydroxyl termini. The active site nucleotide C75 has a pK{sub a} shifted >2 pH units toward neutrality and has been implicated as a general acid/base in the cleavage reaction. An active site Mg{sup 2+} ion that helps activate the 2{prime}-hydroxyl for nucleophilic attack has been characterized biochemically; however, this ion has not been visualized in any previous structures. To create a snapshot of the ribozyme in a state poised for catalysis, we have crystallized and determined the structure of the HDV ribozyme bound to an inhibitor RNA containing a deoxynucleotide at the cleavage site. This structure includes the wild-type C75 nucleotide and Mg{sup 2+} ions, both of which are required for maximal ribozyme activity. This structure suggests that the position of C75 does not change during the cleavage reaction. A partially hydrated Mg{sup 2+} ion is also found within the active site where it interacts with a newly resolved G {center_dot} U reverse wobble. Although the inhibitor exhibits crystallographic disorder, we modeled the ribozyme-substrate complex using the conformation of the inhibitor strand observed in the hammerhead ribozyme. This model suggests that the pro-RP oxygen of the scissile phosphate and the 2{prime}-hydroxyl nucleophile are inner-sphere ligands to the active site Mg{sup 2+} ion. Thus, the HDV ribozyme may use a combination of metal ion Lewis acid and nucleobase general acid strategies to effect RNA cleavage.

  3. Root transcriptomes of two acidic soil adapted Indica rice genotypes suggest diverse and complex mechanism of low phosphorus tolerance.

    PubMed

    Tyagi, Wricha; Rai, Mayank

    2017-03-01

    Low phosphorus (P) tolerance in rice is a biologically and agronomically important character. Low P tolerant Indica-type rice genotypes, Sahbhagi Dhan (SD) and Chakhao Poreiton (CP), are adapted to acidic soils and show variable response to low P levels. Using RNAseq approach, transcriptome data was generated from roots of SD and CP after 15 days of low P treatment to understand differences and similarities at molecular level. In response to low P, number of genes up-regulated (1318) was more when compared with down-regulated genes (761). Eight hundred twenty-one genes found to be significantly regulated between SD and CP in response to low P. De novo assembly using plant database led to further identification of 1535 novel transcripts. Functional annotation of significantly expressed genes suggests two distinct methods of low P tolerance. While root system architecture in SD works through serine-threonine kinase PSTOL1, suberin-mediated cell wall modification seems to be key in CP. The transcription data indicated that CP relies more on releasing its internally bound Pi and coping with low P levels by transcriptional and translational modifications and using dehydration response-based signals. Role of P transporters seems to be vital in response to low P in CP while sugar- and auxin-mediated pathway seems to be preferred in SD. At least six small RNA clusters overlap with transcripts highly expressed under low P, suggesting role of RNA super clusters in nutrient response in plants. These results help us to understand and thereby devise better strategy to enhance low P tolerance in Indica-type rice.

  4. Amino acid sequence of homologous rat atrial peptides: natriuretic activity of native and synthetic forms.

    PubMed Central

    Seidah, N G; Lazure, C; Chrétien, M; Thibault, G; Garcia, R; Cantin, M; Genest, J; Nutt, R F; Brady, S F; Lyle, T A

    1984-01-01

    A substance called atrial natriuretic factor (ANF), localized in secretory granules of atrial cardiocytes, was isolated as four homologous natriuretic peptides from homogenates of rat atria. The complete sequence of the longest form showed that it is composed of 33 amino acids. The three other shorter forms (2-33, 3-33, and 8-33) represent amino-terminally truncated versions of the 33 amino acid parent molecule as shown by analysis of sequence, amino acid composition, or both. The proposed primary structure agrees entirely with the amino acid composition and reveals no significant sequence homology with any known protein or segment of protein. The short form ANF-(8-33) was synthesized by a multi-fragment condensation approach and the synthetic product was shown to exhibit specific activity comparable to that of the natural ANF-(3-33). PMID:6232612

  5. Nucleotide and deduced amino acid sequences of a new subtilisin from an alkaliphilic Bacillus isolate.

    PubMed

    Saeki, Katsuhisa; Magallones, Marietta V; Takimura, Yasushi; Hatada, Yuji; Kobayashi, Tohru; Kawai, Shuji; Ito, Susumu

    2003-10-01

    The gene for a new subtilisin from the alkaliphilic Bacillus sp. KSM-LD1 was cloned and sequenced. The open reading frame of the gene encoded a 97 amino-acid prepro-peptide plus a 307 amino-acid mature enzyme that contained a possible catalytic triad of residues, Asp32, His66, and Ser224. The deduced amino acid sequence of the mature enzyme (LD1) showed approximately 65% identity to those of subtilisins SprC and SprD from alkaliphilic Bacillus sp. LG12. The amino acid sequence identities of LD1 to those of previously reported true subtilisins and high-alkaline proteases were below 60%. LD1 was characteristically stable during incubation with surfactants and chemical oxidants. Interestingly, an oxidizable Met residue is located next to the catalytic Ser224 of the enzyme as in the cases of the oxidation-susceptible subtilisins reported to date.

  6. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    PubMed

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  7. Phylogenetic relationships of Irkut and West Caucasian bat viruses within the Lyssavirus genus and suggested quantitative criteria based on the N gene sequence for lyssavirus genotype definition.

    PubMed

    Kuzmin, Ivan V; Hughes, Gareth J; Botvinkin, Alexandr D; Orciari, Lillian A; Rupprecht, Charles E

    2005-07-01

    The nucleoprotein (N), phosphoprotein (P) and glycoprotein (G) genes of Irkut and West Caucasian bat viruses (WCBV) were sequenced and compared with those of other lyssaviruses. N gene nucleotide identities provided unequivocal separation of all lyssavirus genotypes with an identity threshold of 82%. On this basis, Irkut virus should be considered as a new genotype with particular relatedness to genotypes 4 and 5 (78.0-78.6% identity for N gene nucleotides and 90.4-92.6% for amino acids). Furthermore, genotypes 4-6, together with Aravan, Khujand and Irkut viruses, present a solid phylogroup of Old World bat lyssaviruses. This relationship is apparent using all three viral genes, and causes overlap between intragenotype and intergenotype identities for the P gene (Aravan, Khujand viruses and genotype 6) and for the G gene (Aravan, Khujand, genotypes 5 and 6). WCBV is the most divergent of known lyssaviruses with only limited relatedness to genotypes 2 and 3.

  8. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A)tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  9. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    PubMed

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  10. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    PubMed

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only DM20 isoform is expressed in Latimeria. The structure of proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  11. "De-novo" amino acid sequence elucidation of protein G'e by combined "Top-Down" and "Bottom-Up" mass spectrometry

    NASA Astrophysics Data System (ADS)

    Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F. M.; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L.; Glocker, Michael O.

    2015-03-01

    Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein Ǵ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α- N-gluconoylation and α- N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α- N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant ( K d ) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.

  12. Purification of a marsupial insulin: amino-acid sequence of insulin from the eastern grey kangaroo Macropus giganteus.

    PubMed

    Treacy, G B; Shaw, D C; Griffiths, M E; Jeffrey, P D

    1989-03-24

    Insulin has been purified from kangaroo pancreas by acidic ethanol extraction, diethyl ether precipitation and gel filtration. The amino-acid sequence of this, the first marsupial insulin to be studied, is reported. It differs from human insulin by only four amino-acid substitutions, all in regions of the molecule previously known to be variable. However, it should be noted that one of these, asparagine for threonine at A8, has not been reported before. Computer comparisons of all 43 insulin sequences reported to date with kangaroo insulin show it to be most closely related to a group of mammalian insulins (dog, pig, cow, human) known to be of high biological potency. The measurement of blood glucose lowering in the rabbit by kangaroo insulin is consistent with this conclusion. Comparisons of amino-acid sequences of other proteins with their kangaroo counterparts show a greater difference, in line with the time of divergence of marsupials. The limited differences observed in insulin and cytochrome c suggest that their structures need to be closely conserved in order to maintain function.

  13. The amino acid sequence of Ole e I, the major allergen from olive tree (Olea europaea) pollen.

    PubMed

    Villalba, M; Batanero, E; López-Otín, C; Sánchez, L M; Monsalve, R I; González de la Peña, M A; Lahoz, C; Rodríguez, R

    1993-09-15

    The complete primary structure of the major allergen from Olea europaea (olive tree) pollen, Ole e I (IUIS nomenclature), has been determined. The amino acid sequence was established by automated Edman degradation of the reduced and alkylated molecule as well as of selected fragments obtained by proteolytic digestions. Ole e I contains a single polypeptide chain of 145 amino acid residues with a calculated molecular mass of 16331 Da. No free sulfhydryl groups have been detected in the native protein. The molecule contains a putative glycosylation site. A high degree of microheterogeneity has been observed, mainly centered in the first 33% of the molecule. Comparison of Ole e I sequence with protein sequence databases showed no similarity with other known allergens. However, it has a 36% and 38% sequence identity with the putative polypeptide structures, deduced, respectively, from nucleotide sequences of genes isolated from tomato anthers and corn pollen, which have been suggested to be involved in the growing of the pollen tube. Therefore, the olive tree allergen may be a constitutive protein of the pollen involved in reproductive functions.

  14. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    PubMed

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified-one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.

  15. Classification of mouse VK groups based on the partial amino acid sequence to the first invariant tryptophan: impact of 14 new sequences from IgG myeloma proteins.

    PubMed

    Potter, M; Newell, J B; Rudikoff, S; Haber, E

    1982-12-01

    Fourteen new VK sequences derived from BALB/c IgG myeloma proteins were determined to the first invariant tryptophan (Trp 35). These partial sequences were compared with 65 other published VK sequences using a computer program. The 79 sequences were organized according to the length of the sequence from the amino terminus to the first invariant tryptophan (Trp 35), into seven groups (33, 34, 35, 36, 39, 40 and 41aa). A distance matrix of all 79 sequences was then computed, i.e. the number of amino acid substitutions necessary to convert one sequence to another was determined. From these data a dendrogram was constructed. Most of the VK sequences fell into clusters or closely related groups. The definition of a sequence group is arbitrary but facilitates the classification of VK proteins. We used 12 substitutions as the basis for defining a sequence group based on the known number of substitutions that are found in the VK21 proteins. By this criterion there were 18 groups in the Trp 35 dendrogram. Twelve of the 14 new sequences fell into one of these sequence groups; two formed new sequence groups. Collective amino acid sequencing is still encountering new VK structures indicating more sequences will be required to attain an accurate estimate of the total number of VK groups. Updated dendrograms can be quickly generated to include newly generated sequences.

  16. Differential Proteomic Analysis of Platelets Suggested Possible Signal Cascades Network in Platelets Treated with Salvianolic Acid B

    PubMed Central

    Ma, Chao; Yao, Yan; Yue, Qing-Xi; Zhou, Xin-Wen; Yang, Peng-Yuan; Wu, Wan-Ying; Guan, Shu-Hong; Jiang, Bao-Hong; Yang, Min; Liu, Xuan; Guo, De-An

    2011-01-01

    Background Salvianolic acid B (SB) is an active component isolated from Danshen, a traditional Chinese medicine widely used for the treatment of cardiovascular disorders. Previous study suggested that SB might inhibit adhesion as well as aggregation of platelets by a mechanism involving the integrin α2β1. But, the signal cascades in platelets after SB binding are still not clear. Methodology/Principal Findings In the present study, a differential proteomic analysis (two-dimensional electrophoresis) was conducted to check the protein expression profiles of rat platelets with or without treatment of SB. Proteins altered in level after SB exposure were identified by MALDI-TOF MS/MS. Treatment of SB caused regulation of 20 proteins such as heat shock-related 70 kDa protein 2 (hsp70), LIM domain protein CLP-36, copine I, peroxiredoxin-2, coronin-1 B and cytoplasmic dynein intermediate chain 2C. The regulation of SB on protein levels was confirmed by Western blotting. The signal cascades network induced by SB after its binding with integrin α2β1 was predicted. To certify the predicted network, binding affinity of SB to integrin α2β1 was checked in vitro and ex vivo in platelets. Furthermore, the effects of SB on protein levels of hsp70, coronin-1B and intracellular levels of Ca(2+) and reactive oxygen species (ROS) were checked with or without pre-treatment of platelets using antibody against integrin α2β1. Electron microscopy study confirmed that SB affected cytoskeleton structure of platelets. Conclusions/Significance Integrin α2β1 might be one of the direct target proteins of SB in platelets. The signal cascades network of SB after binding with integrin α2β1 might include regulation of intracellular Ca(2+) level, cytoskeleton-related proteins such as coronin-1B and cytoskeleton structure of platelets. PMID:21379382

  17. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  18. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  19. Differentiation of acetic acid bacteria based on sequence analysis of 16S-23S rRNA gene internal transcribed spacer sequences.

    PubMed

    González, Angel; Mas, Albert

    2011-06-30

    The 16S-23S gene internal transcribed spacer sequence of sixty-four strains belonging to different acetic acid bacteria genera were analyzed, and phylogenetic trees were generated for each genera. The topologies of the different trees were in accordance with the 16S rRNA gene trees, although the similarity percentages obtained between the species was shown to be much lower. These values suggest the usefulness of including the 16S-23S gene internal transcribed spacer region as a part of the polyphasic approach required for the further classification of acetic acid bacteria. Furthermore, the region could be a good target for primer and probe design. It has also been validated for use in the identification of unknown samples of this bacterial group from wine vinegar and fruit condiments.

  20. Amino acid sequence around the active-site serine residue in the acyltransferase domain of goat mammary fatty acid synthetase.

    PubMed Central

    Mikkelsen, J; Højrup, P; Rasmussen, M M; Roepstorff, P; Knudsen, J

    1985-01-01

    Goat mammary fatty acid synthetase was labelled in the acyltransferase domain by formation of O-ester intermediates by incubation with [1-14C]acetyl-CoA and [2-14C]malonyl-CoA. Tryptic-digest and CNBr-cleavage peptides were isolated and purified by high-performance reverse-phase and ion-exchange liquid chromatography. The sequences of the malonyl- and acetyl-labelled peptides were shown to be identical. The results confirm the hypothesis that both acetyl and malonyl groups are transferred to the mammalian fatty acid synthetase complex by the same transferase. The sequence is compared with those of other fatty acid synthetase transferases. PMID:3922356

  1. Ligation with nucleic acid sequence-based amplification.

    PubMed

    Ong, Carmichael; Tai, Warren; Sarma, Aartik; Opal, Steven M; Artenstein, Andrew W; Tripathi, Anubhav

    2012-01-01

    This work presents a novel method for detecting nucleic acid targets using a ligation step along with an isothermal, exponential amplification step. We use an engineered ssDNA with two variable regions on the ends, allowing us to design the probe for optimal reaction kinetics and primer binding. This two-part probe is ligated by T4 DNA Ligase only when both parts bind adjacently to the target. The assay demonstrates that the expected 72-nt RNA product appears only when the synthetic target, T4 ligase, and both probe fragments are present during the ligation step. An extraneous 38-nt RNA product also appears due to linear amplification of unligated probe (P3), but its presence does not cause a false-positive result. In addition, 40 mmol/L KCl in the final amplification mix was found to be optimal. It was also found that increasing P5 in excess of P3 helped with ligation and reduced the extraneous 38-nt RNA product. The assay was also tested with a single nucleotide polymorphism target, changing one base at the ligation site. The assay was able to yield a negative signal despite only a single-base change. Finally, using P3 and P5 with longer binding sites results in increased overall sensitivity of the reaction, showing that increasing ligation efficiency can improve the assay overall. We believe that this method can be used effectively for a number of diagnostic assays.

  2. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    PubMed

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered.

  3. Thin-film technology for direct visual detection of nucleic acid sequences: applications in clinical research.

    PubMed

    Jenison, Robert D; Bucala, Richard; Maul, Diana; Ward, David C

    2006-01-01

    Certain optical conditions permit the unaided eye to detect thickness changes on surfaces on the order of 20 A, which are of similar dimensions to monomolecular interactions between proteins or hybridization of complementary nucleic acid sequences. Such detection exploits specific interference of reflected white light, wherein thickness changes are perceived as surface color changes. This technology, termed thin-film detection, allows for the visualization of subattomole amounts of nucleic acid targets, even in complex clinical samples. Thin-film technology has been applied to a broad range of clinically relevant indications, including the detection of pathogenic bacterial and viral nucleic acid sequences and the discrimination of sequence variations in human genes causally related to susceptibility or severity of disease.

  4. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  5. RNA internal standard synthesis by nucleic acid sequence-based amplification for competitive quantitative amplification reactions.

    PubMed

    Lo, Wan-Yu; Baeumner, Antje J

    2007-02-15

    Nucleic acid sequence-based amplification (NASBA) reactions have been demonstrated to successfully synthesize new sequences based on deletion and insertion reactions. Two RNA internal standards were synthesized for use in competitive amplification reactions in which quantitative analysis can be achieved by coamplifying the internal standard with the wild type sample. The sequences were created in two consecutive NASBA reactions using the E. coli clpB mRNA sequence as model analyte. The primer sequences of the wild type sequence were maintained, and a 20-nt-long segment inside the amplicon region was exchanged for a new segment of similar GC content and melting temperature. The new RNA sequence was thus amplifiable using the wild type primers and detectable via a new inserted sequence. In the first reaction, the forwarding primer and an additional 20-nt-long sequence was deleted and replaced by a new 20-nt-long sequence. In the second reaction, a forwarding primer containing as 5' overhang sequence the wild type primer sequence was used. The presence of pure internal standard was verified using electrochemiluminescence and RNA lateral-flow biosensor analysis. Additional sequence deletion in order to shorten the internal standard amplicons and thus generate higher detection signals was found not to be required. Finally, a competitive NASBA reaction between one internal standard and the wild type sequence was carried out proving its functionality. This new rapid construction method via NASBA provides advantages over the traditional techniques since it requires no traditional cloning procedures, no thermocyclers, and can be completed in less than 4 h.

  6. Cloning, sequence analysis, and expression in Escherichia coli of the gene encoding an alpha-amino acid ester hydrolase from Acetobacter turbidans.

    PubMed

    Polderman-Tijmes, Jolanda J; Jekel, Peter A; de Vries, Erik J; van Merode, Annet E J; Floris, René; van der Laan, Jan-Metske; Sonke, Theo; Janssen, Dick B

    2002-01-01

    The alpha-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing beta-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified alpha-amino acid ester hydrolase allowed cloning and genetic characterization of the corresponding gene from an A. turbidans genomic library. The gene, designated aehA, encodes a polypeptide with a molecular weight of 72,000. Comparison of the determined N-terminal sequence and the deduced amino acid sequence indicated the presence of an N-terminal leader sequence of 40 amino acids. The aehA gene was subcloned in the pET9 expression plasmid and expressed in Escherichia coli. The recombinant protein was purified and found to be dimeric with subunits of 70 kDa. A sequence similarity search revealed 26% identity with a glutaryl 7-ACA acylase precursor from Bacillus laterosporus, but no homology was found with other known penicillin or cephalosporin acylases. There was some similarity to serine proteases, including the conservation of the active site motif, GXSYXG. Together with database searches, this suggested that the alpha-amino acid ester hydrolase is a beta-lactam antibiotic acylase that belongs to a class of hydrolases that is different from the Ntn hydrolase superfamily to which the well-characterized penicillin acylase from E. coli belongs. The alpha-amino acid ester hydrolase of A. turbidans represents a subclass of this new class of beta-lactam antibiotic acylases.

  7. Sequencing of IncX-plasmids suggests ubiquity of mobile forms of a biofilm-promoting gene cassette recruited from Klebsiella pneumoniae.

    PubMed

    Burmølle, Mette; Norman, Anders; Sørensen, Søren J; Hansen, Lars Hestbjerg

    2012-01-01

    Plasmids are a highly effective means with which genetic traits that influence human health, such as virulence and antibiotic resistance, are disseminated through bacterial populations. The IncX-family is a hitherto sparsely populated group of plasmids that are able to thrive within Enterobacteriaceae. In this study, a replicon-centric screening method was used to locate strains from wastewater sludge containing plasmids belonging to the IncX-family. A transposon aided plasmid capture method was then employed to transport IncX-plasmids from their original hosts (and co-hosted plasmids) into a laboratory strain (Escherichia coli Genehogs®) for further study. The nucleotide sequences of the three newly isolated IncX-plasmids (pLN126_33, pMO17_54, pMO440_54) and the hitherto un-sequenced type-plasmid R485 revealed a remarkable occurrence of whole or partial gene cassettes that promote biofilm-formation in Klebsiella pneumonia or E. coli, in all four instances. Two of the plasmids (R485 and pLN126_33) were shown to directly induce biofilm formation in a crystal violet retention assay in E. coli. Sequence comparison revealed that all plasmid-borne forms of the type 3 fimbriae encoding gene cassette mrkABCDF were variations of a composite transposon Tn6011 first described in the E. coli IncX plasmid pOLA52. In conclusion, IncX-plasmids isolated from Enterobacteriaceae over almost 40 years and on three different continents have all been shown to carry a type 3 fimbriae gene cassette mrkABCDF stemming from pathogenic K. pneumoniae. Apart from contributing general knowledge about IncX-plasmids, this study also suggests an apparent ubiquity of a mobile form of an important virulence factor and is an illuminating example of the recruitment, evolution and dissemination of genetic traits through plasmid-mediated horizontal gene transfer.

  8. Transgenic resistance in potato plants expressing potato leaf roll virus (PLRV) replicase gene sequences is RNA-mediated and suggests the involvement of post-transcriptional gene silencing.

    PubMed

    Vazquez Rovere, C; Asurmendi, S; Hopp, H E

    2001-07-01

    Genetically engineered expression of replicase encoding sequences has been proposed as an efficient system to confer protection against virus diseases by eliciting protection mechanisms in the plant. Potato leaf-roll was one of the first diseases for which this kind of protection was engineered in potato plants. However, details of the protecting mechanism were not reported, so far. The ORF2b of an Argentinean strain of PLRV was cloned and sequenced finding 94% and 97% of homology with Australian and Dutch strains, respectively. To elucidate the mechanism of protection against PLRV infection, three versions of ORF2b (non-translatable sense, translatable sense with an engineered ATG and antisense) were constructed under the control of the 35S CaMV promoter and the nos terminator and introduced in potato plants (cv. Kennebec) by Agrobacterium tumefaciens-mediated transformation. Grafting infection experiments showed that resistant transgenic plants could be obtained with any of the constructs, suggesting that the mechanism of protection is independent of the expression of protein and is RNA mediated. Field trial infection confirmed that resistant transgenic events were obtained. Biolistic transient transformation experiments of leaves derived from transgenic plants using a gene coding for the fusion protein GUS-ORF2b, followed by scoring of the number of GUS expressing leaf spots, supported that the protection is mediated by a post-transcriptional gene silencing mechanism.

  9. Genome-wide DNA methylation profiling by modified reduced representation bisulfite sequencing in Brassica rapa suggests that epigenetic modifications play a key role in polyploid genome evolution

    PubMed Central

    Chen, Xun; Ge, Xianhong; Wang, Jing; Tan, Chen; King, Graham J.; Liu, Kede

    2015-01-01

    Brassica rapa includes some of the most important vegetables worldwide as well as oilseed crops. The complete annotated genome sequence confirmed its paleohexaploid origins and provides opportunities for exploring the detailed process of polyploid genome evolution. We generated a genome-wide DNA methylation profile for B. rapa using a modified reduced representation bisulfite sequencing (RRBS) method. This sampling represented 2.24% of all CG loci (2.5 × 105), 2.16% CHG (2.7 × 105), and 1.68% CHH loci (1.05 × 105) (where H = A, T, or C). Our sampling of DNA methylation in B. rapa indicated that 52.4% of CG sites were present as 5mCG, with 31.8% of CHG and 8.3% of CHH. It was found that genic regions of single copy genes had significantly higher methylation compared to those of two or three copy genes. Differences in degree of genic DNA methylation were observed in a hierarchical relationship corresponding to the relative age of the three ancestral subgenomes, primarily accounted by single-copy genes. RNA-seq analysis revealed that overall the level of transcription was negatively correlated with mean gene methylation content and depended on copy number or was associated with the different subgenomes. These results provide new insights into the role epigenetic variation plays in polyploid genome evolution, and suggest an alternative mechanism for duplicate gene loss. PMID:26500672

  10. Amino acid sequences of two nonspecific lipid-transfer proteins from germinated castor bean.

    PubMed

    Takishima, K; Watanabe, S; Yamada, M; Suga, T; Mamiya, G

    1988-11-01

    The amino acid sequence of two nonspecific lipid-transfer proteins (nsLTP) B and C from germinated castor bean seeds have been determined. Both the proteins consist of 92 residues, as for nsLTP previously reported, and their calculated Mr values are 9847 and 9593 for nsLTP-B and nsLTP-C, respectively. The sequences of nsLTP-B and nsLTP-C, compared to the known sequence of nsLTP-A from the same source, are 68% and 35% similar, respectively. No variation was found at the positions of the cysteine residues, indicating that they might be involved in disulfide bridges.

  11. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species of which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocite enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences.

  12. Complete amino acid sequence of the N-terminal extension of calf skin type III procollagen.

    PubMed Central

    Brandt, A; Glanville, R W; Hörlein, D; Bruckner, P; Timpl, R; Fietzek, P P; Kühn, K

    1984-01-01

    The N-terminal extension peptide of type III procollagen, isolated from foetal-calf skin, contains 130 amino acid residues. To determine its amino acid sequence, the peptide was reduced and carboxymethylated or aminoethylated and fragmented with trypsin, Staphylococcus aureus V8 proteinase and bacterial collagenase. Pyroglutamate aminopeptidase was used to deblock the N-terminal collagenase fragment to enable amino acid sequencing. The type III collagen extension peptide is homologous to that of the alpha 1 chain of type I procollagen with respect to a three-domain structure. The N-terminal 79 amino acids, which contain ten of the 12 cysteine residues, form a compact globular domain. The next 39 amino acids are in a collagenase triplet sequence (Gly- Xaa - Yaa )n with a high hydroxyproline content. Finally, another short non-collagenous domain of 12 amino acids ends at the cleavage site for procollagen aminopeptidase, which cleaves a proline-glutamine bond. In contrast with type I procollagen, the type III procollagen extension peptides contain interchain disulphide bridges located at the C-terminus of the triple-helical domain. PMID:6331392

  13. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  14. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  15. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  16. Complete amino acid sequence of branched-chain amino acid aminotransferase (transaminase B) of Salmonella typhimurium, identification of the coenzyme-binding site and sequence comparison analysis

    SciTech Connect

    Feild, M.J.

    1988-01-01

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase of Salmonella typhimurium was determined by automated Edman degradation of peptide fragments generated by chemical and enzymatic digestion of S-carboxymethylated and S-pyridylethylated transaminase B. Peptide fragments of transaminase B were generated by treatment of the enzyme with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. Protocols were developed for separation of the peptide fragments by reverse-phase high performance liquid chromatography (HPLC), ion-exchange HPLC, and SDS-urea gel electrophoresis. The enzyme subunit contains 308 amino acid residues and has a molecular weight of 33,920 daltons. The coenzyme-binding site was determined by treatment of the enzyme, containing bound pyridoxal 5-phosphate, with tritiated sodium borohydride prior to trypsin digestion. Monitoring radioactivity incorporation and peptide map comparisons with an apoenzyme tryptic digest, allowed identification of the pyridoxylated-peptide which was isolated by reverse-phase HPLC and sequenced. The coenzyme-binding site is a lysyl residue at position 159. Some peptides were further characterized by fast atom bombardment mass spectrometry.

  17. The amino acid sequence of cytochromes c-551 from three species of Pseudomonas

    PubMed Central

    Ambler, R. P.; Wynn, Margaret

    1973-01-01

    The amino acid sequences of the cytochromes c-551 from three species of Pseudomonas have been determined. Each resembles the protein from Pseudomonas strain P6009 (now known to be Pseudomonas aeruginosa, not Pseudomonas fluorescens) in containing 82 amino acids in a single peptide chain, with a haem group covalently attached to cysteine residues 12 and 15. In all four sequences 43 residues are identical. Although by bacteriological criteria the organisms are closely related, the differences between pairs of sequences range from 22% to 39%. These values should be compared with the differences in the sequence of mitochondrial cytochrome c between mammals and amphibians (about 18%) or between mammals and insects (about 33%). Detailed evidence for the amino acid sequences of the proteins has been deposited as Supplementary Publication SUP 50015 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973), 131, 5. PMID:4352718

  18. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    PubMed Central

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    ABSTRACT The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  19. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein.

  20. Draft Genome Sequence of Bacillus coagulans NL01, a Wonderful l-Lactic Acid Producer

    PubMed Central

    Zheng, Zhaojuan; Jiang, Ting; Lin, Xi; Zhou, Jie

    2015-01-01

    Here, we report the draft genome sequence of Bacillus coagulans NL01, which could produce high optically pure l-lactic acid using xylose as a sole carbon source. The draft genome is 3,505,081 bp, with 144 contigs. About 3,903 protein-coding genes and 92 rRNAs are predicted from this assembly. PMID:26089419

  1. Defining sequence space and reaction products within the cyanuric acid hydrolase (AtzD)/barbiturase protein family.

    PubMed

    Seffernick, Jennifer L; Erickson, Jasmine S; Cameron, Stephan M; Cho, Seunghee; Dodge, Anthony G; Richman, Jack E; Sadowsky, Michael J; Wackett, Lawrence P

    2012-09-01

    Cyanuric acid hydrolases (AtzD) and barbiturases are homologous, found almost exclusively in bacteria, and comprise a rare protein family with no discernible linkage to other protein families or an X-ray structural class. There has been confusion in the literature and in genome projects regarding the reaction products, the assignment of individual sequences as either cyanuric acid hydrolases or barbiturases, and spurious connection of this family to another protein family. The present study has addressed those issues. First, the published enzyme reaction products of cyanuric acid hydrolase are incorrectly identified as biuret and carbon dioxide. The current study employed (13)C nuclear magnetic resonance (NMR) spectroscopy and mass spectrometry to show that cyanuric acid hydrolase releases carboxybiuret, which spontaneously decarboxylates to biuret. This is significant because it revealed that homologous cyanuric acid hydrolases and barbiturases catalyze completely analogous reactions. Second, enzymes that had been annotated incorrectly in genome projects have been reassigned here by bioinformatics, gene cloning, and protein characterization studies. Third, the AtzD/barbiturase family has previously been suggested to consist of members of the amidohydrolase superfamily, a large class of metallohydrolases. Bioinformatics and the lack of bound metals both argue against a connection to the amidohydrolase superfamily. Lastly, steady-state kinetic measurements and observations of protein stability suggested that the AtzD/barbiturase family might be an undistinguished protein family that has undergone some resurgence with the recent introduction of industrial s-triazine compounds such as atrazine and melamine into the environment.

  2. Multilocus sequence analysis of the marine bacterial genus Tenacibaculum suggests parallel evolution of fish pathogenicity and endemic colonization of aquaculture systems.

    PubMed

    Habib, Christophe; Houel, Armel; Lunazzi, Aurélie; Bernardet, Jean-François; Olsen, Anne Berit; Nilsen, Hanne; Toranzo, Alicia E; Castro, Nuria; Nicolas, Pierre; Duchaud, Eric

    2014-09-01

    The genus Tenacibaculum, a member of the family Flavobacteriaceae, is an abundant component of marine bacterial ecosystems that also hosts several fish pathogens, some of which are of serious concern for marine aquaculture. Here, we applied multilocus sequence analysis (MLSA) to 114 representatives of most known species in the genus and of the worldwide diversity of the major fish pathogen Tenacibaculum maritimum. Recombination hampers precise phylogenetic reconstruction, but the data indicate intertwined environmental and pathogenic lineages, which suggests that pathogenicity evolved independently in several species. At lower phylogenetic levels recombination is also important, and the species T. maritimum constitutes a cohesive group of isolates. Importantly, the data reveal no trace of long-distance dissemination that could be linked to international fish movements. Instead, the high number of distinct genotypes suggests an endemic distribution of strains. The MLSA scheme and the data described in this study will help in monitoring Tenacibaculum infections in marine aquaculture; we show, for instance, that isolates from tenacibaculosis outbreaks in Norwegian salmon farms are related to T. dicentrarchi, a recently described species.

  3. Multilocus Sequence Analysis of the Marine Bacterial Genus Tenacibaculum Suggests Parallel Evolution of Fish Pathogenicity and Endemic Colonization of Aquaculture Systems

    PubMed Central

    Habib, Christophe; Houel, Armel; Lunazzi, Aurélie; Bernardet, Jean-François; Olsen, Anne Berit; Nilsen, Hanne; Toranzo, Alicia E.; Castro, Nuria; Nicolas, Pierre

    2014-01-01

    The genus Tenacibaculum, a member of the family Flavobacteriaceae, is an abundant component of marine bacterial ecosystems that also hosts several fish pathogens, some of which are of serious concern for marine aquaculture. Here, we applied multilocus sequence analysis (MLSA) to 114 representatives of most known species in the genus and of the worldwide diversity of the major fish pathogen Tenacibaculum maritimum. Recombination hampers precise phylogenetic reconstruction, but the data indicate intertwined environmental and pathogenic lineages, which suggests that pathogenicity evolved independently in several species. At lower phylogenetic levels recombination is also important, and the species T. maritimum constitutes a cohesive group of isolates. Importantly, the data reveal no trace of long-distance dissemination that could be linked to international fish movements. Instead, the high number of distinct genotypes suggests an endemic distribution of strains. The MLSA scheme and the data described in this study will help in monitoring Tenacibaculum infections in marine aquaculture; we show, for instance, that isolates from tenacibaculosis outbreaks in Norwegian salmon farms are related to T. dicentrarchi, a recently described species. PMID:24973065

  4. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    PubMed

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large scale preparation made possible determination of the full length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissue. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  5. Amino acid sequence of myoglobin from white-tailed deer (Odocoileus virginianus).

    PubMed

    Joseph, Poulson; Suman, Surendranath P; Li, Shuting; Fontaine, Michele; Steinke, Laurey

    2012-10-01

    Our objective was to determine the primary structure of white-tailed deer myoglobin (Mb). White-tailed deer Mb was isolated from cardiac muscles employing ammonium sulfate precipitation and gel-filtration chromatography. The amino acid sequence was determined by Edman degradation. Sequence analyses of intact Mb as well as tryptic- and cyanogen bromide-peptides yielded the complete primary structure of white-tailed deer Mb, which shared 100% similarity with red deer Mb. White-tailed deer Mb consists of 153 amino acid residues and shares more than 96% sequence similarity with myoglobins from meat-producing ruminants, such as cattle, buffalo, sheep, and goat. Similar to sheep and goat myoglobins, white-tailed deer Mb contains 12 histidine residues. Proximal (position 93) and distal (position 64) histidine residues responsible for maintaining the stability of heme are conserved in white-tailed deer Mb.

  6. Binding of α,α-Disubstituted Amino Acids to Arginase Suggests New Avenues for Inhibitor Design1

    PubMed Central

    Ilies, Monica; Di Costanzo, Luigi; Dowling, Daniel P.; Thorn, Katherine J.; Christianson, David W.

    2011-01-01

    Arginase is a binuclear manganese metalloenzyme that hydrolyzes L-arginine to form L-ornithine and urea, and aberrant arginase activity is implicated in various diseases such as erectile dysfunction, asthma, atherosclerosis, and cerebral malaria. Accordingly, arginase inhibitors may be therapeutically useful. Continuing our efforts to expand the chemical space of arginase inhibitor design, and inspired by the binding of 2-(difluoromethyl)-L-ornithine to human arginase I, we now report the first study of the binding of α,α-disubstituted amino acids to arginase. Specifically, we report the design, synthesis, and assay of racemic 2-amino-6-borono-2- methylhexanoic acid and racemic 2-amino-6-borono-2-(difluoromethyl)hexanoic acid. X-ray crystal structures of human arginase I and Plasmodium falciparum arginase complexed with these inhibitors reveal the exclusive binding of the L-stereoisomer; the additional α-substituent of each inhibitor is readily accommodated and makes new intermolecular interactions in the outer active site of each enzyme. Therefore, this work highlights a new region of the protein surface that can be targeted for additional affinity interactions, as well as the first comparative structural insights on inhibitor discrimination between a human and a parasitic arginase. PMID:21728378

  7. Molecular cloning, nucleotide sequence, and abscisic acid induction of a suberization-associated highly anionic peroxidase.

    PubMed

    Roberts, E; Kolattukudy, P E

    1989-06-01

    A highly anionic peroxidase induced in suberizing cells was suggested to be the key enzyme involved in polymerization of phenolic monomers to generate the aromatic matrix of suberin. The enzyme encoded by a potato cDNA was found to be highly homologous to the anionic peroxidase induced in suberizing tomato fruit. A tomato genomic library was screened using the potato anionic peroxidase cDNA and one genomic clone was isolated that contained two tandemly oriented anionic peroxidase genes. These genes were sequenced and were 96% and 87% identical to the mRNA for potato anionic peroxidase. Both genes consist of three exons with the relative positions of their two introns being conserved between the two genes. Primer extension analysis showed that only one of the genes is expressed in the periderm of 3 day wound-healed tomato fruits. Southern blot analyses suggested that there are two copies each of the two highly homologous genes per haploid genome in both potato and tomato. Abscisic acid (ABA) induced the accumulation of the anionic peroxidase transcripts in potato and tomato callus tissues. Northern blots showed that peroxidase mRNA was detectable at 2 days and was maximal at 8 days after transfer of potato callus to solid agar media containing 10(-4) M ABA. The transcripts induced by ABA in both potato and tomato callus were identical in size to those induced in wound-healing potato tuber and tomato fruit. The anionic peroxidase peptide was detected in extracts of potato callus grown on the ABA-containing media by western blot analysis. The results support the suggestion that stimulation of suberization by ABA involves the induction of the highly anionic peroxidase.

  8. Nucleotide sequence and the encoded amino acids of human apolipoprotein A-I mRNA.

    PubMed Central

    Law, S W; Brewer, H B

    1984-01-01

    The cDNA clones encoding the precursor form of human liver apolipoprotein A-I (apoA-I), preproapoA-I, have been isolated from a cDNA library. A 17-base synthetic oligonucleotide based on residues 108-113 of apoA-I and a 26-base primer-extended, dideoxynucleotide-terminated cDNA were used as hybridization probes to select for recombinant plasmids bearing the apoA-I sequence. The complete nucleic acid sequence of human liver preproapoA-I has been determined by analysis of the cloned cDNA. The sequence is composed of 801 nucleotides encoding 267 amino acid residues. PreproapoA-I contains an 18-amino-acid prepeptide and a 6-amino-acid propeptide connected to the amino terminus of the 243-amino acid mature apoA-I. Southern blotting analysis of chromosomal DNA obtained from peripheral blood indicated the apoA-I gene is contained in a 2.1-kilobase-pair Pst I fragment and there is no gross difference in structural organization between the normal apoA-I gene and the Tangier disease apoA-I gene. Images PMID:6198645

  9. Complete Plastid Genome Sequencing of Trochodendraceae Reveals a Significant Expansion of the Inverted Repeat and Suggests a Paleogene Divergence between the Two Extant Species

    PubMed Central

    Sun, Yan-xia; Moore, Michael J.; Meng, Ai-ping; Soltis, Pamela S.; Soltis, Douglas E.; Li, Jian-qiang; Wang, Heng-chang

    2013-01-01

    The early-diverging eudicot order Trochodendrales contains only two monospecific genera, Tetracentron and Trochodendron. Although an extensive fossil record indicates that the clade is perhaps 100 million years old and was widespread throughout the Northern Hemisphere during the Paleogene and Neogene, the two extant genera are both narrowly distributed in eastern Asia. Recent phylogenetic analyses strongly support a clade of Trochodendrales, Buxales, and Gunneridae (core eudicots), but complete plastome analyses do not resolve the relationships among these groups with strong support. However, plastid phylogenomic analyses have not included data for Tetracentron. To better resolve basal eudicot relationships and to clarify when the two extant genera of Trochodendrales diverged, we sequenced the complete plastid genome of Tetracentron sinense using Illumina technology. The Tetracentron and Trochodendron plastomes possess the typical gene content and arrangement that characterize most angiosperm plastid genomes, but both genomes have the same unusual ∼4 kb expansion of the inverted repeat region to include five genes (rpl22, rps3, rpl16, rpl14, and rps8) that are normally found in the large single-copy region. Maximum likelihood analyses of an 83-gene, 88 taxon angiosperm data set yield an identical tree topology as previous plastid-based trees, and moderately support the sister relationship between Buxaceae and Gunneridae. Molecular dating analyses suggest that Tetracentron and Trochodendron diverged between 44-30 million years ago, which is congruent with the fossil record of Trochodendrales and with previous estimates of the divergence time of these two taxa. We also characterize 154 simple sequence repeat loci from the Tetracentron sinense and Trochodendron aralioides plastomes that will be useful in future studies of population genetic structure for these relict species, both of which are of conservation concern. PMID:23577110

  10. Sequence Analysis of LRPPRC and Its SEC1 Domain Interaction Partners Suggests Roles in Cytoskeletal Organization, Vesicular Trafficking, Nucleocytosolic Shuttling and Chromosome Activity

    PubMed Central

    Liu, Leyuan; McKeehan, Wallace L.

    2011-01-01

    LRPPRC (originally called LRP130) is an intracellular 130-kDa leucine-rich protein that co-purifies with the FGF receptor from liver cell extracts and has been detected in diverse multi-protein complexes from the cell membrane, cytoskeleton and nucleus. Here we report results of a sequence homology analysis of LRPPRC and its SEC1 domain interactive partners. Twenty-three copies of tandem repeats that are similar to PPR, TPR and HEAT repeats characterize the LRPPRC sequence. The N-terminus exhibits multiple copies of leucine-rich nuclear transport signals followed by ENTH, DUF28 and SEC1 homology domains. We used the SEC1 domain to trap interactive partners expressed from a human liver cDNA library. Interactive C19ORF5 (XP_038600) exhibited a strong homology to microtubule-associated proteins (MAP) and a potential arginine-rich mRNA binding motif. UXT (XP_033860) exhibited α-helical properties homologous to the actin-associated spectrin repeat and L/I heptad repeats in mobile transcription factors. C6ORF34 (XP_004305) was homologous to the non-DNA binding C-terminus of the E. coli Rob transcription factor. CECR2 (AAK15343) exhibited a transcription factor AT-hook motif next to two bromodomains and a homology to guanylate-binding protein 1. Taken together these features suggest a regulatory role of LRPPRC and its SEC1 domain-interactive partners in integration of cytoskeletal networks with vesicular trafficking, nucleocytosolic shuttling, chromosome remodeling and transcription. PMID:11827465

  11. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids.

    PubMed

    Das, Jayanta Kumar; Das, Provas; Ray, Korak Kumar; Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.

  12. Solubility Challenges in High Concentration Monoclonal Antibody Formulations: Relationship with Amino Acid Sequence and Intermolecular Interactions.

    PubMed

    Pindrus, Mariya; Shire, Steven J; Kelley, Robert F; Demeule, Barthélemy; Wong, Rita; Xu, Yiren; Yadav, Sandeep

    2015-11-02

    The purpose of this work was to elucidate the molecular interactions leading to monoclonal antibody self-association and precipitation and utilize biophysical measurements to predict solubility behavior at high protein concentration. Two monoclonal antibodies (mAb-G and mAb-R) binding to overlapping epitopes were investigated. Precipitation of mAb-G solutions was most prominent at high ionic strength conditions and demonstrated strong dependence on ionic strength, as well as slight dependence on solution pH. At similar conditions no precipitation was observed for mAb-R solutions. Intermolecular interactions (interaction parameter, kD) related well with high concentration solubility behavior of both antibodies. Upon increasing buffer ionic strength, interactions of mAb-R tended to weaken, while those of mAb-G became more attractive. To investigate the role of amino acid sequence on precipitation behavior, mutants were designed by substituting the CDR of mAb-R into the mAb-G framework (GM-1) or deleting two hydrophobic residues in the CDR of mAb-G (GM-2). No precipitation was observed at high ionic strength for either mutant. The molecular interactions of mutants were similar in magnitude to those of mAb-R. The results suggest that presence of hydrophobic groups in the CDR of mAb-G may be responsible for compromising its solubility at high ionic strength conditions since deleting these residues mitigated the solubility issue.

  13. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided.

  14. Amino acid sequence of band-3 protein from rainbow trout erythrocytes derived from cDNA.

    PubMed Central

    Hübner, S; Michel, F; Rudloff, V; Appelhans, H

    1992-01-01

    In this report we present the first complete band-3 cDNA sequence of a poikilothermic lower vertebrate. The primary structure of the anion-exchange protein band 3 (AE1) from rainbow trout erythrocytes was determined by nucleotide sequencing of cDNA clones. The overlapping clones have a total length of 3827 bp with a 5'-terminal untranslated region of 150 bp, a 2754 bp open reading frame and a 3'-untranslated region of 924 bp. Band-3 protein from trout erythrocytes consists of 918 amino acid residues with a calculated molecular mass of 101 827 Da. Comparison of its amino acid sequence revealed a 60-65% identity within the transmembrane spanning sequence of band-3 proteins published so far. An additional insertion of 24 amino acid residues within the membrane-associated domain of trout band-3 protein was identified, which until now was thought to be a general feature only of mammalian band-3-related proteins. PMID:1637296

  15. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken; SNL,

    2016-07-12

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  16. Complete mtDNA sequences of two millipedes suggest a new model for mitochondrial gene rearrangements: Duplication and non-random loss

    SciTech Connect

    Lavrov, Dennis V.; Boore, Jeffrey L.; Brown, Wesley M.

    2001-11-08

    We determined the complete mtDNA sequences of the millipedes Narceus annularus and Thyropygus sp. (Arthropoda: Diplopoda) and identified in both genomes all 37 genes typical for metazoan mtDNA. The arrangement of these genes is identical in the two millipedes, but differs from that inferred to be ancestral for arthropods by the location of four genes/gene clusters. This novel gene arrangement is unusual for animal mtDNA, in that genes with opposite transcriptional polarities are clustered in the genome and the two clusters are separated by two non-coding regions. The only exception to this pattern is the gene for cysteine tRNA, which is located in the part of the genome that otherwise contains all genes with the opposite transcriptional polarity. We suggest that a mechanism involving complete mtDNA duplication followed by the loss of genes, predetermined by their transcriptional polarity and location in the genome, could generate this gene arrangement from the one ancestral for arthropods. The proposed mechanism has important implications for phylogenetic inferences that are drawn on the basis of gene arrangement comparisons.

  17. Clonality Analysis of Immunoglobulin Gene Rearrangement by Next-Generation Sequencing in Endemic Burkitt Lymphoma Suggests Antigen Drive Activation of BCR as Opposed to Sporadic Burkitt Lymphoma

    PubMed Central

    Amato, Teresa; Abate, Francesco; Piccaluga, Pierpaolo; Iacono, Michele; Fallerini, Chiara; Renieri, Alessandra; De Falco, Giulia; Ambrosio, Maria Raffaella; Mourmouras, Vaselious; Ogwang, Martin; Calbi, Valeria; Rabadan, Roul; Hummel, Michael; Pileri, Stefano; Bellan, Cristiana

    2016-01-01

    Objectives: Recent studies using next-generation sequencing (NGS) analysis disclosed the importance of the intrinsic activation of the B-cell receptor (BCR) pathway in the pathogenesis of sporadic Burkitt lymphoma (sBL) due to mutations of TCF3/ID3 genes. Since no definitive data are available on the genetic landscape of endemic Burkitt (eBL), we first assessed the mutation frequency of TCF3/ID3 in eBL compared with sBL and subsequently the somatic hypermutation status of the BCR to answer whether an extrinsic activation of BCR signaling could also be demonstrated in Burkitt lymphoma. Methods: We assessed the mutations of TCF3/ID3 by RNAseq and the BCR status by NGS analysis of the immunoglobulin genes (IGs). Results: We detected mutations of TCF3/ID3 in about 30% of the eBL cases. This rate is significantly lower than that detected in sBL (64%). The NGS analysis of IGs revealed intraclonal diversity, suggesting an active targeted somatic hypermutation process in eBL compared with sBL. Conclusions: These findings support the view that the antigenic pressure plays a key role in the pathogenetic pathways of eBL, which may be partially distinct from those driving sBL development. PMID:26712879

  18. Analysis of genome-wide RNA-sequencing data suggests age of the CEPH/Utah (CEU) lymphoblastoid cell lines systematically biases gene expression profiles.

    PubMed

    Yuan, Yuan; Tian, Lei; Lu, Dongsheng; Xu, Shuhua

    2015-01-22

    In human, Lymphoblastoid cell lines (LCLs) from the CEPH/CEU (Centre d'Etude du Polymorphisme Humain - Utah) family resource have been extensively used for examining the genetics of gene expression levels. However, we noted that CEU/CEPH cell lines were collected and transformed approximately thirty years ago, much earlier than the other cell lines from the pertaining individuals, which we suspected could potentially affect gene expression, data analysis and results interpretation. In this study, by analyzing RNA sequencing data of CEU and the other three European populations as well as an African population, we systematically examined and evaluated the potential confounding effect of LCL age on gene expression levels and patterns. Our results indicated that gene expression profiles of CEU samples have been biased by the older age of CEU cell lines. Interestingly, most of CEU-specific expressions are associated with functions related to cell proliferation, which are more likely due to older age of cell lines than intrinsic characters of the population. We suggested the results be carefully explained when CEU LCLs are used for transcriptomic data analysis in future studies.

  19. Role of the two-component leader sequence and mature amino acid sequences in extracellular export of endoglucanase EGL from Pseudomonas solanacearum.

    PubMed Central

    Huang, J Z; Schell, M A

    1992-01-01

    The egl gene of Pseudomonas solanacearum encodes a 43-kDa extracellular endoglucanase (mEGL) involved in wilt disease caused by this phytopathogen. Egl is initially translated with a 45-residue, two-part leader sequence. The first 19 residues are apparently removed by signal peptidase II during export of Egl across the inner membrane (IM); the remaining residues of the leader sequence (modified with palmitate) are removed during export across the outer membrane (OM). Localization of Egl-PhoA fusion proteins showed that the first 26 residues of the Egl leader sequence are required and sufficient to direct lipid modification, processing, and export of Egl or PhoA across the IM but not the OM. Fusions of the complete 45-residue leader sequence or of the leader and increasing portions of mEgl sequences to PhoA did not cause its export across the OM. In-frame deletion of portions of mEGL-coding sequences blocked export of the truncated polypeptides across the OM without affecting export across the IM. These results indicate that the first part of the leader sequence functions independently to direct export of Egl across the IM while the second part and sequences and structures in mEGL are involved in export across the OM. Computer analysis of the mEgl amino acid sequence obtained from its nucleotide sequence identified a region of mEGL similar in amino acid sequence to regions in other prokaryotic endoglucanases. Images PMID:1735723

  20. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    PubMed

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  1. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    PubMed Central

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687

  2. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    PubMed

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  3. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected in a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  4. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand.

  5. The amino-acid sequence of the 2S sulphur-rich proteins from seeds of Brazil nut (Bertholletia excelsa H.B.K.).

    PubMed

    Ampe, C; Van Damme, J; de Castro, L A; Sampaio, M J; Van Montagu, M; Vandekerckhove, J

    1986-09-15

    Storage proteins of the albumin solubility fraction from seeds of Bertholletia excelsa H.B.K. were separated by reversed-phase high-performance liquid chromatography and their primary structures were determined by gas-phase sequencing on intact polypeptides and on the overlapping tryptic and thermolysin peptides. The 2S storage proteins consist of two subunits linked by disulphide bridges. The large subunit (8.5 kDa) is expressed in at least six different isoforms while the small subunit (3.6 kDa) consists of only one form. These proteins are extremely rich in glutamine, glutamic acid, arginine and the sulphur-containing amino acids cysteine and methionine. One of the variants even contains a sequence of six methionine residues in a row. Comparison with known sequences of 2S proteins of other dicotyledonous plants shows limited but distinct sequence homology. In particular, the positions of the cysteine residues relative to each other appear to be completely conserved, suggesting that tertiary structure constraints imposed by disulphide bridges dominate sequence conservation. It has been proposed that the two subunits of a related protein (the Brassica napus storage protein) is cleaved from a precursor polypeptide [Crouch, M. L., Tenbarge, K. M., Simon, A. E. & Ferl, R. (1983) J. Mol. Appl. Genet. 2,273-283]. The amino acid sequence homology of the Brazil nut protein with the former suggests that a similar protein processing event could occur.

  6. The amino acid sequence of protein SCMK-B2C from the high-sulphur fraction of wool keratin

    PubMed Central

    Elleman, T. C.

    1972-01-01

    1. The amino acid sequence of a protein from the reduced and carboxymethylated high-sulphur fraction of wool has been determined. 2. The sequence of this S-carboxymethylkerateine (SCMK-B2C) of 151 amino acid residues displays much internal homology and an unusual residue distribution. Thus a ten-residue sequence occurs four times near the N-terminus and five times near the C-terminus with few changes. These regions contain much of the molecule's half-cystine, whereas between them there is a region of 19 residues that are mainly small and devoid of cystine and proline. 3. Certain models of the wool fibre based on its mechanical and physical properties propose a matrix of small compact globular units linked together to form beaded chains. The unusual distribution of the component residues of protein SCMK-B2C suggests structures in the wool-fibre matrix compatible with certain features of the proposed models. PMID:4678578

  7. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  8. Amino-Acid Sequence of NADP-Specific Glutamate Dehydrogenase of Neurospora crassa

    PubMed Central

    Wootton, John C.; Chambers, Geoffrey K.; Holder, Anthony A.; Baron, Andrew J.; Taylor, John G.; Fincham, John R. S.; Blumenthal, Kenneth M.; Moon, Kenneth; Smith, Emil L.

    1974-01-01

    A tentative primary structure of the NADP-specific glutamate dehydrogenase [L-glutamate: NADP oxidoreductase (deaminating), EC 1.4.1.4] from Neurospora crassa has been determined. The proposed sequence contains 452 amino-acid residues in each of the identical subunits of the hexameric enzyme. Comparison of the sequence with that of the bovine liver enzyme reveals considerable homology in the amino-terminal portion of the chain, including the vicinity of the reactive lysine, with only shorter stretches of homology within the carboxyl-terminal regions. The significance of this distribution of homologous regions is discussed. PMID:4155068

  9. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    PubMed Central

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583

  10. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion.

    PubMed

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-07-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed).

  11. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  12. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  13. The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

    PubMed

    Mir, Rafia; Jallu, Shais; Singh, T P

    2015-06-01

    The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.

  14. The acid adaptive tolerance response in Campylobacter jejuni induces a global response, as suggested by proteomics and microarrays

    PubMed Central

    Varsaki, Athanasia; Murphy, Caroline; Barczynska, Alicja; Jordan, Kieran; Carroll, Cyril

    2015-01-01

    Campylobacter jejuni CI 120 is a natural isolate obtained during poultry processing and has the ability to induce an acid tolerance response (ATR) to acid + aerobic conditions in early stationary phase. Other strains tested they did not induce an ATR or they induced it in exponential phase. Campylobacter spp. do not contain the genes that encode the global stationary phase stress response mechanism. Therefore, the aim of this study was to identify genes that are involved in the C. jejuni CI 120 early stationary phase ATR, as it seems to be expressing a novel mechanism of stress tolerance. Two-dimensional gel electrophoresis was used to examine the expression profile of cytosolic proteins during the C. jejuni CI 120 adaptation to acid + aerobic stress and microarrays to determine the genes that participate in the ATR. The results indicate induction of a global response that activated a number of stress responses, including several genes encoding surface components and genes involved with iron uptake. The findings of this study provide new insights into stress tolerance of C. jejuni, contribute to a better knowledge of the physiology of this bacterium and highlight the diversity among different strains. PMID:26221965

  15. The acid adaptive tolerance response in Campylobacter jejuni induces a global response, as suggested by proteomics and microarrays.

    PubMed

    Varsaki, Athanasia; Murphy, Caroline; Barczynska, Alicja; Jordan, Kieran; Carroll, Cyril

    2015-11-01

    Campylobacter jejuni CI 120 is a natural isolate obtained during poultry processing and has the ability to induce an acid tolerance response (ATR) to acid + aerobic conditions in early stationary phase. Other strains tested they did not induce an ATR or they induced it in exponential phase. Campylobacter spp. do not contain the genes that encode the global stationary phase stress response mechanism. Therefore, the aim of this study was to identify genes that are involved in the C. jejuni CI 120 early stationary phase ATR, as it seems to be expressing a novel mechanism of stress tolerance. Two-dimensional gel electrophoresis was used to examine the expression profile of cytosolic proteins during the C. jejuni CI 120 adaptation to acid + aerobic stress and microarrays to determine the genes that participate in the ATR. The results indicate induction of a global response that activated a number of stress responses, including several genes encoding surface components and genes involved with iron uptake. The findings of this study provide new insights into stress tolerance of C. jejuni, contribute to a better knowledge of the physiology of this bacterium and highlight the diversity among different strains.

  16. Sequence-specific thermodynamic properties of nucleic acids influence both transcriptional pausing and backtracking in yeast

    PubMed Central

    2017-01-01

    RNA Polymerase II pauses and backtracks during transcription, with many consequences for gene expression and cellular physiology. Here, we show that the energy required to melt double-stranded nucleic acids in the transcription bubble predicts pausing in Saccharomyces cerevisiae far more accurately than nucleosome roadblocks do. In addition, the same energy difference also determines when the RNA polymerase backtracks instead of continuing to move forward. This data-driven model corroborates—in a genome wide and quantitative manner—previous evidence that sequence-dependent thermodynamic features of nucleic acids influence both transcriptional pausing and backtracking. PMID:28301878

  17. Respiratory syncytial virus fusion glycoprotein: nucleotide sequence of mRNA, identification of cleavage activation site and amino acid sequence of N-terminus of F1 subunit.

    PubMed Central

    Elango, N; Satake, M; Coligan, J E; Norrby, E; Camargo, E; Venkatesan, S

    1985-01-01

    The amino acid sequence of respiratory syncytial virus fusion protein (Fo) was deduced from the sequence of a partial cDNA clone of mRNA and from the 5' mRNA sequence obtained by primer extension and dideoxysequencing. The encoded protein of 574 amino acids is extremely hydrophobic and has a molecular weight of 63371 daltons. The site of proteolytic cleavage within this protein was accurately mapped by determining a partial amino acid sequence of the N-terminus of the larger subunit (F1) purified by radioimmunoprecipitation using monoclonal antibodies. Alignment of the N-terminus of the F1 subunit within the deduced amino acid sequence of Fo permitted us to identify a sequence of lys-lys-arg-lys-arg-arg at the C-terminus of the smaller N-terminal F2 subunit that appears to represent the cleavage/activation domain. Five potential sites of glycosylation, four within the F2 subunit, were also identified. Three extremely hydrophobic domains are present in the protein; a) the N-terminal signal sequence, b) the N-terminus of the F1 subunit that is analogous to the N-terminus of the paramyxovirus F1 subunit and the HA2 subunit of influenza virus hemagglutinin, and c) the putative membrane anchorage domain near the C-terminus of F1. Images PMID:2987829

  18. Analysis of protein function and its prediction from amino acid sequence.

    PubMed

    Clark, Wyatt T; Radivojac, Predrag

    2011-07-01

    Understanding protein function is one of the keys to understanding life at the molecular level. It is also important in the context of human disease because many conditions arise as a consequence of alterations of protein function. The recent availability of relatively inexpensive sequencing technology has resulted in thousands of complete or partially sequenced genomes with millions of functionally uncharacterized proteins. Such a large volume of data, combined with the lack of high-throughput experimental assays to functionally annotate proteins, attributes to the growing importance of automated function prediction. Here, we study proteins annotated by Gene Ontology (GO) terms and estimate the accuracy of functional transfer from protein sequence only. We find that the transfer of GO terms by pairwise sequence alignments is only moderately accurate, showing a surprisingly small influence of sequence identity (SID) in a broad range (30-100%). We developed and evaluated a new predictor of protein function, functional annotator (FANN), from amino acid sequence. The predictor exploits a multioutput neural network framework which is well suited to simultaneously modeling dependencies between functional terms. Experiments provide evidence that FANN-GO (predictor of GO terms; available from http://www.informatics.indiana.edu/predrag) outperforms standard methods such as transfer by global or local SID as well as GOtcha, a method that incorporates the structure of GO.

  19. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  20. Sequence and expression analyses of KIX domain proteins suggest their importance in seed development and determination of seed size in rice, and genome stability in Arabidopsis.

    PubMed

    Thakur, Jitendra Kumar; Agarwal, Pinky; Parida, Swarup; Bajaj, Deepak; Pasrija, Richa

    2013-08-01

    The KIX domain, which mediates protein-protein interactions, was first discovered as a motif in the large multidomain transcriptional activator histone acetyltransferase p300/CBP. Later, the domain was also found in Mediator subunit MED15, where it interacts with many transcription factors. In both proteins, the KIX domain is a target of activation domains of diverse transcription activators. It was found to be an essential component of several specific gene-activation pathways in fungi and metazoans. Not much is known about KIX domain proteins in plants. This study aims to characterize all the KIX domain proteins encoded by the genomes of Arabidopsis and rice. All identified KIX domain proteins are presented, together with their chromosomal locations, phylogenetic analysis, expression and SNP analyses. KIX domains were found not only in p300/CBP- and MED15-like plant proteins, but also in F-box proteins in rice and DNA helicase in Arabidopsis, suggesting roles of KIX domains in ubiquitin-mediated proteasomal degradation and genome stability. Expression analysis revealed overlapping expression of OsKIX_3, OsKIX_5 and OsKIX_7 in different stages of rice seeds development. Moreover, an association analysis of 136 in silico mined SNP loci in 23 different rice genotypes with grain-length information identified three non-synonymous SNP loci in these three rice genes showing strong association with long- and short-grain differentiation. Interestingly, these SNPs were located within KIX domain encoding sequences. Overall, this study lays a foundation for functional analysis of KIX domain proteins in plants.

  1. A root chicory MADS box sequence and the Arabidopsis flowering repressor FLC share common features that suggest conserved function in vernalization and de-vernalization responses.

    PubMed

    Périlleux, Claire; Pieltain, Alexandra; Jacquemin, Guillaume; Bouché, Frédéric; Detry, Nathalie; D'Aloia, Maria; Thiry, Laura; Aljochim, Pierre; Delansnay, Martin; Mathieu, Anne-Sophie; Lutts, Stanley; Tocquin, Pierre

    2013-08-01

    Root chicory (Cichorium intybus var. sativum) is a biennial crop, but is harvested to obtain root inulin at the end of the first growing season before flowering. However, cold temperatures may vernalize seeds or plantlets, leading to incidental early flowering, and hence understanding the molecular basis of vernalization is important. A MADS box sequence was isolated by RT-PCR and named FLC-LIKE1 (CiFL1) because of its phylogenetic positioning within the same clade as the floral repressor Arabidopsis FLOWERING LOCUS C (AtFLC). Moreover, over-expression of CiFL1 in Arabidopsis caused late flowering and prevented up-regulation of the AtFLC target FLOWERING LOCUS T by photoperiod, suggesting functional conservation between root chicory and Arabidopsis. Like AtFLC in Arabidopsis, CiFL1 was repressed during vernalization of seeds or plantlets of chicory, but repression of CiFL1 was unstable when the post-vernalization temperature was favorable to flowering and when it de-vernalized the plants. This instability of CiFL1 repression may be linked to the bienniality of root chicory compared with the annual lifecycle of Arabidopsis. However, re-activation of AtFLC was also observed in Arabidopsis when a high temperature treatment was used straight after seed vernalization, eliminating the promotive effect of cold on flowering. Cold-induced down-regulation of a MADS box floral repressor and its re-activation by high temperature thus appear to be conserved features of the vernalization and de-vernalization responses in distant species.

  2. Amino acid sequence of myoglobin from emu (Dromaius novaehollandiae) skeletal muscle.

    PubMed

    Suman, S P; Joseph, P; Li, S; Beach, C M; Fontaine, M; Steinke, L

    2010-11-01

    The objective of the present study was to characterize the primary structure of emu myoglobin (Mb). Emu Mb was isolated from Iliofibularis muscle employing gel-filtration chromatography. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry was employed to determine the exact molecular mass of emu Mb in comparison with horse Mb, and Edman degradation was utilized to characterize the amino acid sequence. The molecular mass of emu Mb was 17,380 Da and was close to those reported for ratite and poultry myoglobins. Similar to myoglobins from meat-producing livestock and birds, emu Mb has 153 amino acids. Emu Mb contains 9 histidines. Proximal and distal histidines, responsible for coordinating oxygen-binding property of Mb, are conserved in emu. Emu Mb shared more than 90% homology with ratite and chicken myoglobins, whereas it demonstrated only less than 70% sequence similarity with ruminant myoglobins.

  3. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    NASA Astrophysics Data System (ADS)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  4. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  5. Amino acid sequence of atrial natriuretic peptides in human coronary sinus plasma.

    PubMed

    Yandle, T; Crozier, I; Nicholls, G; Espiner, E; Carne, A; Brennan, S

    1987-07-31

    Two atrial natriuretic peptides were purified from pooled human coronary sinus plasma by Sep-Pak extraction, immunoaffinity chromatography and reverse phase HPLC. The amino acid sequences of the two peptides were homologous with 99-126 human atrial natriuretic peptide (hANP) and 106-126 hANP, the latter being most probably linked to 99-105 ANP by the disulphide bond. The molar ratio of the peptides in plasma, as assessed by radioimmunoassay was 10:3.

  6. Amino acid sequence similarity between rabies virus glycoprotein and snake venom curaremimetic neurotoxins.

    PubMed

    Lentz, T L; Wilson, P T; Hawrot, E; Speicher, D W

    1984-11-16

    Evidence was presented earlier that a host-cell receptor for the highly neurotropic rabies virus might be the acetylcholine receptor. The amino acid sequence of the glycoprotein of rabies virus was compared by computer analysis with that of snake venom curaremimetic neurotoxins, potent ligands of the acetylcholine receptor. A statistically significant sequence relation was found between a segment of the rabies glycoprotein and the entire sequence of long neurotoxins. The greatest identity occurs with residues considered most important in neurotoxicity, including those interacting with the acetylcholine binding site of the acetylcholine receptor. Because of the similarity between the glycoprotein and the receptor-binding region of the neurotoxins, this region of the viral glycoprotein may function as a recognition site for the acetylcholine receptor. Direct binding of the rabies virus glycoprotein to the acetylcholine receptor could contribute to the neurotropism of this virus.

  7. [MOLECULAR EVOLUTION OF ION CHANNELS: AMINO ACID SEQUENCES AND 3D STRUCTURES].

    PubMed

    Korkosh, V S; Zhorov, B S; Tikhonov, D B

    2016-01-01

    An integral part of modern evolutionary biology is comparative analysis of structure and function of macromolecules such as proteins. The first and critical step to understand evolution of homologous proteins is their amino acid sequence alignment. However, standard algorithms fop not provide unambiguous sequence alignments for proteins of poor homology. More reliable results can be obtained by comparing experimental 3D structures obtained at atomic resolution, for instance, with the aid of X-ray structural analysis. If such structures are lacking, homology modeling is used, which may take into account indirect experimental data on functional roles of individual amino-acid residues. An important problem is that the sequence alignment, which reflects genetic modifications, does not necessarily correspond to the functional homology. The latter depends on three-dimensional structures which are critical for natural selection. Since alignment techniques relying only on the analysis of primary structures carry no information on the functional properties of proteins, including 3D structures into consideration is very important. Here we consider several examples involving ion channels and demonstrate that alignment of their three-dimensional structures can significantly improve sequence alignments obtained by traditional methods.

  8. The amino acid sequence of the aspartate aminotransferase from baker's yeast (Saccharomyces cerevisiae).

    PubMed Central

    Cronin, V B; Maras, B; Barra, D; Doonan, S

    1991-01-01

    1. The single (cytosolic) aspartate aminotransferase was purified in high yield from baker's yeast (Saccharomyces cerevisiae). 2. Amino-acid-sequence analysis was carried out by digestion of the protein with trypsin and with CNBr; some of the peptides produced were further subdigested with Staphylococcus aureus V8 proteinase or with pepsin. Peptides were sequenced by the dansyl-Edman method and/or by automated gas-phase methods. The amino acid sequence obtained was complete except for a probable gap of two residues as indicated by comparison with the structures of counterpart proteins in other species. 3. The N-terminus of the enzyme is blocked. Fast-atom-bombardment m.s. was used to identify the blocking group as an acetyl one. 4. Alignment of the sequence of the enzyme with those of vertebrate cytosolic and mitochondrial aspartate aminotransferases and with the enzyme from Escherichia coli showed that about 25% of residues are conserved between these distantly related forms. 5. Experimental details and confirmatory data for the results presented here are given in a Supplementary Publication (SUP 50164, 25 pages) that has been deposited at the British Library Document Supply Centre, Boston Spa. Wetherby, West Yorkshire LS23 7 BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1991) 273, 5. PMID:1859361

  9. Purification, properties and amino acid sequence of a low-Mr abundant seed protein from pea (Pisum sativum L.).

    PubMed

    Gatehouse, J A; Gilroy, J; Hoque, M S; Croy, R R

    1985-01-01

    The seeds of pea (Pisum sativum L.) contain several proteins in the albumin solubility fraction that are significant components of total cotyledonary protein (5-10%) and are accumulated in developing seeds concurrently with storage-protein synthesis. One of these proteins, of low Mr and designated 'Psa LA', has been purified, characterized and sequenced. Psa LA has an Mr of 11000 and contains polypeptides of Mr 6000, suggesting that the protein molecules are dimeric. The amino acid sequence contains 54 residues, with a high content (10/54) of asparagine/aspartate. It has no inhibitory action towards trypsin or chymotrypsin, and is distinct from the inhibitors of those enzymes found in pea seeds, nor does it inhibit hog pancreatic alpha-amylase. The protein contains no methionine, but significant amounts of cysteine (four residues per polypeptide), suggesting a possible role as a sulphur storage protein. However, its sequence is not homologous with low-Mr (2S) storage proteins from castor bean (Ricinus communis) or rape (Brassica napus). Psa LA therefore represents a new type of low-Mr seed protein.

  10. Processing and amino acid sequence analysis of the mouse mammary tumor virus env gene product.

    PubMed Central

    Arthur, L O; Copeland, T D; Oroszlan, S; Schochetman, G

    1982-01-01

    The envelope proteins of mouse mammary tumor virus (MMTV) are synthesized from a subgenomic 24S mRNA as a 75,000-dalton glycosylated precursor polyprotein which is eventually processed to the mature glycoproteins gp52 and gp36. In vivo synthesis of this env precursor in the presence of the core glycosylation inhibitor tunicamycin yielded a precursor of approximately 61,000 daltons (P61env). However, a 67,000-dalton protein (P67env) was obtained from cell-free translation with the MMTV 24S mRNA as the template. To determine whether the portion of the protein cleaved from P67env to give P61env was removed from the NH2-terminal end of P67env and as such would represent a leader sequence, the NH2-terminal amino acid sequence of the terminal peptide gp52 was determined. Glutamic acid, and not methionine, was found to be the amino-terminal residue of gp52, indicating that the cleaved portion was derived from the NH2-terminal end of P67env. The NH2-terminal amino acid sequences of gp52's from endogenous and exogenous C3H MMTVs were determined though 46 residues and found to be identical. However, amino acid composition and type-specific gp52 radioimmunoassays from MMTVs grown in heterologous cells indicated primary structure differences between gp52's of the two viruses. The nucleic acid sequence of cloned MMTV DNA fragments (J. Majors and H. E. Varmus, personal communication) in conjunction with the NH2-terminal sequence of gp52 allowed localization of the env gene in the MMTV genome. Nucleotides coding for the NH2 terminus of gp52 begin approximately 0.8 kilobase to the 3' side of the single EcoRI cleavage site. Localization of the env gene at that point agrees with the proposed gene order -gag-pol-env- and also allows sufficient coding potential for the glycoprotein precursor without extending into the long terminal repeat. Images PMID:6281457

  11. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  12. BeadCons: detection of nucleic acid sequences by flow cytometry.

    PubMed

    Horejsh, Douglas; Martini, Federico; Capobianchi, Maria Rosaria

    2005-11-01

    Molecular beacons are single-stranded nucleic acid structures with a terminal fluorophore and a distal, terminal quencher. These molecules are typically used in real-time PCR assays, but have also been conjugated with solid matrices. This unit describes protocols related to molecular beacon-conjugated beads (BeadCons), whose specific hybridization with complementary target sequences can be resolved by cytometry. Assay sensitivity is achieved through the concentration of fluorescence signal on discrete particles. By using molecular beacons with different fluorophores and microspheres of different sizes, it is possible to construct a fluid array system with each bead corresponding to a specific target nucleic acid. Methods are presented for the design, construction, and use of BeadCons for the specific, multiplexed detection of unlabeled nucleic acids in solution. The use of bead-based detection methods will likely lead to the design of new multiplex molecular diagnostic tools.

  13. Measuring nanometer distances in nucleic acids using a sequence-independent nitroxide probe

    PubMed Central

    Qin, Peter Z; Haworth, Ian S; Cai, Qi; Kusnetzow, Ana K; Grant, Gian Paola G; Price, Eric A; Sowa, Glenna Z; Popova, Anna; Herreros, Bruno; He, Honghang

    2008-01-01

    This protocol describes the procedures for measuring nanometer distances in nucleic acids using a nitroxide probe that can be attached to any nucleotide within a given sequence. Two nitroxides are attached to phosphorothioates that are chemically substituted at specific sites of DNA or RNA. Inter-nitroxide distances are measured using a four-pulse double electron–electron resonance technique, and the measured distances are correlated to the parent structures using a Web-accessible computer program. Four to five days are needed for sample labeling, purification and distance measurement. The procedures described herein provide a method for probing global structures and studying conformational changes of nucleic acids and protein/nucleic acid complexes. PMID:17947978

  14. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1.

    PubMed

    Rhee, Mun Su; Moritz, Brélan E; Xie, Gary; Glavina Del Rio, T; Dalin, E; Tice, H; Bruce, D; Goodwin, L; Chertkov, O; Brettin, T; Han, C; Detter, C; Pitluck, S; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O; Shanmugam, K T

    2011-12-31

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  15. Rat androgen-binding protein: evidence for identical subunits and amino acid sequence homology with human sex hormone-binding globulin.

    PubMed

    Joseph, D R; Hall, S H; French, F S

    1987-01-01

    The cDNA for rat androgen-binding protein (ABP) was previously isolated from a bacteriophage lambda gt11 rat testis cDNA library and its identity was confirmed by epitope selection. Hybrid-arrested translation studies have now demonstrated the identity of the isolates. The nucleotide sequence of a near full-length cDNA encodes a 403-amino acid precursor (Mr = 44,539), which agrees in size with the cell-free translation product (Mr = 45,000) of ABP mRNA. Putative sites of N-glycosylation and signal peptide cleavage were identified. Comparison of the predicted amino acid sequence of rat ABP with the amino-terminal amino acid sequence of human sex hormone-binding globulin revealed that 17 of 25 residues are identical. On the basis of the predicted amino acid sequence the molecular weight of the primary translation product, lacking the signal peptide, was 41,183. Hybridization analyses indicated that the two subunits of ABP are coded for by a single gene and a single mRNA species. Our results suggest that ABP consists of two subunits with identical primary sequences and that differences in post-translational processing result in the production of 47,000 and 41,000 molecular weight monomers.

  16. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acids substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln 121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's phesant lysozyme. The phylogenetic tree constructured by the comparison of amino acid sequences of phasianoid birds lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group.

  17. Analysis of expression and amino acid sequence of the allergen Mag 3 in two species of house dust mites-Dermatophagoides farinae and D. pteronyssinus (Acari: Astigmata: Pyroglyphidae).

    PubMed

    Asman, Marek; Solarz, Krzysztof; Szilman, Ewa; Szilman, Piotr

    2010-01-01

    In the 90's of the XX century, 2 new and important allergens of house dust mites mites were cloned and sequenced: Mag 1 and Mag 3. However, the second allergen has been identified to date only in extracts of Dermatophagoides farinae [DF ]. In this work, we aimed to detect expression of this important allergen and for the first time analyze to the amino acid sequence in other species of house dust mite - Dermatophagoides pteronyssinus [DP ]. We were able to confirm the expression of allergen Mag 3 in DF and to exclude it in DP . By sequencing the products of DNA amplification, we revealed the nucleotide sequence encoding allergen Mag 3 in DF . This analysis enabled detection of 9 single base changes. An analysis of encoded amino acid sequence by triplets with substituted nucleotides revealed that 8 changes were polymorphic, and 1 was a mutation substituting GTG (valine) for ATG (methionine) at 236 position. However, the presence of amino acid sequence difference in this allergen might suggest that there exist other isoforms which can make difficult both diagnosis as well as immunotherapy in persons who produce allergic response to this allergen. The variants of allergen Mag 3 (group 14) are still not known beside the very good known allergen variants of the other main groups 1, 2, 4, 5 or 7. Thus, the identification and definition of allergic properties of allergen Mag 3 variants needs to be further investigated.

  18. Heterogeneity of ITS1 sequences in the biting midge Culicoides impunctatus (Goetghebuer) suggests a population in Argyll, Scotland, may be genetically distinct.

    PubMed

    Ritchie, Allyson; Blackwell, Alison; Malloch, Gaynor; Fenton, Brian

    2004-06-01

    Ribosomal DNA (rDNA) internal transcribed spacer 1 (ITS1) is a useful genomic region for understanding evolutionary and genetic relationships. In the current study, variation in ITS1 from eight Culicoides species was analysed by PCR, DNA restriction analysis, cloning, and sequencing. ITS1 variants were essentially homogenized within a species, as sequences were identical or closely related. However, Culicoides impunctatus ITS1 sequences derived from one (Argyll) of five populations contained considerable genomic diversity. The secondary structure of each ITS1 was computed. The structure aided the production of an accurate alignment and the identification of a large indel. A phylogenetic analysis was performed. Some of the sequences from the diverse Argyll C. impunctatus population were more related to Culicoides imicola, a vector of animal pathogens in the Old World, than they were to the other C. impunctatus sequences. Thus, the rDNA ITS1 regions of individuals in the Argyll C. impunctatus population were not conforming to the general theory of rDNA homogenization through molecular drive.

  19. Phylogenetic analysis of nuclear small subunit rDNA sequences suggests that the endangered African Pencil Cedar, Juniperus procera, is associated with distinct members of Glomeraceae.

    PubMed

    Wubet, Tesfaye; Weiss, Michael; Kottke, Ingrid; Teketay, Demel; Oberwinkler, Franz

    2006-09-01

    The endangered indigenous tree species Juniperus procera, commonly known as African Pencil Cedar, is an important component of the dry Afromontane vegetation of Ethiopia and was shown to be AM in earlier studies. Here we describe the composition of AM fungi in colonized roots of J. procera from two dry Afromontane forests of Ethiopia. The nuSSU rDNA gene was amplified from colonized roots, cloned and sequenced using AM fungal specific primers that were partly developed for this study. Molecular phylogenetic analysis revealed that all the glomeralean sequences obtained belonged exclusively to the genus Glomus (Glomeraceae). Seven distinct Glomus sequence types were identified that all are new to science. The composition of the AM fungal communities between the sampled trees, and between the two study sites in general, differed significantly. Isolation and utilization of the indigenous AM fungal taxa from the respective sites might be required for successful enrichment plantation of this threatened Juniperus species.

  20. Analysis of the coding-complete genomic sequence of groundnut ringspot virus suggests a common ancestor with tomato chlorotic spot virus.

    PubMed

    de Breuil, Soledad; Cañizares, Joaquín; Blanca, José Miguel; Bejerman, Nicolás; Trucco, Verónica; Giolitti, Fabián; Ziarsolo, Peio; Lenardon, Sergio

    2016-08-01

    Groundnut ringspot virus (GRSV) and tomato chlorotic spot virus (TCSV) share biological and serological properties, so their identification is carried out by molecular methods. Their genomes consist of three segmented RNAs: L, M and S. The finding of a reassortant between these two viruses may complicate correct virus identification and requires the characterization of the complete genome. Therefore, we present for the first time the complete sequences of all the genes encoded by a GRSV isolate. The high level of sequence similarity between GRSV and TCSV (over 90 % identity) observed in the genes and proteins encoded in the M RNA support previous results indicating that these viruses probably have a common ancestor.

  1. Nucleotide sequence of the luxC gene encoding fatty acid reductase of the lux operon from Photobacterium leiognathi.

    PubMed

    Lin, J W; Chao, Y F; Weng, S F

    1993-02-26

    The nucleotide sequence of the luxC gene (EMBL Accession No. 65156) encoding fatty acid reductase (FAR) of the lux operon from Photobacterium leiognathi PL741 was determined and the encoded amino acid sequence deduced. The fatty acid reductase is a component of the fatty acid reductase complex. The complex is responsible for converting fatty acid to aldehyde which serves as the substrate in the luciferase-catalyzed bioluminescent reaction. The protein comprises 478 amino acid residues and has a calculated M(r) of 53,858. Alignment and comparison of the fatty acid reductase of P. leiognathi with that of Vibrio harveyi B392 and Vibrio fischeri ATCC 7744 shows that there is 70% and 59% amino acid residues identity, respectively.

  2. The RNA polymerase I transcription factor UBF is a sequence-tolerant HMG-box protein that can recognize structured nucleic acids.

    PubMed Central

    Copenhaver, G P; Putnam, C D; Denton, M L; Pikaard, C S

    1994-01-01

    Upstream Binding Factor (UBF) is important for activation of ribosomal RNA transcription and belongs to a family of proteins containing nucleic acid binding domains, termed HMG-boxes, with similarity to High Mobility Group (HMG) chromosomal proteins. Proteins in this family can be sequence-specific or highly sequence-tolerant binding proteins. We show that Xenopus UBF can be classified among the sequence-tolerant class. Methylation interference assays using enhancer DNA probes failed to reveal any critical nucleotides required for UBF binding. Selection by UBF of optimal binding sites among a population of enhancer oligonucleotides with randomized sequences also failed to reveal any consensus sequence. The minor groove specific drugs chromomycin A3, distamycin A and actinomycin D competed against UBF for enhancer binding, suggesting that UBF, like other HMG-box proteins, probably interacts with the minor groove. UBF also shares with other HMG box proteins the ability to bind synthetic cruciform DNA. However, UBF appears different from other HMG-box proteins in that it can bind both RNA (tRNA) and DNA. The sequence-tolerant nature of UBF-nucleic acid interactions may accommodate the rapid evolution of ribosomal RNA gene sequences. Images PMID:8041627

  3. Nucleotide sequence of the Klebsiella pneumoniae nifD gene and predicted amino acid sequence of the alpha-subunit of nitrogenase MoFe protein.

    PubMed Central

    Ioannidis, I; Buck, M

    1987-01-01

    The nucleotide sequence of the Klebsiella pneumoniae nifD gene is presented and together with the accompanying paper [Holland, Zilberstein, Zamir & Sussman (1987) Biochem. J. 247, 277-285] completes the sequence of the nifHDK genes encoding the nitrogenase polypeptides. The K. pneumoniae nifD gene encodes the 483-amino acid-residue nitrogenase alpha-subunit polypeptide of Mr 54156. The alpha-subunit has five strongly conserved cysteine residues at positions 63, 89, 155, 184 and 275, some occurring in a region showing both primary sequence and potential structural homology to the K. pneumoniae nitrogenase beta-subunit. A comparison with six other alpha-subunit amino acid sequences has been made, which indicates a number of potentially important domains within alpha-subunits. PMID:3322262

  4. Modulation of crystal formation by bone phosphoproteins: role of glutamic acid-rich sequences in the nucleation of hydroxyapatite by bone sialoprotein.

    PubMed Central

    Hunter, G K; Goldberg, H A

    1994-01-01

    Bone sialoprotein (BSP) is a bone-specific glycoprotein containing phosphoserine and sulphotyrosine residues and regions of contiguous glutamic acid residues. Recent studies in this laboratory have shown that BSP is capable of nucleating the bone mineral hydroxyapatite in a steady-state agarose gel system. We show here that chemical modification of carboxylate groups abolishes the nucleation activity of BSP, but enzymic dephosphorylation has no effect. Formation of hydroxyapatite is also induced by poly(L-glutamic acid) and poly(D-glutamic acid), but not by poly(L-aspartic acid) or poly(L-lysine). Calreticulin, a muscle protein with short sequences of contiguous glutamic acid residues, also lacks nucleation activity. These findings suggest that the nucleation of hydroxyapatite by BSP involves one or both of the glutamic acid-rich sequences. Based on these findings and others, we propose that polycarboxylate sequences represent a general site for growth-modulating interactions between proteins and biological crystals. Images Figure 3 PMID:7915111

  5. Complete amino acid sequence of the A chain of human complement-classical-pathway enzyme C1r.

    PubMed Central

    Arlaud, G J; Willis, A C; Gagnon, J

    1987-01-01

    The amino acid sequence of human C1r A chain was determined, from sequence analysis performed on fragments obtained from C1r autolytic cleavage, cleavage of methionyl bonds, tryptic cleavages at arginine and lysine residues, and cleavages by staphylococcal proteinase. The polypeptide chain has an N-terminal serine residue and contains 446 amino acid residues (Mr 51,200). The sequence data allow chemical characterization of fragments alpha (positions 1-211), beta (positions 212-279) and gamma (positions 280-446) yielded from C1r autolytic cleavage, and identification of the two major cleavage sites generating these fragments. Position 150 of C1r A chain is occupied by a modified amino acid residue that, upon acid hydrolysis, yields erythro-beta-hydroxyaspartic acid, and that is located in a sequence homologous to the beta-hydroxyaspartic acid-containing regions of Factor IX, Factor X, protein C and protein Z. Sequence comparison reveals internal homology between two segments (positions 10-78 and 186-257). Two carbohydrate moieties are attached to the polypeptide chain, both via asparagine residues at positions 108 and 204. Combined with the previously determined sequence of C1r B chain [Arlaud & Gagnon (1983) Biochemistry 22, 1758-1764], these data give the complete sequence of human C1r. PMID:3036070

  6. L-Rhamnose-binding lectin from eggs of the Echinometra lucunter: Amino acid sequence and molecular modeling.

    PubMed

    Carneiro, Rômulo Farias; Teixeira, Claudener Souza; de Melo, Arthur Alves; de Almeida, Alexandra Sampaio; Cavada, Benildo Sousa; de Sousa, Oscarina Viana; da Rocha, Bruno Anderson Matias; Nagano, Celso Shiniti; Sampaio, Alexandre Holanda

    2015-01-01

    An L-rhamnose-binding lectin named ELEL was isolated from eggs of the rock boring sea urchin Echinometra lucunter by affinity chromatography on lactosyl-agarose. ELEL is a homodimer linked by a disulfide bond with subunits of 11 kDa each. The new lectin was inhibited by saccharides possessing the same configuration of hydroxyl groups at C-2 and C-4, such as L-rhamnose, melibiose, galactose and lactose. The amino acid sequence of ELEL was determined by tandem mass spectrometry. The ELEL subunit has 103 amino acids, including nine cysteine residues involved in four conserved intrachain disulfide bonds and one interchain disulfide bond. The full sequence of ELEL presents conserved motifs commonly found in rhamnose-binding lectins, including YGR, DPC and KYL. A three-dimensional model of ELEL was created, and molecular docking revealed favorable binding energies for interactions between ELEL and rhamnose, melibiose and Gb3 (Galα1-4Galβ1-4Glcβ1-Cer). Furthermore, ELEL was able to agglutinate Gram-positive bacterial cells, suggesting its ability to recognize pathogens.

  7. Phylogenetic analysis of dicyemid mesozoans (phylum Dicyemida) from innexin amino acid sequences: dicyemids are not related to Platyhelminthes.

    PubMed

    Suzuki, Takahito G; Ogino, Kazutoyo; Tsuneki, Kazuhiko; Furuya, Hidetaka

    2010-06-01

    Dicyemid mesozoans are endoparasites, or endosymbionts, found only in the renal sac of benthic cephalopod molluscs. The body organization of dicyemids is very simple, consisting of usually 10 to 40 cells, with neither body cavities nor differentiated organs. Dicyemids were considered as primitive animals, and the out-group of all metazoans, or as occupying a basal position of lophotrochozoans close to flatworms. We cloned cDNAs encoding for the gap junction component proteins, innexin, from the dicyemids. Its expression pattern was observed by whole-mount in situ hybridization. In adult individuals, the innexin was expressed in calottes, infusorigens, and infusoriform embryos. The unique temporal pattern was observed in the developing infusoriform embryos. Innexin amino acid sequences had taxon-specific indels which enabled identification of the 3 major protostome lineages, i.e., 2 ecdysozoans (arthropods and nematodes) and the lophotrochozoans. The dicyemids show typical, lophotrochozoan-type indels. In addition, the Bayesian and maximum likelihood trees based on the innexin amino acid sequences suggested dicyemids to be more closely related to the higher lophotrochozoans than to the flatworms. Flatworms were the sister group, or consistently basal, to the other lophotrochozoan clade that included dicyemids, annelids, molluscs, and brachiopods.

  8. Plasma acylcarnitine profiles suggest incomplete long-chain fatty acid beta-oxidation and altered tricarboxylic acid cycle activity in type 2 diabetic African-American women.

    PubMed

    Adams, Sean H; Hoppel, Charles L; Lok, Kerry H; Zhao, Ling; Wong, Scott W; Minkler, Paul E; Hwang, Daniel H; Newman, John W; Garvey, W Timothy

    2009-06-01

    Inefficient muscle long-chain fatty acid (LCFA) combustion is associated with insulin resistance, but molecular links between mitochondrial fat catabolism and insulin action remain controversial. We hypothesized that plasma acylcarnitine profiling would identify distinct metabolite patterns reflective of muscle fat catabolism when comparing individuals bearing a missense G304A uncoupling protein 3 (UCP3 g/a) polymorphism to controls, because UCP3 is predominantly expressed in skeletal muscle and g/a individuals have reduced whole-body fat oxidation. MS analyses of 42 carnitine moieties in plasma samples from fasting type 2 diabetics (n = 44) and nondiabetics (n = 12) with or without the UCP3 g/a polymorphism (n = 28/genotype: 22 diabetic, 6 nondiabetic/genotype) were conducted. Contrary to our hypothesis, genotype had a negligible impact on plasma metabolite patterns. However, a comparison of nondiabetics vs. type 2 diabetics revealed a striking increase in the concentrations of fatty acylcarnitines reflective of incomplete LCFA beta-oxidation in the latter (i.e. summed C10- to C14-carnitine concentrations were approximately 300% of controls; P = 0.004). Across all volunteers (n = 56), acetylcarnitine rose and propionylcarnitine decreased with increasing hemoglobin A1c (r = 0.544, P < 0.0001; and r = -0.308, P < 0.05, respectively) and with increasing total plasma acylcarnitine concentration. In proof-of-concept studies, we made the novel observation that C12-C14 acylcarnitines significantly stimulated nuclear factor kappa-B activity (up to 200% of controls) in RAW264.7 cells. These results are consistent with the working hypothesis that inefficient tissue LCFA beta-oxidation, due in part to a relatively low tricarboxylic acid cycle capacity, increases tissue accumulation of acetyl-CoA and generates chain-shortened acylcarnitine molecules that activate proinflammatory pathways implicated in insulin resistance.

  9. Temperature Shift Experiments Suggest That Metabolic Impairment and Enhanced Rates of Photorespiration Decrease Organic Acid Levels in Soybean Leaflets Exposed to Supra-Optimal Growth Temperatures.

    PubMed

    Sicher, Richard C

    2015-08-05

    Elevated growth temperatures are known to affect foliar organic acid concentrations in various plant species. In the current study, citrate, malate, malonate, fumarate and succinate decreased 40 to 80% in soybean leaflets when plants were grown continuously in controlled environment chambers at 36/28 compared to 28/20 °C. Temperature effects on the above mentioned organic acids were partially reversed three days after plants were transferred among optimal and supra-optimal growth temperatures. In addition, CO2 enrichment increased foliar malate, malonate and fumarate concentrations in the supra-optimal temperature treatment, thereby mitigating effects of high temperature on respiratory metabolism. Glycerate, which functions in the photorespiratory pathway, decreased in response to CO2 enrichment at both growth temperatures. The above findings suggested that diminished levels of organic acids in soybean leaflets upon exposure to high growth temperatures were attributable to metabolic impairment and to changes of photorespiratory flux. Leaf development rates differed among temperature and CO2 treatments, which affected foliar organic acid levels. Additionally, we report that large decreases of foliar organic acids in response to elevated growth temperatures were observed in legume species.

  10. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    PubMed Central

    Chen, Ke; Kurgan, Lukasz A; Ruan, Jishou

    2007-01-01

    Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are characterized by accuracies below 70

  11. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  12. Gene sequence and predicted amino acid sequence of the motA protein, a membrane-associated protein required for flagellar rotation in Escherichia coli.

    PubMed Central

    Dean, G E; Macnab, R M; Stader, J; Matsumura, P; Burks, C

    1984-01-01

    The motA and motB gene products of Escherichia coli are integral membrane proteins necessary for flagellar rotation. We determined the DNA sequence of the region containing the motA gene and its promoter. Within this sequence, there is an open reading frame of 885 nucleotides, which with high probability (98% confidence level) meets criteria for a coding sequence. The 295-residue amino acid translation product had a molecular weight of 31,974, in good agreement with the value determined experimentally by gel electrophoresis. The amino acid sequence, which was quite hydrophobic, was subjected to a theoretical analysis designed to predict membrane-spanning alpha-helical segments of integral membrane proteins; four such hydrophobic helices were predicted by this treatment. Additional amphipathic helices may also be present. A remarkable feature of the sequence is the existence of two segments of high uncompensated charge density, one positive and the other negative. Possible organization of the protein in the membrane is discussed. Asymmetry in the amino acid composition of translated DNA sequences was used to distinguish between two possible initiation codons. The use of this method as a criterion for authentication of coding regions is described briefly in an Appendix. PMID:6090403

  13. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  14. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.

  15. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  16. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  17. Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

    PubMed Central

    Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

    1997-01-01

    The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753

  18. [Measurement of the amino acid sequence for the fusion protein FP3 with LC-MS/MS].

    PubMed

    Li, Xiang; Gao, Xiang-Dong; Tao, Lei; Pei, De-Ning; Guo, Ying; Rao, Chun-Ming; Wang, Jun-Zhi

    2012-02-01

    The amino acid sequence of the fusion protein FP3 was measured by two types of LC-MS/MS and its primary structure was confirmed. After reduction and alkylation, the protein was digested with trypsin and glycosyl groups in glycopeptide were removed by PNGase F. The mixed peptides were separated by LC, then Q-TOF and Ion trap tandem mass spectrometry were used to measure b, y fragment ions of each peptide to analyze the amino acid sequence of fusion protein FP3. Seventy-six percent of full amino acid sequence of the fusion protein FP3 was measured by LC-ESI-Q-TOF with the remaining 24% completed by LC-ESI-Trap. As LC-MS and tandem mass spectrometry are rapid, sensitive, accurate to measure the protein amino acid sequence, they are important approach to structure analysis and identification of recombinant protein.

  19. Co-conservation of rRNA tetraloop sequences and helix length suggests involvement of the tetraloops in higher-order interactions

    NASA Technical Reports Server (NTRS)

    Hedenstierna, K. O.; Siefert, J. L.; Fox, G. E.; Murgola, E. J.

    2000-01-01

    Terminal loops containing four nucleotides (tetraloops) are common in structural RNAs, and they frequently conform to one of three sequence motifs, GNRA, UNCG, or CUUG. Here we compare available sequences and secondary structures for rRNAs from bacteria, and we show that helices capped by phylogenetically conserved GNRA loops display a strong tendency to be of conserved length. The simplest interpretation of this correlation is that the conserved GNRA loops are involved in higher-order interactions, intramolecular or intermolecular, resulting in a selective pressure for maintaining the lengths of these helices. A small number of conserved UNCG loops were also found to be associated with conserved length helices, consistent with the possibility that this type of tetraloop also takes part in higher-order interactions.

  20. NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents

    PubMed Central

    Liu, Sophia S.; Hockenberry, Adam J.; Lancichinetti, Andrea; Jewett, Michael C.

    2016-01-01

    The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems. PMID:27835644

  1. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    PubMed

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major grove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  2. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  3. Fluorescence energy transfer as a probe for nucleic acid structures and sequences.

    PubMed Central

    Mergny, J L; Boutorine, A S; Garestier, T; Belloc, F; Rougée, M; Bulychev, N V; Koshkin, A A; Bourson, J; Lebedev, A V; Valeur, B

    1994-01-01

    The primary or secondary structure of single-stranded nucleic acids has been investigated with fluorescent oligonucleotides, i.e., oligonucleotides covalently linked to a fluorescent dye. Five different chromophores were used: 2-methoxy-6-chloro-9-amino-acridine, coumarin 500, fluorescein, rhodamine and ethidium. The chemical synthesis of derivatized oligonucleotides is described. Hybridization of two fluorescent oligonucleotides to adjacent nucleic acid sequences led to fluorescence excitation energy transfer between the donor and the acceptor dyes. This phenomenon was used to probe primary and secondary structures of DNA fragments and the orientation of oligodeoxynucleotides synthesized with the alpha-anomers of nucleoside units. Fluorescence energy transfer can be used to reveal the formation of hairpin structures and the translocation of genes between two chromosomes. PMID:8152922

  4. Amino acid sequence of two neurotoxins from the venom of the Egyptian black snake (Walterinnesia aegyptia).

    PubMed

    Samejima, Y; Aoki-Tomomatsu, Y; Yanagisawa, M; Mebs, D

    1997-02-01

    The venom of the Egyptian black snake Walterinnesia aegyptia contains at least three toxins, which act postsynaptically to block the neuromuscular transmission of isolated rat phrenic nerve-diaphragm and chicken biventer cervicis muscle. The complete amino acid sequence of the two toxins, W-III and W-IV, consisting of 62 amino acid residues, was elucidated by Edman degradation of fragments obtained after Staphylococcus aureus protease and prolylpeptidase digestion. Although the toxins exhibit close structural homology to other short-chain postsynaptic neurotoxins from Elapidae venoms, toxin IV is unique by having a free SH-group (cysteine) at position 16. In position 35 of W-III, which is located at the tip of the central loop, threonine is replaced by lysine, which may alter the interaction of the toxin with the acetylcholine receptor, since the toxin is seven times less lethal than toxin W-IV.

  5. Complete genome sequence of Lactococcus lactis IO-1, a lactic acid bacterium that utilizes xylose and produces high levels of L-lactic acid.

    PubMed

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-04-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly L-lactic acid at high xylose concentrations. From ortholog analysis with other five L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  6. Complete genome sequence of Bacillus amyloliquefaciens LL3, which exhibits glutamic acid-independent production of poly-γ-glutamic acid.

    PubMed

    Geng, Weitao; Cao, Mingfeng; Song, Cunjiang; Xie, Hui; Liu, Li; Yang, Chao; Feng, Jun; Zhang, Wei; Jin, Yinghong; Du, Yang; Wang, Shufang

    2011-07-01

    Bacillus amyloliquefaciens is one of most prevalent Gram-positive aerobic spore-forming bacteria with the ability to synthesize polysaccharides and polypeptides. Here, we report the complete genome sequence of B. amyloliquefaciens LL3, which was isolated from fermented food and presents the glutamic acid-independent production of poly-γ-glutamic acid.

  7. Design, synthesis, and characterization of a protein sequencing reagent yielding amino acid derivatives with enhanced detectability by mass spectrometry.

    PubMed Central

    Aebersold, R.; Bures, E. J.; Namchuk, M.; Goghari, M. H.; Shushan, B.; Covey, T. C.

    1992-01-01

    We report the design, chemical synthesis, and structural and functional characterization of a novel reagent for protein sequence analysis by the Edman degradation, yielding amino acid derivatives rapidly detectable at high sensitivity by ion-evaporation mass spectrometry. We demonstrate that the reagent 3-[4'(ethylene-N,N,N-trimethylamino)phenyl]-2-isothiocyanate is chemically stable and shows coupling and cyclization/cleavage yields comparable to phenylisothiocyanate, the standard reagent in chemical sequence analysis, under conditions typically encountered in manual or automated sequence analysis. Amino acid derivatives generated with this reagent were detectable by ion-evaporation mass spectrometry at the subfemtomole sensitivity level at a pace of one sample per minute. Furthermore, derivatives were identified by their mass, thus permitting the rapid and highly sensitive determination of the molecular nature of modified amino acids. Derivatives of amino acids with acidic, basic, polar, or hydrophobic side chains were reproducibly detectable at comparable sensitivities. The polar nature of the reagent required covalent immobilization of polypeptides prior to automated sequence analysis. This reagent, used in automated sequence analysis, has the potential for overcoming the limitations in sensitivity, speed, and the ability to characterize modified amino acid residues inherent in the chemical sequencing methods that are currently used. PMID:1304351

  8. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production.

    PubMed

    Coulson, Thomas J D; Patten, Cheryl L

    2015-08-06

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences.

  9. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production

    PubMed Central

    Coulson, Thomas J. D.

    2015-01-01

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences. PMID:26251488

  10. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic acid bacterium used in the production of many fermented foods, such as dairy products. Here, we report the genome sequence of L. lactis subsp. lactis TOMSC161, isolated from nonscalded curd pressed cheese. This genome sequence provides information in relation to dairy environment adaptation. PMID:25377704

  11. Deoxyribonucleic acid sequence of araBAD promoter mutants of Escherichia coli.

    PubMed

    Horwitz, A H; Morandi, C; Wilcox, G

    1980-05-01

    The controlling site region for the araBAD operon is defined, in part, by two classes of cis-acting constitutive mutations. The aralc mutations allow low-level constitutive expression of ara-BAD in the absence of the positive regulatory protein coded for by the araC gene, whereas the araXc mutations allow expression of araBAD in the absence of the cyclic adenosine monophosphate receptor protein. Six independently isolated aralc mutations and three independently isolated araXc mutations were cloned onto the plasmid pBR322 using in vitro recombinant deoxyribonucleic acid techniques and in vivo recombination between plasmid and chromosomal deoxyribonucleic acid. The location of these mutations was determined by deoxyribonucleic acid sequence analysis. All of the aralc mutations occurred at position -35 within the araBAD promoter (+1 = messenger ribonucleic acid start for araBAD) and resulted from an AT leads to GC transition. All of the araXc mutations occurred at position -10 within the araBAD promoter and resulted from a GC leads to AT transition. Models are presented to explain the mode of action of the aralc and araXc mutations.

  12. A high-throughput data mining of single nucleotide polymorphisms in Coffea species expressed sequence tags suggests differential homeologous gene expression in the allotetraploid Coffea arabica.

    PubMed

    Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

    2010-11-01

    Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed.

  13. Sequence diversity patterns suggesting balancing selection in partially sex-linked genes of the plant Silene latifolia are not generated by demographic history or gene flow.

    PubMed

    Guirao-Rico, Sara; Sánchez-Gracia, Alejandro; Charlesworth, Deborah

    2017-03-01

    DNA sequence diversity in genes in the partially sex-linked pseudoautosomal region (PAR) of the sex chromosomes of the plant Silene latifolia is higher than expected from within-species diversity of other genes. This could be the footprint of sexually antagonistic (SA) alleles that are maintained by balancing selection in a PAR gene (or genes) and affect polymorphism in linked genome regions. SA selection is predicted to occur during sex chromosome evolution, but it is important to test whether the unexpectedly high sequence polymorphism could be explained without it, purely by the combined effects of partial linkage with the sex-determining region and the population's demographic history, including possible introgression from Silene dioica. To test this, we applied approximate Bayesian computation-based model choice to autosomal sequence diversity data, to find the most plausible scenario for the recent history of S. latifolia and then to estimate the posterior density of the most relevant parameters. We then used these densities to simulate variation to be expected at PAR genes. We conclude that an excess of variants at high frequencies at PAR genes should arise in S. latifolia populations only for genes with strong associations with fully sex-linked genes, which requires closer linkage with the fully sex-linked region than that estimated for the PAR genes where apparent deviations from neutrality were observed. These results support the need to invoke selection to explain the S. latifolia PAR gene diversity, and encourage further work to test the possibility of balancing selection due to sexual antagonism.

  14. Amino acid sequences of alpha-helical segments from S-carboxymethylkerateine-A. Tryptic and chymotryptic peptides from a type-II segment.

    PubMed Central

    Hogg, D M; Dowling, L M; Crewther, W G

    1978-01-01

    1. Amino acid-sequence studies were done on a peptide of mol.wt. approx. 12500 that was isolated from the highly helical fragments obtained by partial chymotryptic digestion of the low-sulphur proteins (S-carboxymethylkerateine-A) from wool. 2. The peptides obtained by tryptic and chymotryptic digestion of this large peptide were separated by ion-exchange chromatography on DEAE-cellulose at pH8.5 with an (NH4)(2)CO(3) concentration gradient and, where necessary, purified further by paper electrophoresis. 3. Determination of the sequences of many of these peptides showed that a high proportion of the cationic residues occurs in pairs. 4. Although two of the four S-carboxymethylcysteine residues are located in what appears to be a non-helical region near the N-terminus the other two S-carboxymethylcysteine residues occur in or near sequences suggesting a helical conformation. 5. Some peptides were obtained, in low yields, that appeared to be homologues of more major ones. These suggest either homologies in the helical portions of the low-sulphur proteins or the presence of closely related amino acid sequences in helical regions of completely different origins. 6. A partial sequence of the complete peptide is proposed. PMID:581263

  15. In the TTF-1 homeodomain the contribution of several amino acids to DNA recognition depends on the bound sequence.

    PubMed Central

    Fabbro, D; Tell, G; Leonardi, A; Pellizzari, L; Pucillo, C; Lonigro, R; Formisano, S; Damante, G

    1996-01-01

    The thyroid transcription factor-1 homeodomain (TTF-1HD) shows a peculiar DNA binding specificity, preferentially recognizing sequences containing the 5'-CAAG-3' core motif. Most other homeodomains instead recognize sites containing the 5'-TAAT-3' core motif. Here, we show that TTF-1HD efficiently recognizes another sequence, called D1, devoid of the 5'-CAAG-3' core motif. Different experimental approaches indicate that TTF-1HD contacts the D1 sequence in a manner which is different to that used to interact with sequences containing the 5'-CAAG-3' core motif. The binding activities that mutants of TTF-1HD display with the D1 sequence or with the sequence containing the 5'-CAAG-3' core motif indicate that the role of several DNA-contacting amino acids is different. In particular, during recognition of the D1 sequence, backbone-interacting amino acids not relevant in binding to sequences containing the 5'-CAAG-3' core motif play an important role. In the TTF-1HD, therefore, the contribution of several amino acids to DNA recognition depends on the bound sequence. These data indicate that although a common bonding network exists in all of the HD/DNA complexes, peculiarities important for DNA recognition may occur in single cases. PMID:8811078

  16. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed Central

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-01-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses. Images PMID:3025846

  17. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are a coagulase-negative staphylococci considered for a long time as unable to cause infections. This situation changed recently and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of respective hemolysins from S. cohnii ssp. cohnii (named as H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.

  18. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    PubMed

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  19. The Sequence-Specific Cellular Uptake of Spherical Nucleic Acid Nanoparticle Conjugates

    PubMed Central

    Narayan, Suguna P.; Choi, Chung Hang J.; Hao, Liangliang; Calabrese, Colin M.; Auyeung, Evelyn; Zhang, Chuan; Goor, Olga J.G.M.

    2015-01-01

    We investigated the sequence-dependent cellular uptake of spherical nucleic acid nanoparticle conjugates (SNAs). This process occurs by interaction with class A scavenger receptors (SR-A) and caveolae-mediated endocytosis. It is known that linear poly(guanine) (poly G) is a natural ligand for SR-A, and it has been proposed that interaction of poly G with SR-A is dependent on the formation of G-quadruplexes. Since G-rich oligonucleotides are known to interact strongly with SR-A, we hypothesized that SNAs with higher G contents would be able to enter cells in larger amounts than SNAs composed of other nucleotides, and as such we measured cellular internalization of SNAs as a function of constituent oligonucleotide sequence. Indeed, SNAs with enriched G content show the highest cellular uptake. Using this hypothesis, we chemically conjugated a small molecule (camptothecin) with SNAs to create drug-SNA conjugates and observed that poly G SNAs deliver the most camptothecin to cells and have the highest cytotoxicity in cancer cells. Our data elucidate important design considerations for enhancing the intracellular delivery of spherical nucleic acids. PMID:26097111

  20. Partial amino acid sequences around sulfhydryl groups of soybean beta-amylase.

    PubMed

    Nomura, K; Mikami, B; Morita, Y

    1987-08-01

    Sulfhydryl (SH) groups of soybean beta-amylase were modified with 5-(iodoaceto-amidoethyl)aminonaphthalene-1-sulfonate (IAEDANS) and the SH-containing peptides exhibiting fluorescence were purified after chymotryptic digestion of the modified enzyme. The sequence analysis of the peptides derived from the modification of all SH groups in the denatured enzyme revealed the existence of six SH groups, in contrast to five reported previously. One of them was found to have extremely low reactivity toward SH-reagents without reduction. In the native state, IAEDANS reacted with 2 mol of SH groups per mol of the enzyme (SH1 and SH2) accompanied with inactivation of the enzyme owing to the modification of SH2 located near the active site of this enzyme. The selective modification of SH2 with IAEDANS was attained after the blocking of SH1 with 5,5'-dithiobis-(2-nitrobenzoic acid). The amino acid sequences of the peptides containing SH1 and SH2 were determined to be Cys-Ala-Asn-Pro-Gln and His-Gln-Cys-Gly-Gly-Asn-Val-Gly-Asp-Ile-Val-Asn-Ile-Pro-Ile-Pro-Gln-Trp, respectively.

  1. Genome Sequence of Lactobacillus rhamnosus Strain CASL, an Efficient l-Lactic Acid Producer from Cheap Substrate Cassava

    PubMed Central

    Yu, Bo; Su, Fei; Wang, Limin; Zhao, Bo; Qin, Jiayang; Ma, Cuiqing; Xu, Ping; Ma, Yanhe

    2011-01-01

    Lactobacillus rhamnosus is a type of probiotic bacteria with industrial potential for l-lactic acid production. We announce the draft genome sequence of L. rhamnosus CASL (2,855,156 bp with a G+C content of 46.6%), which is an efficient producer of l-lactic acid from cheap, nonfood substrate cassava with a high production titer. PMID:22123765

  2. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  3. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed Central

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-01-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus. PMID:3355530

  4. Amino acid composition of the bushcricket spermatophore and the function of courtship feeding: Variable composition suggests a dynamic role of the nuptial gift.

    PubMed

    Jarrige, Alicia; Body, Mélanie; Giron, David; Greenfield, Michael D; Goubault, Marlène

    2015-11-01

    Nuptial gifts are packages of non-gametic material transferred by males to females at mating. These gifts are common in bushcrickets, where males produce a complex spermatophore consisting in a sperm-containing ampulla and an edible sperm-free spermatophylax. Two non-mutually exclusive hypotheses have been suggested to explain the function of the spermatophylax: the paternal investment hypothesis proposes that it represents a male nutritional investment in offspring; the mating effort hypothesis proposes that the spermatophylax maximizes the male's sperm transfer. Because gift production may represent significant energy expenditure, males are expected to adjust their investment relative to the perceived quality of the female. In this study, we first examined the free amino acid composition and protein-bound amino acid composition of the nuptial gift in the bushcricket, Ephippiger diurnus (Orthoptera: Tettigoniidae). Second, we investigated whether this composition was altered according to female age and body weight. Our study represents the first investigation of both free and protein-bound amino acid fractions of a bushcricket spermatophylax. We found that composition of the nuptial gift varied both qualitatively and quantitatively with respect to traits of the receiving female: older females received larger amounts of protein-bound amino acids (both essential and non-essential), less water and less free glycine. This result suggests that gift composition is highly labile in E. diurnus, and we propose that gift allocation might represent a form of cryptic male mate choice, allowing males to maximize their chances of paternity according to the risk of sperm competition that is associated with mate quality.

  5. Amino acid sequence of neurotoxin III of the scorpion Androctonus austrialis Hector.

    PubMed

    Kopeyan, C; Martinez, G; Rochat, H

    1979-03-01

    The amino acid sequence of neurotoxin III, purified from the venom of the North African scorpion Androctonus australis Hector, has been determined by Edman degradation using a liquid-phase sequencer. Carboxypeptidase A hydrolyses confirmed not only the sequence of the five last residues but also the presence of a free alpha-carboxylic group at the C-terminus. Edman degradation was conducted on one hand with the Quadrol [N,N,N',N'-tetrakis(2-hydroxypropyl)ethylene diamine] program and S-alkylated protein before or after coupling with sulfophenylisothiocynate (the first 34 residues were thus identified), on the other hand on tryptic and chymotryptic peptides with a dimethylbenzylamine program (residues 1--23 and 31--34 were confirmed, the positions of residues 35-64 were established). Neurotoxin III was found to belong to the same group of scorpion toxins active on mammals as neurotoxin I purified from the same venom (50 homologous positions exist in the two proteins).

  6. Purification, amino acid sequence and characterisation of kangaroo IGF-I.

    PubMed

    Yandell, C A; Francis, G L; Wheldrake, J F; Upton, Z

    1998-01-01

    Insulin-like growth factor-I (IGF-I) and IGF-II have been purified to homogeneity from kangaroo (Macropus fuliginosus) serum, thus this represents the first report of the purification, sequencing and characterisation of marsupial IGFs. N-Terminal protein sequencing reveals that there are six amino acid differences between kangaroo and human IGF-I. Kangaroo IGF-II has been partially sequenced and no differences were found between human and kangaroo IGF-II in the 53 residues identified. Thus the IGFs appear to be remarkably structurally conserved during mammalian radiation. In addition, in vitro characterisation of kangaroo IGF-I demonstrated that the functional properties of human, kangaroo and chicken IGF-I are very similar. In an assay measuring the ability of the proteins to stimulate protein synthesis in rat L6 myoblasts, all IGF-I proteins were found to be equally potent. The ability of all three proteins to compete for binding with radiolabelled human IGF-I to type-1 IGF receptors in L6 myoblasts and in Sminthopsis crassicaudata transformed lung fibroblasts, a marsupial cell line, was comparable. Furthermore, kangaroo and human IGF-I react equally in a human IGF-I RIA using a human reference standard, radiolabelled human IGF-I and a polyclonal antibody raised against recombinant human IGF-I. This study indicates that not only is the primary structure of eutherian and metatherian IGF-I conserved, but also the proteins appear to be functionally similar.

  7. Complete Genome Sequence of the Prototype Lactic Acid Bacterium Lactococcus lactis subsp. cremoris MG1363▿

    PubMed Central

    Wegmann, Udo; O'Connell-Motherway, Mary; Zomer, Aldert; Buist, Girbe; Shearman, Claire; Canchaya, Carlos; Ventura, Marco; Goesmann, Alexander; Gasson, Michael J.; Kuipers, Oscar P.; van Sinderen, Douwe; Kok, Jan

    2007-01-01

    Lactococcus lactis is of great importance for the nutrition of hundreds of millions of people worldwide. This paper describes the genome sequence of Lactococcus lactis subsp. cremoris MG1363, the lactococcal strain most intensively studied throughout the world. The 2,529,478-bp genome contains 81 pseudogenes and encodes 2,436 proteins. Of the 530 unique proteins, 47 belong to the COG (clusters of orthologous groups) functional category “carbohydrate metabolism and transport,” by far the largest category of novel proteins in comparison with L. lactis subsp. lactis IL1403. Nearly one-fifth of the 71 insertion elements are concentrated in a specific 56-kb region. This integration hot-spot region carries genes that are typically associated with lactococcal plasmids and a repeat sequence specifically found on plasmids and in the “lateral gene transfer hot spot” in the genome of Streptococcus thermophilus. Although the parent of L. lactis MG1363 was used to demonstrate lysogeny in Lactococcus, L. lactis MG1363 carries four remnant/satellite phages and two apparently complete prophages. The availability of the L. lactis MG1363 genome sequence will reinforce its status as the prototype among lactic acid bacteria through facilitation of further applied and fundamental research. PMID:17307855

  8. Whole Exome Sequencing Suggests Much of Non-BRCA1/BRCA2 Familial Breast Cancer Is Due to Moderate and Low Penetrance Susceptibility Alleles

    PubMed Central

    Gracia-Aznarez, Francisco Javier; Fernandez, Victoria; Pita, Guillermo; Peterlongo, Paolo; Dominguez, Orlando; de la Hoya, Miguel; Duran, Mercedes; Osorio, Ana; Moreno, Leticia; Gonzalez-Neira, Anna; Rosa-Rosa, Juan Manuel; Sinilnikova, Olga; Mazoyer, Sylvie; Hopper, John; Lazaro, Conchi; Southey, Melissa; Odefrey, Fabrice; Manoukian, Siranoush; Catucci, Irene; Caldes, Trinidad; Lynch, Henry T.; Hilbers, Florentine S. M.; van Asperen, Christi J.; Vasen, Hans F. A.; Goldgar, David; Radice, Paolo; Devilee, Peter; Benitez, Javier

    2013-01-01

    The identification of the two most prevalent susceptibility genes in breast cancer, BRCA1 and BRCA2, was the beginning of a sustained effort to uncover new genes explaining the missing heritability in this disease. Today, additional high, moderate and low penetrance genes have been identified in breast cancer, such as P53, PTEN, STK11, PALB2 or ATM, globally accounting for around 35 percent of the familial cases. In the present study we used massively parallel sequencing to analyze 7 BRCA1/BRCA2 negative families, each having at least 6 affected women with breast cancer (between 6 and 10) diagnosed under the age of 60 across generations. After extensive filtering, Sanger sequencing validation and co-segregation studies, variants were prioritized through either control-population studies, including up to 750 healthy individuals, or case-control assays comprising approximately 5300 samples. As a result, a known moderate susceptibility indel variant (CHEK2 1100delC) and a catalogue of 11 rare variants presenting signs of association with breast cancer were identified. All the affected genes are involved in important cellular mechanisms like DNA repair, cell proliferation and survival or cell cycle regulation. This study highlights the need to investigate the role of rare variants in familial cancer development by means of novel high throughput analysis strategies optimized for genetically heterogeneous scenarios. Even considering the intrinsic limitations of exome resequencing studies, our findings support the hypothesis that the majority of non-BRCA1/BRCA2 breast cancer families might be explained by the action of moderate and/or low penetrance susceptibility alleles. PMID:23409019

  9. The ABRF Edman Sequencing Research Group 2008 Study: Investigation into Homopolymeric Amino Acid N-Terminal Sequence Tags and Their Effects on Automated Edman Degradation

    PubMed Central

    Thoma, R. S.; Smith, J. S.; Sandoval, W.; Leone, J. W.; Hunziker, P.; Hampton, B.; Linse, K. D.; Denslow, N. D.

    2009-01-01

    The Edman Sequence Research Group (ESRG) of the Association of Biomolecular Resource designs and executes interlaboratory studies investigating the use of automated Edman degradation for protein and peptide analysis. In 2008, the ESRG enlisted the help of core sequencing facilities to investigate the effects of a repeating amino acid tag at the N-terminus of a protein. Commonly, to facilitate protein purification, an affinity tag containing a polyhistidine sequence is conjugated to the N-terminus of the protein. After expression, polyhistidine-tagged protein is readily purified via chelation with an immobilized metal affinity resin. The addition of the polyhistidine tag presents unique challenges for the determination of protein identity using Edman degradation chemistry. Participating laboratories were asked to sequence one protein engineered in three configurations: with an N-terminal polyhistidine tag; with an N-terminal polyalanine tag; or with no tag. Study participants were asked to return a data file containing the uncorrected amino acid picomole yields for the first 17 cycles. Initial and repetitive yield (R.Y.) information and the amount of lag were evaluated. Information about instrumentation and sample treatment was also collected as part of the study. For this study, the majority of participating laboratories successfully called the amino acid sequence for 17 cycles for all three test proteins. In general, laboratories found it more difficult to call the sequence containing the polyhistidine tag. Lag was observed earlier and more consistently with the polyhistidine-tagged protein than the polyalanine-tagged protein. Histidine yields were significantly less than the alanine yields in the tag portion of each analysis. The polyhistidine and polyalanine protein-R.Y. calculations were found to be equivalent. These calculations showed that the nontagged portion from each protein was equivalent. The terminal histidines from the tagged portion of the protein

  10. The amino acid sequence around the active-site cysteine and histidine residues, and the buried cysteine residue in ficin.

    PubMed

    Husain, S S; Lowe, G

    1970-04-01

    Ficin that had been prepared from the latex of Ficus glabrata by salt fractionation and chromatography on carboxymethylcellulose was completely and irreversibly inhibited with 1,3-dibromo[2-(14)C]acetone and then treated with N-(4-dimethylamino-3,5-dinitrophenyl)maleimide in 6m-guanidinium chloride. After reduction and carboxymethylation of the labelled protein, it was digested with trypsin and alpha-chymotrypsin. Two radioactive peptides and two coloured peptides were isolated chromatographically and their sequences determined. The radioactive peptides revealed the amino acid sequences around the active-site cysteine and histidine residues and showed a high degree of homology with the omino acid sequence around the active-site cysteine and histidine residues in papain. The coloured peptides allowed the amino acid sequence around the buried cysteine residue in ficin to be determined.

  11. The `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis: isolation of the gene, nucleotide and amino acid sequence

    PubMed Central

    Michel, H.; Weyer, K. A.; Gruenberg, H.; Lottspeich, F.

    1985-01-01

    The gene coding for the `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis was isolated in an expression vector. Expression of the heavy subunit in Escherichia coli was detected with antibodies raised against crystalline reaction centres. The entire subunit, and not a fusion protein, was expressed in E. coli. The protein coding region of the gene was sequenced and the amino acid sequence derived. Part of the amino acid sequence was confirmed by chemical sequence analysis of the protein. The heavy subunit consists of 258 amino acids and its mol. wt. is 28 345. It possesses one membrane-spanning α-helical segment, as was revealed by the concomitant X-ray structure analysis. ImagesFig. 1.Fig. 2. PMID:16453623

  12. Purification, amino acid sequence and immunological characterization of Ole e 6, a cysteine-enriched allergen from olive tree pollen.

    PubMed

    Batanero, E; Ledesma, A; Villalba, M; Rodríguez, R

    1997-06-30

    The Ole e 6 allergen from olive tree pollen has been isolated by combining gel permeation and reverse-phase chromatographies. It is a single and highly acidic (pI 4.2) polypeptide chain protein. Its NH2-terminal amino acid sequence has been determined by Edman degradation. Total RNA from the olive tree pollen was isolated, and a specific cDNA was amplified by the polymerase chain reaction using a degenerate oligonucleotide primer designed according to the NH2-terminal sequence of the protein. The nucleotide sequencing of the cDNA rendered an open reading frame encoding a 50 amino acid polypeptide chain, in which two sets of the sequential motif Cys-X3-Cys-X3-Cys are present. No sequence similarity has been found between this protein and other previously described polypeptides.

  13. Nucleotide and derived amino acid sequences of the major porin of Comamonas acidovorans and comparison of porin primary structures.

    PubMed Central

    Gerbl-Rieger, S; Peters, J; Kellermann, J; Lottspeich, F; Baumeister, W

    1991-01-01

    The DNA sequence of the gene which codes for the major outer membrane porin (Omp32) of Comamonas acidovorans has been determined. The structural gene encodes a precursor consisting of 351 amino acid residues with a signal peptide of 19 amino acid residues. Comparisons with amino acid sequences of outer membrane proteins and porins from several other members of the class Proteobacteria and of the Chlamydia trachomatis porin and the Neurospora crassa mitochondrial porin revealed a motif of eight regions of local homology. The results of this analysis are discussed with regard to common structural features of porins. PMID:1848840

  14. The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.

    PubMed

    White, S H

    1994-04-01

    This paper continues an examination of the hypothesis that modern proteins evolved from random heteropeptide sequences. In support of the hypothesis, White and Jacobs (1993, J Mol Evol 36:79-95) have shown that any sequence chosen randomly from a large collection of nonhomologous proteins has a 90% or better chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. The goal of the present study was to investigate the possibility that the random-origin hypothesis could explain the lengths of modern protein sequences without invoking specific mechanisms such as gene duplication or exon splicing. The sets of sequences examined were taken from the 1989 PIR database and consisted of 1,792 "super-family" proteins selected to have little sequence identity, 623 E. coli sequences, and 398 human sequences. The length distributions of the proteins could be described with high significance by either of two closely related probability density functions: The gamma distribution with parameter 2 or the distribution for the sum of two exponential random independent variables. A simple theory for the distributions was developed which assumes that (1) protoprotein sequences had exponentially distributed random independent lengths, (2) the length dependence of protein stability determined which of these protoproteins could fold into compact primitive proteins and thereby attain the potential for biochemical activity, (3) the useful protein sequences were preserved by the primitive genome, and (4) the resulting distribution of sequence lengths is reflected by modern proteins. The theory successfully predicts the two observed distributions which can be distinguished by the functional form of the dependence of protein stability on length. The theory leads to three interesting conclusions. First, it predicts that a tetra-nucleotide was the signal for primitive translation termination. This prediction is

  15. Proteaselike sequence in hepatitis B virus core antigen is not required for e antigen generation and may not be part of an aspartic acid-type protease.

    PubMed Central

    Nassal, M; Galle, P R; Schaller, H

    1989-01-01

    The hepatitis B virus (HBV) C gene directs the synthesis of two major gene products: HBV core antigen (HBcAg[p21c]), which forms the nucleocapsid, and HBV e antigen (HBeAg [p17e]), a secreted antigen that is produced by several processing events during its maturation. These proteins contain an amino acid sequence similar to the active-site residues of aspartic acid and retroviral proteases. On the basis of this sequence similarity, which is highly conserved among mammalian hepadnaviruses, a model has been put forward according to which processing to HBeAg is due to self-cleavage of p21c involving the proteaselike sequence. Using site-directed mutagenesis in conjunction with transient expression of HBV proteins in the human hepatoma cell line HepG2, we tested this hypothesis. Our results with HBV mutants in which one or two of the conserved amino acids have been replaced by others suggest strongly that processing to HBeAg does not depend on the presence of an intact proteaselike sequence in the core protein. Attempts to detect an influence of this sequence on the processing of HBV P gene products into enzymatically active viral polymerase also gave no conclusive evidence for the existence of an HBV protease. Mutations replacing the putatively essential aspartic acid showed little effect on polymerase activity. Additional substitution of the likewise conserved threonine residue by alanine, in contrast, almost abolished the activity of the polymerase. We conclude that an HBV protease, if it exists, is functionally different from aspartic acid and retroviral proteases. Images PMID:2657101

  16. Structure of LP2179, the first representative of Pfam family PF08866, suggests a new fold with a role in amino-acid metabolism

    PubMed Central

    Bakolitsa, Constantina; Kumar, Abhinav; Carlton, Dennis; Miller, Mitchell D.; Krishna, S. Sri; Abdubek, Polat; Astakhova, Tamara; Axelrod, Herbert L.; Chiu, Hsiu-Ju; Clayton, Thomas; Deller, Marc C.; Duan, Lian; Elsliger, Marc-André; Feuerhelm, Julie; Grzechnik, Slawomir K.; Grant, Joanna C.; Han, Gye Won; Jaroszewski, Lukasz; Jin, Kevin K.; Klock, Heath E.; Knuth, Mark W.; Kozbial, Piotr; Marciano, David; McMullan, Daniel; Morse, Andrew T.; Nigoghossian, Edward; Okach, Linda; Oommachen, Silvya; Paulsen, Jessica; Reyes, Ron; Rife, Christopher L.; Tien, Henry J.; Trout, Christina V.; van den Bedem, Henry; Weekes, Dana; Xu, Qingping; Hodgson, Keith O.; Wooley, John; Deacon, Ashley M.; Godzik, Adam; Lesley, Scott A.; Wilson, Ian A.

    2010-01-01

    The structure of LP2179, a member of the PF08866 (DUF1831) family, suggests a novel α+β fold comprising two β-sheets packed against a single helix. A remote structural similarity to two other uncharacterized protein families specific to the Bacillus genus (PF08868 and PF08968), as well as to prokaryotic S-adenosylmethionine decarboxylases, is consistent with a role in amino-acid metabolism. Genomic neighborhood analysis of LP2179 supports this functional assignment, which might also then be extended to PF08868 and PF08968. PMID:20944212

  17. Amino acid sequence diversity of the major human papillomavirus capsid protein: Implications for current and next generation vaccines☆

    PubMed Central

    Ahmed, Amina I.; Bissett, Sara L.; Beddows, Simon

    2013-01-01

    Despite the fidelity of host cell polymerases, the human papillomavirus (HPV) displays a degree of genomic polymorphism resulting in distinct genotypes and intra-type variants. The current HPV vaccines target the most prevalent genotypes associated with cervical cancer (HPV16/18) and genital warts (HPV6/11). Although these vaccines confer some measure of cross-protection, a multivalent HPV vaccine is in the pipeline that aims to broaden vaccine protection against other cervical cancer-associated genotypes including HPV31, HPV33, HPV45, HPV52 and HPV58. Both current and next generation vaccines comprise virus-like particles, based upon the major capsid protein, L1, and vaccine-induced, type-specific protection is likely mediated by neutralizing antibodies targeting L1 surface-exposed domains. The aim of this study was to perform an in silico analysis of existing full length L1 sequences representing vaccine-relevant HPV genotypes in order to address the degree of naturally-occurring, intra-type polymorphisms. In total, 1281 sequences from the Americas, Africa, Asia and Europe were assembled. Intra-type entropy was low and/or limited to non-surface-exposed residues for HPV6, HPV11 and HPV52 suggesting a minimal effect on vaccine antibodies for these genotypes. For HPV16, intra-type entropy was high but the present analysis did not reveal any significant polymorphisms not previously identified. For HPV31, HPV33, HPV58, however, intra-type entropy was high, mostly mapped to surface-exposed domains and in some cases within known neutralizing antibody epitopes. For HPV18 and HPV45 there were too few sequences for a definitive analysis, but HPV45 displayed some degree of surface-exposed residue diversity. In most cases, the reference sequence for each genotype represented a minority variant and the consensus L1 sequences for HPV18, HPV31, HPV45 and HPV58 did not reflect the L1 sequence of the currently available HPV pseudoviruses. These data highlight a number of variant

  18. An amino acid sequence motif sufficient for subnuclear localization of an arginine/serine-rich splicing factor.

    PubMed

    Hedley, M L; Amrein, H; Maniatis, T

    1995-12-05

    We have identified an amino acid sequence in the Drosophila Transformer (Tra) protein that is capable of directing a heterologous protein to nuclear speckles, regions of the nucleus previously shown to contain high concentrations of spliceosomal small nuclear RNAs and splicing factors. This sequence contains a nucleoplasmin-like bipartite nuclear localization signal (NLS) and a repeating arginine/serine (RS) dipeptide sequence adjacent to a short stretch of basic amino acids. Sequence comparisons from a number of other splicing factors that colocalize to nuclear speckles reveal the presence of one or more copies of this motif. We propose a two-step subnuclear localization mechanism for splicing factors. The first step is transport across the nuclear envelope via the nucleoplasmin-like NLS, while the second step is association with components in the speckled domain via the RS dipeptide sequence.

  19. Purification and partial amino acid sequence of the chloroplast cytochrome b-559.

    PubMed

    Widger, W R; Cramer, W A; Hermodson, M; Meyer, D; Gullifor, M

    1984-03-25

    The hydrophobic cytochrome b-559, purified from unstacked, ethanol-washed spinach thylakoid membranes, using extraction with 2% Triton X-100 in 4 M urea and three chromatographic steps in the presence of protease inhibitors, has a dominant band on sodium dodecyl sulfate-urea gels corresponding to Mr = 10,000. The yield of this preparation is 30-50% (5-10 mg) starting with 600 mg of chlorophyll. The heme content yields a calculated molecular weight of no more than 17,500/heme, and perhaps somewhat smaller after correction for impurities. The Mr = 10,000 band is stained by the tetramethylbenzidine-H2O2 heme reagent on lithium dodecyl sulfate gels run at 0 degrees C. The Mr = 10,000 protein, further separated by high performance liquid chromatography, contains a unique NH2 terminus that is not blocked, and the amino acid sequence for the first 27 residues is NH2-Ser-Gly-Ser-Thr-Gly-Glu-Arg-Ser-Phe-Ala-Asp-Ile-Ile-Thr-Ser-Ile-Arg-Tyr-Trp -Val-Ile-X-Ser-Ile-Thr-Ile-Pro. . . COOH. Approximately 55% of the amino acids are hydrophobic, based on amino acid analysis of the Mr = 10,000 peptide, which also indicated the presence of at least one histidine. Only one cytochrome b-559 component could be identified, whose yield indicated that it arises from a single b-559 protein in chloroplasts corresponding to the in situ high potential cytochrome of the chloroplast photosystem II.

  20. Two amino acid sequences direct Aspergillus nidulans protein kinase C (PkcA) localization to hyphal apices and septation sites.

    PubMed

    Jackson-Hayes, Loretta; Hill, Terry W; Loprete, Darlene M; DelBove, Claire E; Shapiro, Justin A; Henley, Jordan L; Dawodu, Omolola O

    2015-01-01

    The Aspergillus nidulans ortholog of protein kinase C (pkcA) is involved in the organism's putative cell wall integrity (CWI) pathway, and PkcA also is highly localized at growing tips and forming septa. In the present work we identify the regions within PkcA that are responsible for its localization to hyphal tips and septation sites. To this end, we used serially truncated pkcA constructs and expressed them as green fluorescent protein (GFP) chimeras and identified two regions that direct PkcA localization. The first region is a 10 amino-acid sequence near the carboxyl end of the C2 domain that is required for localization to hyphal tips. Proteins containing this sequence also localize to septation sites. A second region between C2 and C1B (encompassing C1A) is sufficient for localization to septation sites but not to hyphal tips. We also report that localization to hyphal tips and septation sites alone is not sufficient for truncated constructs to complement hypersensitivity to the cell wall compromising agent calcofluor white in a strain bearing a mutation in the pkcA gene. Taken together, these results suggest that localization and stress response might be independent.

  1. Targeted sequencing of BRCA1 and BRCA2 across a large unselected breast cancer cohort suggests that one-third of mutations are somatic

    PubMed Central

    Winter, C.; Nilsson, M. P.; Olsson, E.; George, A. M.; Chen, Y.; Kvist, A.; Törngren, T.; Vallon-Christersson, J.; Hegardt, C.; Häkkinen, J.; Jönsson, G.; Grabau, D.; Malmberg, M.; Kristoffersson, U.; Rehn, M.; Gruvberger-Saal, S. K.; Larsson, C.; Borg, Å.; Loman, N.; Saal, L. H.

    2016-01-01

    Background A mutation found in the BRCA1 or BRCA2 gene of a breast tumor could be either germline or somatically acquired. The prevalence of somatic BRCA1/2 mutations and the ratio between somatic and germline BRCA1/2 mutations in unselected breast cancer patients are currently unclear. Patients and methods Paired normal and tumor DNA was analyzed for BRCA1/2 mutations by massively parallel sequencing in an unselected cohort of 273 breast cancer patients from south Sweden. Results Deleterious germline mutations in BRCA1 (n = 10) or BRCA2 (n = 10) were detected in 20 patients (7%). Deleterious somatic mutations in BRCA1 (n = 4) or BRCA2 (n = 5) were detected in 9 patients (3%). Accordingly, about 1 in 9 breast carcinomas (11%) in our cohort harbor a BRCA1/2 mutation. For each gene, the tumor phenotypes were very similar regardless of the mutation being germline or somatically acquired, whereas the tumor phenotypes differed significantly between wild-type and mutated cases. For age at diagnosis, the patients with somatic BRCA1/2 mutations resembled the wild-type patients (median age at diagnosis, germline BRCA1: 41.5 years; germline BRCA2: 49.5 years; somatic BRCA1/2: 65 years; wild-type BRCA1/2: 62.5 years). Conclusions In a population without strong germline founder mutations, the likelihood of a BRCA1/2 mutation found in a breast carcinoma being somatic was ∼1/3 and germline 2/3. This may have implications for treatment and genetic counseling. PMID:27194814

  2. Correlation between carbohydrate-binding specificity and amino acid sequence of carbohydrate-binding regions of Cytisus-type anti-H(O) lectins.

    PubMed

    Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

    1992-06-15

    A carbohydrate-binding peptide of the di-N-acetylchitobiose-binding Cytisus sessilifolius anti-H(O) lectin I (CSA-I) was isolated from the endoproteinase Asp-N digest of CSA-I by affinity chromatography on a column of N-acetyl-D-glucosamine oligomer-Sepharose (GlcNAc oligomer-Sepharose). The amino acid sequence of the carbohydrate-binding peptide of CSA-I was determined to be DTYFGKTYNPW using a gas-phase protein sequencer. This sequence corresponds to the sequence from Asp-129 to Trp-139 based on the primary structure of CSA-I, and shows a high degree of homology to those of the putative carbohydrate-binding peptide of the Laburnum alpinum lectin I (LAA-I) (DTYFGKAYNPW) and of the Ulex europaeus lectin II (UEA-II) (DSYFGKTYNPW). The binding of these three anti-H(O) lectins is known to be inhibited by di-N-acetylchitobiose but not by L-fucose. These results strongly suggest that there is a good correlation between the carbohydrate-binding specificity and the amino acid sequence of the carbohydrate-binding regions of di-N-acetylchitobiose-binding lectins.

  3. The impact of monomer sequence and stereochemistry on the swelling and erosion of biodegradable poly(lactic-co-glycolic acid) matrices.

    PubMed

    Washington, Michael A; Swiner, Devin J; Bell, Kerri R; Fedorchak, Morgan V; Little, Steven R; Meyer, Tara Y

    2017-02-01

    Monomer sequence is demonstrated to be a primary factor in determining the hydrolytic degradation profile of poly(lactic-co-glycolic acid)s (PLGAs). Although many approaches have been used to tune the degradation of PLGAs, little effort has been expended in exploring the sequence-control strategy exploited by nature in biopolymers. Cylindrical matrices and films prepared from a series of sequenced and random PLGAs were subjected to hydrolysis in a pH 7.4 buffer at 37 °C. Swelling ranged from 107% for the random racemic PLGA with a 50:50 ratio of lactic (L) to glycolic (G) units to 6% for the sequenced alternating copolymer poly LG. Erosion followed an inverse trend with the random 50:50 PLGA showing an erosion half-life of 3-4 weeks while poly LG required ca. >10 weeks. Stereosequence was found to play a large role in determining swelling and erosion; stereopure analogs swelled less and were slower to lose mass. Molecular weight loss followed similar trends and increases in dispersity correlated with the onset of significant swelling. The relative proportion of rapidly cleavable G-G linkages relative to G-L/L-G (moderate) and L-L (slow) correlates strongly with the degree of swelling observed and the rate of erosion. The dramatic sequence-dependent variation in swelling, in the absence of a parallel hydrophilicity trend, suggest that osmotic pressure, driven by the differential accumulation of degradation products, plays an important role.

  4. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    SciTech Connect

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-07-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO/sub 4//PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  5. Real-Time Nucleic Acid Sequence-Based Amplification Assay for Detection of Hepatitis A Virus

    PubMed Central

    Abd El Galil, Khaled H.; El Sokkary, M. A.; Kheira, S. M.; Salazar, Andre M.; Yates, Marylynn V.; Chen, Wilfred; Mulchandani, Ashok

    2005-01-01

    A nucleic acid sequence-based amplification (NASBA) assay in combination with a molecular beacon was developed for the real-time detection and quantification of hepatitis A virus (HAV). A 202-bp, highly conserved 5′ noncoding region of HAV was targeted. The sensitivity of the real-time NASBA assay was tested with 10-fold dilutions of viral RNA, and a detection limit of 1 PFU was obtained. The specificity of the assay was demonstrated by testing with other environmental pathogens and indicator microorganisms, with only HAV positively identified. When combined with immunomagnetic separation, the NASBA assay successfully detected as few as 10 PFU from seeded lake water samples. Due to its isothermal nature, its speed, and its similar sensitivity compared to the real-time RT-PCR assay, this newly reported real-time NASBA method will have broad applications for the rapid detection of HAV in contaminated food or water. PMID:16269748

  6. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes , cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  7. Sequence-defined shuttles for targeted nucleic acid and protein delivery.

    PubMed

    Röder, Ruth; Wagner, Ernst

    2014-01-01

    Molecular medicine opens into a space of novel specific therapeutic agents: intracellularly active drugs such as peptides, proteins or nucleic acids, which are not able to cross cell membranes and enter the intracellular space on their own. Through the development of cell-targeted shuttles for specific delivery, this restriction in delivery has the potential to be converted into an advantage. On the one hand, due to the multiple extra- and intracellular barriers, such carrier systems need to be multifunctional. On the other hand, they must be precise and reproducibly manufactured due to pharmaceutical reasons. Here we review the design of precise sequence-defined delivery carriers, including solid-phase synthesized peptides and nonpeptidic oligomers, or nucleotide-based carriers such as aptamers and origami nanoboxes.

  8. Identification of amino acid sequences in the polyomavirus capsid proteins that serve as nuclear localization signals

    NASA Technical Reports Server (NTRS)

    Chang, D.; Haynes, J. I. Jr; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    The molecular mechanism participating in the transport of newly synthesized proteins from the cytoplasm to the nucleus in mammalian cells is poorly understood. Recently, the nuclear localization signal sequences (NLS) of many nuclear proteins have been identified, and most have been found to be composed of a highly basic amino acid stretch. A genetic "subtractive" and a biochemical "additive" approach were used in our studies to identify the NLS's of the polyomavirus structural capsid proteins. An NLS was identified at the N-terminus (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) of the major capsid protein VP1 and at the C-terminus (Glu307 -Glu-Asp-Gly-Pro-Glu-Lys-Lys-Lys-Arg-Arg-Leu318) of the VP2/VP3 minor capsid proteins.

  9. The amino acid sequence of a carbohydrate-containing fragment of hen ovotransferrin.

    PubMed Central

    Kingston, I B; Williams, J

    1975-01-01

    1. Hen ovotransferrin was treated with CNBr and fractionated by gel filtration. 2. After further treatment by reduction and carboxymethylation a carbohydrate-containing fragment of molecular weight 11990 was obtained (fragment BCd). 3. The amino acid sequence of this fragment was determined. It consists of a single chain of 94 residues. 4. The structure of a tryptic glycopeptide derived from whole ovotransferrin permitted a further eight residues to be assigned at the N-terminus of fragment BCd. 5. Heterogeneity was found at two positions. 6. Further evidence has been deposited as Supplementary Publication SUP 50045 (19 pages) at the British Library (Lending Division), Boston Spa, Wetherby, W. Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1975), 145, 5. PMID:1172663

  10. Phylogenetic analysis of beta-papillomaviruses as inferred from nucleotide and amino acid sequence data.

    PubMed

    Gottschling, Marc; Köhler, Anja; Stockfleth, Eggert; Nindl, Ingo

    2007-01-01

    Human papillomaviruses (HPV) of the beta-group seem to be involved in the pathogenesis of non-melanoma skin cancer. Papillomaviruses are host specific and are considered closely co-evolving with their hosts. Evolutionary incongruence between early genes and late genes has been reported among oncogenic genital alpha-papillomaviruses and considerably challenge phylogenetic reconstructions. We investigated the relationships of 29 beta-HPV (25 types plus four putative new types, subtypes, or variants) as inferred from codon aligned and amino acid sequence data of the genes E1, E2, E6, E7, L1, and L2 using likelihood, distance, and parsimony approaches. An analysis of a L1 fragment included additional nucleotide and amino acid sequences from seven non-human beta-papillomaviruses. Early genes and late genes evolution did not conflict significantly in beta-papillomaviruses based on partition homogeneity tests (p > or = 0.001). As inferred from the complete genome analyses, beta-papillomaviruses were monophyletic and segregated into four highly supported monophyletic assemblages corresponding to the species 1, 2, 3, and fused 4/5. They basically split into the species 1 and the remainder of beta-papillomaviruses, whose species 3, 4, and 5 constituted the sistergroup of species 2. beta-Papillomaviruses have been isolated from humans, apes, and monkeys, and phylogenetic analyses of the L1 fragment showed non-human papillomaviruses highly polyphyletic nesting within the HPV species. Thus, host and virus phylogenies were not congruent in beta-papillomaviruses, and multiple invasions across species borders may contribute (additionally to host-linked evolution) to their diversification.

  11. Amino acid sequence and chemical modification of a novel alpha-neurotoxin (Oh-5) from king cobra (Ophiophagus hannah) venom.

    PubMed

    Lin, S R; Leu, L F; Chang, L S; Chang, C C

    1997-04-01

    A novel alpha-neurotoxin, Oh-5, was isolated from king cobra (Ophiophagus hannah) venom and purified by successive SP-Sephadex C-25 column chromatography and reversed-phase HPLC. The complete sequence of Oh-5 was determined by Edman degradation of peptide fragments generated by endopeptidases, i.e., trypsin, Saccharomyces aureus V8 protease and lysyl endopeptidase. This novel toxin comprises 72 amino acid residues with 10 cysteines. The sequence shows 89% sequence homology with Oh-4, and 60% with Toxins a and b from the same venom. The tyrosine, tryptophan, lysine and arginine residues in Oh-5 were modified with tetranitromethane (TNM), 2-nitrophenylsulfenyl (NPS) chloride, trinitrobenzene sulfonate (TNBS), and p-hydroxyphenylglyoxal (HPG), respectively. Modification of Tyr-4 or Trp-27 did not affect the lethal toxicity at all, while the Tyr-4 and 23 nitrated derivative retained about 50% of the lethality of native toxin. Selective trinitrophenylation of Lys-51 or 69 resulted in a decrease in lethality by 29%, and 50% lethality was retained after modification of Lys-2, 51, and 69. A drastic decrease in lethality to 26% was observed when both Arg-35 and 37 were modified. The neurotoxicity was further decreased when Arg-9 was additionally modified. These results suggest that the aromatic residues, Tyr-4 and Trp-27, are not crucial for the neurotoxicity, whereas the cationic residues are involved in multipoint contact between the toxin molecule and the nicotinic acetylcholine receptor (nAChR). The residues Tyr-23 and Arg-35 and 37 in the central loop of Oh-5 seem to contribute greatly to the neurotoxicity.

  12. Amino acid sequence homology between rat and human C-reactive protein.

    PubMed Central

    Taylor, J A; Bruton, C J; Anderson, J K; Mole, J E; De Beer, F C; Baltz, M L; Pepys, M B

    1984-01-01

    The rat serum protein that undergoes Ca2+-dependent binding to pneumococcal C-polysaccharide and to phosphocholine residues, and that is evidently a member of the pentraxin family of proteins by virtue of its appearance under the electron microscope, has been variously designated as rat C-reactive protein (CRP) [de Beer, Baltz, Munn, Feinstein, Taylor, Bruton, Clamp & Pepys (1982) Immunology 45, 55-70], 'phosphoryl choline-binding protein' [Nagpurkar & Mookerjea (1981) J. Biol. Chem. 256, 7440-7448] and rat serum amyloid P component (SAP) [Pontet, D'Asnieres, Gache, Escaig & Engler (1981) Biochim. Biophys. Acta 671, 202-210]. The partial amino acid sequence (45 residues) towards the C-terminus of this protein was determined, and it showed 71.7% identity with the known sequence of human CRP but only 54.3% identity with human SAP. Since human CRP and SAP are themselves approximately 50% homologous, the level of identity between the rat protein and human SAP is evidence only of membership of the pentraxin family. In contrast, the much greater resemblance to human CRP confirms that the rat C-polysaccharide-binding/phosphocholine-binding protein is in fact rat CRP. PMID:6477504

  13. Limited proteolysis and sequence analysis of the 2-oxo acid dehydrogenase complexes from Escherichia coli. Cleavage sites and domains in the dihydrolipoamide acyltransferase components.

    PubMed Central

    Packman, L C; Perham, R N

    1987-01-01

    The structures of the dihydrolipoamide acyltransferase (E2) components of the 2-oxo acid dehydrogenase complexes from Escherichia coli were investigated by limited proteolysis. Trypsin and Staphylococcus aureus V8 proteinase were used to excise the three lipoyl domains from the E2p component of the pyruvate dehydrogenase complex and the single lipoyl domain from the E2o component of the 2-oxoglutarate dehydrogenase complex. The principal sites of action of these enzymes on each E2 chain were determined by sequence analysis of the isolated lipoyl fragments and of the truncated E2p and E2o chains. Each of the numerous cleavage sites (12 in E2p, six in E2o) fell within similar segments of the E2 chains, namely stretches of polypeptide rich in alanine, proline and/or charged amino acids. These regions are clearly accessible to proteinases of Mr 24,000-28,000 and, on the basis of n.m.r. spectroscopy, some of them have previously been implicated in facilitating domain movements by virtue of their conformational flexibility. The limited proteolysis data suggest that E2p and E2o possess closer architectural similarities than would be predicted from inspection of their amino acid sequences. As a result of this work, an error was detected in the sequence of E2o inferred from the previously published sequence of the encoding gene, sucB. The relevant peptides from E2o were purified and sequenced by direct means; an amended sequence is presented. Images Fig. 1. Fig. 2. PMID:3297046

  14. Biosynthesis of D-alanyl-lipoteichoic acid: cloning, nucleotide sequence, and expression of the Lactobacillus casei gene for the D-alanine-activating enzyme.

    PubMed Central

    Heaton, M P; Neuhaus, F C

    1992-01-01

    The D-alanine-activating enzyme (Dae; EC 6.3.2.4) encoded by the dae gene from Lactobacillus casei ATCC 7469 is a cytosolic protein essential for the formation of the D-alanyl esters of membrane-bound lipoteichoic acid. The gene has been cloned, sequenced, and expressed in Escherichia coli, an organism which does not possess Dae activity. The open reading frame is 1,518 nucleotides and codes for a protein of 55.867 kDa, a value in agreement with the 56 kDa obtained by electrophoresis. A putative promoter and ribosome-binding site immediately precede the dae gene. A second open reading frame contiguous with the dae gene has also been partially sequenced. The organization of these genetic elements suggests that more than one enzyme necessary for the biosynthesis of D-alanyl-lipoteichoic acid may be present in this operon. Analysis of the amino acid sequence deduced from the dae gene identified three regions with significant homology to proteins in the following groups of ATP-utilizing enzymes: (i) the acid-thiol ligases, (ii) the activating enzymes for the biosynthesis of enterobactin, and (iii) the synthetases for tyrocidine, gramicidin S, and penicillin. From these comparisons, a common motif (GXXGXPK) has been identified that is conserved in the 19 protein domains analyzed. This motif may represent the phosphate-binding loop of an ATP-binding site for this class of enzymes. A DNA fragment (1,568 nucleotides) containing the dae gene and its putative ribosome-binding site has been subcloned and expressed in E. coli. Approximately 0.5% of the total cell protein is active Dae, whereas 21% is in the form of inclusion bodies. The isolation of this minimal fragment without a native promoter sequence provides the basis for designing a genetic system for modulating the D-alanine ester content of lipoteichoic acid. PMID:1385594

  15. Spermatogenesis of the lizard Lacerta vivipara: histological studies and amino acid sequence of a protamine lacertine 1.

    PubMed

    Martinage, A; Depeiges, A; Wouters, D; Morel, L; Sautière, P

    1996-06-01

    The lizard Lacerta vivipara is a seasonal breeder with a well characterized reproductive cycle. An histological study of the lizard testis has been performed at different stages of spermatogenesis and the nuclear basic proteins content was assessed by electrophoretical analysis. Two protamines, lacertines 1 and 2, are present in spermatozoa in April and May. We have isolated lacertine1 and characterized a protamine with a mass of 4,963.7 Da. Amino acid sequence of this protamine (41 residues) was established from data provided by automated Edman degradation. It is characterized by a basic amino acid stretch in the N- and C-terminal regions and by a central part which only consists of 3 different intermingled amino acids. This protamine presents 62% homology with scylliorhinine Z3 from dog-fish Scylliorhinus caniculus and 58% homology with quail protamine. The reported lizard protamine sequence is the first reptilian protamine sequence available so far.

  16. The amino acid sequence of the cytochrome c-554(547) from the chemolithotrophic bacterium Thiobacillus neapolitanus.

    PubMed Central

    Ambler, R P; Meyer, T E; Trudinger, P A; Kamen, M D

    1985-01-01

    An amino acid sequence is proposed for the cytochrome c-554(547) from the bacterium Thiobacillus neapolitanus N.C.I.B. 8539). It consists of a polypeptide chain of 91 residues, with a pair of haem-attachment cysteine residues at positions 15 and 18. There is similarity in sequence with each of the halves of the sequence of the dihaem cytochromes c4 and with a cytochrome c-554(548) from a halophilic strain of Paracoccus. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50127 (11 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1985) 225, 5. PMID:2988504

  17. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  18. Peruvian and globally reported amino acid substitutions on the Mycobacterium tuberculosis pyrazinamidase suggest a conserved pattern of mutations associated to pyrazinamide resistance

    PubMed Central

    Zimic, Mirko; Sheen, Patricia; Quiliano, Miguel; Gutierrez, Andrés; Gilman, Robert H.

    2010-01-01

    Resistance to pyrazinamide in Mycobacterium tuberculosis is usually associated with a reduction of pyrazinamidase activity caused by mutations in pncA, the pyrazinamidase coding gene. Pyrazinamidase is a hydrolase that converts pyrazinamide, the antituberculous drug against the latent stage, to the active compound, pyrazinoic acid. To better understand the relationship between pncA mutations and pyrazinamide-resistance, it is necessary to analyze the distribution of pncA mutations from pyrazinamide resistant strains. We determined the distribution of Peruvian and globally reported pncA missense mutations from M. tuberculosis clinical isolates resistant to pyrazinamide. The distributions of the single amino acid substitutions were compared at the secondary-structure-domains level. The distribution of the Peruvian mutations followed a similar pattern as the mutations reported globally. A consensus clustering of mutations was observed in hot-spot regions located in the metal coordination site and to a lesser extent in the active site of the enzyme. The data was not able to reject the null hypothesis that both distributions are similar, suggesting that pncA mutations associated to pyrazinamide resistance in M. tuberculosis, follow a conserved pattern responsible to impair the pyrazinamidase activity. PMID:19963078

  19. Nucleic acid sequence of an internal image-bearing monoclonal anti-idiotype and its comparison to the sequence of the external antigen.

    PubMed Central

    Bruck, C; Co, M S; Slaoui, M; Gaulton, G N; Smith, T; Fields, B N; Mullins, J I; Greene, M I

    1986-01-01

    The monoclonal anti-idiotypic antibody (mAb2) 87.92.6 directed against the 9B.G5 antibody specific for the virus neutralizing epitope on the mammalian reovirus type 3 hemagglutinin was previously demonstrated to express an internal image of the receptor binding epitope of the reovirus type 3. Furthermore, this mAb2 has autoimmune reactivity to the cell surface receptor of the reovirus. The nucleotide and deduced amino acid sequences of the 87.92.6 mAb2 heavy and light chains are described in this report. The sequence analysis reveals that the same heavy chain variable and joining (VH and JH) gene segments are used by the 87.92.6 anti-idiotypic mAb2 and by the dominant idiotypes of the BALB/c anti-GAT (cGAT) and anti-NP (NPa) responses. [GAT; random polymer that is 60% glutamic acid, 30% alanine, and 10% tyrosine. NP; (4-hydroxy-3-nitrophenyl)-acetyl.] Despite extensive homology at the level of the heavy chain variable regions, the NPa positive BALB/c anti-NP monoclonal antibody 17.2.25 binds neither 9B.G5 nor the cellular receptor for the hemagglutinin. Amino acid sequence comparison between the viral hemagglutinin and the 87.92.6 mAb2 light chain "internal image," reveals an area of significant homology indicating that antigen mimicry by antibodies may be achieved by sharing primary structure. PMID:2428036

  20. Draft Genome Sequence of Escherichia coli O157:H7 ATCC 35150 and a Nalidixic Acid-Resistant Mutant Derivative

    PubMed Central

    Markell, James A.; Koziol, Adam G.

    2015-01-01

    Shiga toxin-producing Escherichia coli strains, occasionally isolated from food, are of public health importance. Here, we report on the 5.30-Mbp draft genome sequence of E. coli O157:H7 EDL931 (strain ATCC 35150) and the 5.32-Mbp draft genome sequence of a nalidixic acid-resistant mutant derivative used as a distinguishable control strain in food-testing laboratories. PMID:26205873

  1. The deletion of several amino acid stretches of Escherichia coli alpha-hemolysin (HlyA) suggests that the channel-forming domain contains beta-strands.

    PubMed

    Benz, Roland; Maier, Elke; Bauer, Susanne; Ludwig, Albrecht

    2014-01-01

    Escherichia coli α-hemolysin (HlyA) is a pore-forming protein of 110 kDa belonging to the family of RTX toxins. A hydrophobic region between the amino acid residues 238 and 410 in the N-terminal half of HlyA has previously been suggested to form hydrophobic and/or amphipathic α-helices and has been shown to be important for hemolytic activity and pore formation in biological and artificial membranes. The structure of the HlyA transmembrane channel is, however, largely unknown. For further investigation of the channel structure, we deleted in HlyA different stretches of amino acids that could form amphipathic β-strands according to secondary structure predictions (residues 71-110, 158-167, 180-203, and 264-286). These deletions resulted in HlyA mutants with strongly reduced hemolytic activity. Lipid bilayer measurements demonstrated that HlyAΔ71-110 and HlyAΔ264-286 formed channels with much smaller single-channel conductance than wildtype HlyA, whereas their channel-forming activity was virtually as high as that of the wildtype toxin. HlyAΔ158-167 and HlyAΔ180-203 were unable to form defined channels in lipid bilayers. Calculations based on the single-channel data indicated that the channels generated by HlyAΔ71-110 and HlyAΔ264-286 had a smaller size (diameter about 1.4 to 1.8 nm) than wildtype HlyA channels (diameter about 2.0 to 2.6 nm), suggesting that in these mutants part of the channel-forming domain was removed. Osmotic protection experiments with erythrocytes confirmed that HlyA, HlyAΔ71-110, and HlyAΔ264-286 form defined transmembrane pores and suggested channel diameters that largely agreed with those estimated from the single-channel data. Taken together, these results suggest that the channel-forming domain of HlyA might contain β-strands, possibly in addition to α-helical structures.

  2. The Deletion of Several Amino Acid Stretches of Escherichia coli Alpha-Hemolysin (HlyA) Suggests That the Channel-Forming Domain Contains Beta-Strands

    PubMed Central

    Benz, Roland; Maier, Elke; Bauer, Susanne; Ludwig, Albrecht

    2014-01-01

    Escherichia coli α-hemolysin (HlyA) is a pore-forming protein of 110 kDa belonging to the family of RTX toxins. A hydrophobic region between the amino acid residues 238 and 410 in the N-terminal half of HlyA has previously been suggested to form hydrophobic and/or amphipathic α-helices and has been shown to be important for hemolytic activity and pore formation in biological and artificial membranes. The structure of the HlyA transmembrane channel is, however, largely unknown. For further investigation of the channel structure, we deleted in HlyA different stretches of amino acids that could form amphipathic β-strands according to secondary structure predictions (residues 71–110, 158–167, 180–203, and 264–286). These deletions resulted in HlyA mutants with strongly reduced hemolytic activity. Lipid bilayer measurements demonstrated that HlyAΔ71–110 and HlyAΔ264–286 formed channels with much smaller single-channel conductance than wildtype HlyA, whereas their channel-forming activity was virtually as high as that of the wildtype toxin. HlyAΔ158–167 and HlyAΔ180–203 were unable to form defined channels in lipid bilayers. Calculations based on the single-channel data indicated that the channels generated by HlyAΔ71–110 and HlyAΔ264–286 had a smaller size (diameter about 1.4 to 1.8 nm) than wildtype HlyA channels (diameter about 2.0 to 2.6 nm), suggesting that in these mutants part of the channel-forming domain was removed. Osmotic protection experiments with erythrocytes confirmed that HlyA, HlyAΔ71–110, and HlyAΔ264–286 form defined transmembrane pores and suggested channel diameters that largely agreed with those estimated from the single-channel data. Taken together, these results suggest that the channel-forming domain of HlyA might contain β-strands, possibly in addition to α-helical structures. PMID:25463653

  3. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    PubMed

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.

  4. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  5. Terminal sequence importance of de novo proteins from binary-patterned library: stable artificial proteins with 11- or 12-amino acid alphabet.

    PubMed

    Okura, Hiromichi; Takahashi, Tsuyoshi; Mihara, Hisakazu

    2012-06-01

    Successful approaches of de novo protein design suggest a great potential to create novel structural folds and to understand natural rules of protein folding. For these purposes, smaller and simpler de novo proteins have been developed. Here, we constructed smaller proteins by removing the terminal sequences from stable de novo vTAJ proteins and compared stabilities between mutant and original proteins. vTAJ proteins were screened from an α3β3 binary-patterned library which was designed with polar/ nonpolar periodicities of α-helix and β-sheet. vTAJ proteins have the additional terminal sequences due to the method of constructing the genetically repeated library sequences. By removing the parts of the sequences, we successfully obtained the stable smaller de novo protein mutants with fewer amino acid alphabets than the originals. However, these mutants showed the differences on ANS binding properties and stabilities against denaturant and pH change. The terminal sequences, which were designed just as flexible linkers not as secondary structure units, sufficiently affected these physicochemical details. This study showed implications for adjusting protein stabilities by designing N- and C-terminal sequences.

  6. Purification, characterization, gene cloning and nucleotide sequencing of D: -stereospecific amino acid amidase from soil bacterium: Delftia acidovorans.

    PubMed

    Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa

    2005-12-01

    The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+ . The NH2 terminal and internal amino acid sequences of the enzyme were determined; and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da); and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.

  7. Proteomic and Biochemical Studies of Lysine Malonylation Suggest Its Malonic Aciduria-associated Regulatory Role in Mitochondrial Function and Fatty Acid Oxidation*

    PubMed Central

    Colak, Gozde; Pougovkina, Olga; Dai, Lunzhi; Tan, Minjia; te Brinke, Heleen; Huang, He; Cheng, Zhongyi; Park, Jeongsoon; Wan, Xuelian; Liu, Xiaojing; Yue, Wyatt W.; Wanders, Ronald J. A.; Locasale, Jason W.; Lombard, David B.; de Boer, Vincent C. J.; Zhao, Yingming

    2015-01-01

    The protein substrates of sirtuin 5-regulated lysine malonylation (Kmal) remain unknown, hindering its functional analysis. In this study, we carried out proteomic screening, which identified 4042 Kmal sites on 1426 proteins in mouse liver and 4943 Kmal sites on 1822 proteins in human fibroblasts. Increased malonyl-CoA levels in malonyl-CoA decarboxylase (MCD)-deficient cells induces Kmal levels in substrate proteins. We identified 461 Kmal sites showing more than a 2-fold increase in response to MCD deficiency as well as 1452 Kmal sites detected only in MCD−/− fibroblast but not MCD+/+ cells, suggesting a pathogenic role of Kmal in MCD deficiency. Cells with increased lysine malonylation displayed impaired mitochondrial function and fatty acid oxidation, suggesting that lysine malonylation plays a role in pathophysiology of malonic aciduria. Our study establishes an association between Kmal and a genetic disease and offers a rich resource for elucidating the contribution of the Kmal pathway and malonyl-CoA to cellular physiology and human diseases. PMID:26320211

  8. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  9. KM+, a mannose-binding lectin from Artocarpus integrifolia: amino acid sequence, predicted tertiary structure, carbohydrate recognition, and analysis of the beta-prism fold.

    PubMed Central

    Rosa, J. C.; De Oliveira, P. S.; Garratt, R.; Beltramini, L.; Resing, K.; Roque-Barreira, M. C.; Greene, L. J.

    1999-01-01

    The complete amino acid sequence of the lectin KM+ from Artocarpus integrifolia (jackfruit), which contains 149 residues/mol, is reported and compared to those of other members of the Moraceae family, particularly that of jacalin, also from jackfruit, with which it shares 52% sequence identity. KM+ presents an acetyl-blocked N-terminus and is not posttranslationally modified by proteolytic cleavage as is the case for jacalin. Rather, it possesses a short, glycine-rich linker that unites the regions homologous to the alpha- and beta-chains of jacalin. The results of homology modeling implicate the linker sequence in sterically impeding rotation of the side chain of Asp141 within the binding site pocket. As a consequence, the aspartic acid is locked into a conformation adequate only for the recognition of equatorial hydroxyl groups on the C4 epimeric center (alpha-D-mannose, alpha-D-glucose, and their derivatives). In contrast, the internal cleavage of the jacalin chain permits free rotation of the homologous aspartic acid, rendering it capable of accepting hydrogen bonds from both possible hydroxyl configurations on C4. We suggest that, together with direct recognition of epimeric hydroxyls and the steric exclusion of disfavored ligands, conformational restriction of the lectin should be considered to be a new mechanism by which selectivity may be built into carbohydrate binding sites. Jacalin and KM+ adopt the beta-prism fold already observed in two unrelated protein families. Despite presenting little or no sequence similarity, an analysis of the beta-prism reveals a canonical feature repeatedly present in all such structures, which is based on six largely hydrophobic residues within a beta-hairpin containing two classic-type beta-bulges. We suggest the term beta-prism motif to describe this feature. PMID:10210179

  10. Genome wide identification of microRNAs involved in fatty acid and lipid metabolism of Brassica napus by small RNA and degradome sequencing.

    PubMed

    Wang, Zhiwei; Qiao, Yan; Zhang, Jingjing; Shi, Wenhui; Zhang, Jinwen

    2017-04-01

    Rapeseed (Brassica napus) is an important cash crop considered as the third largest oil crop worldwide. Rapeseed oil contains various saturation or unsaturation fatty acids, these fatty acids, whose could incorporation with TAG form into lipids stored in seeds play various roles in the metabolic activity. The different fatty acids in B. napus seeds determine oil quality, define if the oil is edible or must be used as industrial material. miRNAs are kind of non-coding sRNAs that could regulate gene expressions through post-transcriptional modification to their target transcripts playing important roles in plant metabolic activities. We employed high-throughput sequencing to identify the miRNAs and their target transcripts involved in fatty acids and lipids metabolism in different development of B. napus seeds. As a result, we identified 826 miRNA sequences, including 523 conserved and 303 newly miRNAs. From the degradome sequencing, we found 589 mRNA could be targeted by 236 miRNAs, it includes 49 novel miRNAs and 187 conserved miRNAs. The miRNA-target couple suggests that bna-5p-163957_18, bna-5p-396192_7, miR9563a-p3, miR9563b-p5, miR838-p3, miR156e-p3, miR159c and miR1134 could target PDP, LACS9, MFPA, ADSL1, ACO32, C0401, GDL73, PlCD6, OLEO3 and WSD1. These target transcripts are involving in acetyl-CoA generate and carbon chain desaturase, regulating the levels of very long chain fatty acids, β-oxidation and lipids transport and metabolism process. At the same, we employed the q-PCR to valid the expression of miRNAs and their target transcripts that involve in fatty acid and lipid metabolism, the result suggested that the miRNA and their transcript expression are negative correlation, which in accord with the expression of miRNA and its target transcript. The study findings suggest that the identified miRNA may play important role in the fatty acids and lipids metabolism in seeds of B. napus.

  11. Differences in acid tolerance between Bifidobacterium breve BB8 and its acid-resistant derivative B. breve BB8dpH, revealed by RNA-sequencing and physiological analysis.

    PubMed

    Yang, Xu; Hang, Xiaomin; Tan, Jing; Yang, Hong

    2015-06-01

    Bifidobacteria are common inhabitants of the human gastrointestinal tract, and their application has increased dramatically in recent years due to their health-promoting effects. The ability of bifidobacteria to tolerate acidic environments is particularly important for their function as probiotics because they encounter such environments in food products and during passage through the gastrointestinal tract. In this study, we generated a derivative, Bifidobacterium breve BB8dpH, which displayed a stable, acid-resistant phenotype. To investigate the possible reasons for the higher acid tolerance of B. breve BB8dpH, as compared with its parental strain B. breve BB8, a combined transcriptome and physiological approach was used to characterize differences between the two strains. An analysis of the transcriptome by RNA-sequencing indicated that the expression of 121 genes was increased by more than 2-fold, while the expression of 146 genes was reduced more than 2-fold, in B. breve BB8dpH. Validation of the RNA-sequencing data using real-time quantitative PCR analysis demonstrated that the RNA-sequencing results were highly reliable. The comparison analysis, based on differentially expressed genes, suggested that the acid tolerance of B. breve BB8dpH was enhanced by regulating the expression of genes involved in carbohydrate transport and metabolism, energy production, synthesis of cell envelope components (peptidoglycan and exopolysaccharide), synthesis and transport of glutamate and glutamine, and histidine synthesis. Furthermore, an analysis of physiological data showed that B. breve BB8dpH displayed higher production of exopolysaccharide and lower H(+)-ATPase activity than B. breve BB8. The results presented here will improve our understanding of acid tolerance in bifidobacteria, and they will lead to the development of new strategies to enhance the acid tolerance of bifidobacterial strains.

  12. Human parainfluenza type 3 virus hemagglutinin-neuraminidase glycoprotein: nucleotide sequence of mRNA and limited amino acid sequence of the purified protein.

    PubMed Central

    Elango, N; Coligan, J E; Jambou, R C; Venkatesan, S

    1986-01-01

    The nucleotide sequence of mRNA for the hemagglutinin-neuraminidase (HN) protein of human parainfluenza type 3 virus obtained from the corresponding cDNA clone had a single long open reading frame encoding a putative protein of 64,254 daltons consisting of 572 amino acids. The deduced protein sequence was confirmed by limited N-terminal amino acid microsequencing of CNBr cleavage fragments of native HN that was purified by immunoprecipitation. The HN protein is moderately hydrophobic and has four potential sites (Asn-X-Ser/Thr) of N-glycosylation in the C-terminal half of the molecule. It is devoid of both the N-terminal signal sequence and the C-terminal membrane anchorage domain characteristic of the hemagglutinin of influenza virus and the fusion (F0) protein of the paramyxoviruses. Instead, it has a single prominent hydrophobic region capable of membrane insertion beginning at 32 residues from the N terminus. This N-terminal membrane insertion is similar to that of influenza virus neuraminidase and the recently reported structures of HN proteins of Sendai virus and simian virus 5. Images PMID:3003381

  13. Next-generation re-sequencing of genes involved in increased platelet reactivity in diabetic patients on acetylsalicylic acid.

    PubMed

    Postula, Marek; Janicki, Piotr K; Eyileten, Ceren; Rosiak, Marek; Kaplon-Cieslicka, Agnieszka; Sugino, Shigekazu; Wilimski, Radosław; Kosior, Dariusz A; Opolski, Grzegorz; Filipiak, Krzysztof J; Mirowska-Guzel, Dagmara

    2016-06-01

    The objective of this study was to investigate whether rare missense genetic variants in several genes related to platelet functions and acetylsalicylic acid (ASA) response are associated with the platelet reactivity in patients with diabetes type 2 (T2D) on ASA therapy. Fifty eight exons and corresponding introns of eight selected genes, including PTGS1, PTGS2, TXBAS1, PTGIS, ADRA2A, ADRA2B, TXBA2R, and P2RY1 were re-sequenced in 230 DNA samples from T2D patients by using a pooled PCR amplification and next-generation sequencing by Illumina HiSeq2000. The observed non-synonymous variants were confirmed by individual genotyping of 384 DNA samples comprising of the individuals from the original discovery pools and additional verification cohort of 154 ASA-treated T2DM patients. The association between investigated phenotypes (ASA induced changes in platelets reactivity by PFA-100, VerifyNow and serum thromboxane B2 level [sTxB2]), and accumulation of rare missense variants (genetic burden) in investigated genes was tested using statistical collapsing tests. We identified a total of 35 exonic variants, including 3 common missense variants, 15 rare missense variants, and 17 synonymous variants in 8 investigated genes. The rare missense variants exhibited statistically significant difference in the accumulation pattern between a group of patients with increased and normal platelet reactivity based on PFA-100 assay. Our study suggests that genetic burden of the rare functional variants in eight genes may contribute to differences in the platelet reactivity measured with the PFA-100 assay in the T2DM patients treated with ASA.

  14. Sequence dependent N-terminal rearrangement and degradation of peptide nucleic acid (PNA) in aqueous solution

    NASA Technical Reports Server (NTRS)

    Eriksson, M.; Christensen, L.; Schmidt, J.; Haaima, G.; Orgel, L.; Nielsen, P. E.

    1998-01-01

    The stability of the PNA (peptide nucleic acid) thymine monomer inverted question markN-[2-(thymin-1-ylacetyl)]-N-(2-aminoaminoethyl)glycine inverted question mark and those of various PNA oligomers (5-8-mers) have been measured at room temperature (20 degrees C) as a function of pH. The thymine monomer undergoes N-acyl transfer rearrangement with a half-life of 34 days at pH 11 as analyzed by 1H NMR; and two reactions, the N-acyl transfer and a sequential degradation, are found by HPLC analysis to occur at measurable rates for the oligomers at pH 9 or above. Dependent on the amino-terminal sequence, half-lives of 350 h to 163 days were found at pH 9. At pH 12 the half-lives ranged from 1.5 h to 21 days. The results are discussed in terms of PNA as a gene therapeutic drug as well as a possible prebiotic genetic material.

  15. Low molecular weight (C1-C10) monocarboxylic acids, dissolved organic carbon and major inorganic ions in alpine snow pit sequence from a high mountain site, central Japan

    NASA Astrophysics Data System (ADS)

    Kawamura, Kimitaka; Matsumoto, Kohei; Tachibana, Eri; Aoki, Kazuma

    2012-12-01

    Snowpack samples were collected from a snow pit sequence (6 m in depth) at the Murodo-Daira site near the summit of Mt. Tateyama, central Japan, an outflow region of Asian dusts. The snow samples were analyzed for a homologous series of low molecular weight normal (C1-C10) and branched (iC4-iC6) monocarboxylic acids as well as aromatic (benzoic) and hydroxy (glycolic and lactic) acids, together with major inorganic ions and dissolved organic carbon (DOC). The molecular distributions of organic acids were characterized by a predominance of acetic (range 7.8-76.4 ng g-1-snow, av. 34.8 ng g-1) or formic acid (2.6-48.1 ng g-1, 27.7 ng g-1), followed by propionic acid (0.6-5.2 ng g-1, 2.8 ng g-1). Concentrations of normal organic acids generally decreased with an increase in carbon chain length, although nonanoic acid (C9) showed a maximum in the range of C5-C10. Higher concentrations were found in the snowpack samples containing dust layer. Benzoic acid (0.18-4.1 ng g-1, 1.4 ng g-1) showed positive correlation with nitrate (r = 0.70), sulfate (0.67), Na+ (0.78), Ca2+ (0.86) and Mg+ (0.75), suggesting that this aromatic acid is involved with anthropogenic sources and Asian dusts. Higher concentrations of Ca2+ and SO42- were found in the dusty snow samples. We found a weak positive correlation (r = 0.43) between formic acid and Ca2+, suggesting that gaseous formic acid may react with Asian dusts in the atmosphere during long-range transport. However, acetic acid did not show any positive correlations with major inorganic ions. Hydroxyacids (0.03-5.7 ng g-1, 1.5 ng g-1) were more abundant in the granular and dusty snow. Total monocarboxylic acids (16-130 ng g-1, 74 ng g-1) were found to account for 1-6% of DOC (270-1500 ng g-1, 630 ng g-1) in the snow samples.

  16. Frequencies of amino acid strings in globular protein sequences indicate suppression of blocks of consecutive hydrophobic residues

    PubMed Central

    Schwartz, Russell; Istrail, Sorin; King, Jonathan

    2001-01-01

    Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20–22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of consecutive hydrophobic residues promoted aggregation within the model, even controlling for overall hydrophobic content. We report here on an analysis of the frequencies of different lengths of contiguous blocks of hydrophobic residues in a database of amino acid sequences of proteins of known structure. Sequences of three or more consecutive hydrophobic residues are found to be significantly less common in actual globular proteins than would be predicted if residues were selected independently. The result may reflect selection against long blocks of hydrophobic residues within globular proteins relative to what would be expected if residue hydrophobicities were independent of those of nearby residues in the sequence. PMID:11316883

  17. Amino acid sequence of rabbit kidney neutral endopeptidase 24.11 (enkephalinase) deduced from a complementary DNA.

    PubMed Central

    Devault, A; Lazure, C; Nault, C; Le Moual, H; Seidah, N G; Chrétien, M; Kahn, P; Powell, J; Mallet, J; Beaumont, A

    1987-01-01

    Neutral endopeptidase (EC 3.4.24.11) is a major constituent of kidney brush border membranes. It is also present in the brain where it has been shown to be involved in the inactivation of opioid peptides, methionine- and leucine-enkephalins. For this reason this enzyme is often called 'enkephalinase'. In order to characterize the primary structure of the enzyme, oligonucleotide probes were designed from partial amino acid sequences and used to isolate clones from kidney cDNA libraries. Sequencing of the cDNA inserts revealed the complete primary structure of the enzyme. Neutral endopeptidase consists of 750 amino acids. It contains a short N-terminal cytoplasmic domain (27 amino acids), a single membrane-spanning segment (23 amino acids) and an extracellular domain that comprises most of the protein mass. The comparison of the primary structure of neutral endopeptidase with that of thermolysin, a bacterial Zn-metallopeptidase, indicates that most of the amino acid residues involved in Zn coordination and catalytic activity in thermolysin are found within highly honmologous sequences in neutral endopeptidase. Images Fig. 1. Fig. 3. PMID:2440677

  18. Fe(II) oxidation during acid mine drainage neutralization in a pilot-scale Sequencing Batch Reactor.

    PubMed

    Zvimba, J N; Mathye, M; Vadapalli, V R K; Swanepoel, H; Bologo, L

    2013-01-01

    This study investigated Fe(II) oxidation during acid mine drainage (AMD) neutralization using CaCO3 in a pilot-scale Sequencing Batch Reactor (SBR) of hydraulic retention time (HRT) of 90 min and sludge retention time (SRT) of 360 min in the presence of air. The removal kinetics of Fe(II), of initial concentration 1,033 ± 0 mg/L, from AMD through oxidation to Fe(III) was observed to depend on both pH and suspended solids, resulting in Fe(II) levels of 679 ± 32, 242 ± 64, 46 ± 16 and 28 ± 0 mg/L recorded after cycles 1, 2, 3 and 4 respectively, with complete Fe(II) oxidation only achieved after complete neutralization of AMD. Generally, it takes 30 min to completely oxidize Fe(II) during cycle 4, suggesting that further optimization of SBR operation based on both pH and suspended solids manipulation can result in significant reduction of the number of cycles required to achieve acceptable Fe(II) oxidation for removal as ferric hydroxide. Overall, complete removal of Fe(II) during AMD neutralization is attractive as it promotes recovery of better quality waste gypsum, key to downstream gypsum beneficiation for recovery of valuables, thereby enabling some treatment-cost recovery and prevention of environmental pollution from dumping of sludge into landfills.

  19. Classifying nucleic acid sub-sequences as introns or exons using genetic programming

    SciTech Connect

    Handley, S.

    1995-12-31

    An evolutionary computation technique, genetic programming, created programs that classify messenger RNA sequences into one of two classes: (1) the sequence is expressed as (part of) a protein (an exon), or (2) not expressed as protein (an intron).

  20. Clickable Nucleic Acids: Sequence-Controlled Periodic Copolymer/Oligomer Synthesis by Orthogonal Thiol-X Reactions.

    PubMed

    Xi, Weixian; Pattanayak, Sankha; Wang, Chen; Fairbanks, Benjamin; Gong, Tao; Wagner, Justine; Kloxin, Christopher J; Bowman, Christopher N

    2015-11-23

    Synthetic polymer approaches generally lack the ability to control the primary sequence, with sequence control referred to as the holy grail. Two click chemistry reactions were now combined to form nucleobase-containing sequence-controlled polymers in simple polymerization reactions. Two distinct approaches are used to form these click nucleic acid (CNA) polymers. These approaches employ thiol-ene and thiol-Michael reactions to form homopolymers of a single nucleobase (e.g., poly(A)n ) or homopolymers of specific repeating nucleobase sequences (e.g., poly(ATC)n). Furthermore, the incorporation of monofunctional thiol-terminated polymers into the polymerization system enables the preparation of multiblock copolymers in a single reaction vessel; the length of the diblock copolymer can be tuned by the stoichiometric ratio and/or the monomer functionality. These polymers are also used for organogel formation where complementary CNA-based polymers form reversible crosslinks.

  1. Sequence Comparison and Phylogeny of Nucleotide Sequence of Coat Protein and Nucleic Acid Binding Protein of a Distinct Isolate of Shallot virus X from India.

    PubMed

    Majumder, S; Baranwal, V K

    2011-06-01

    Shallot virus X (ShVX), a type species in the genus Allexivirus of the family Alfaflexiviridae has been associated with shallot plants in India and other shallot growing countries like Russia, Germany, Netherland, and New Zealand. Coat protein (CP) and nucleic acid binding protein (NB) region of the virus was obtained by reverse transcriptase polymerase chain reaction from scales leaves of shallot bulbs. The partial cDNA contained two open reading frames encoding proteins of molecular weights of 28.66 and 14.18 kDa belonging to Flexi_CP super-family and viral NB super-family, respectively. The percent identity and phylogenetic analysis of amino acid sequences of CP and NB region of the virus associated with shallot indicated that it was a distinct isolate of ShVX.

  2. Amino acid sequence diversity within the family of antibodies bearing the major antiarsonate cross-reactive idiotype of the A strain mouse

    PubMed Central

    1983-01-01

    VH region amino acid sequences are described for five A/J anti-p- azophenylarsonate (anti-Ars) hybridoma antibodies for which the VL region sequences have previously been determined, thus completing the V domain sequences of these molecules. These antibodies all belong to the family designated Ars-A which bears the major anti-arsonate cross- reactive idiotype (CRI) of the A strain mouse. However, they differ in the degree to which they express the CRI in standard competition radioimmunoassays. Although the sequences are closely related, all are different from each other. Replacements are distributed throughout the VH region and occur in positions of the chain encoded by all three gene segments, VH, DH, and JH. It is likely that somatic diversification processes play a dominant role in producing the sequence variability in each of these segments. The number of differences from the sequence encoded by the germline is smallest for antibodies that express the CRI most strongly, suggesting that somatic diversification is responsible for loss of the CRI in members of the Ars-A antibody family. There is an unusual degree of clustering of differences in both CDR2 and CDR3 and many of the substitutions are located in "hot spots" of variation. The large number of differences between the chains prohibits the unambiguous identification of positions at which alterations play a major role in reducing the expression of the CRI. However, the data suggest that the loss of the CRI is associated with a definable repertoire of somatic changes at a restricted number of highly variable sites. PMID:6415209

  3. Phylogenetic analysis of the genera Proteus, Morganella and Providencia by comparison of rpoB gene sequences of type and clinical strains suggests the reclassification of Proteus myxofaciens in a new genus, Cosenzaea gen. nov., as Cosenzaea myxofaciens comb. nov.

    PubMed

    Giammanco, Giovanni M; Grimont, Patrick A D; Grimont, Francine; Lefevre, Martine; Giammanco, Giuseppe; Pignato, Sarina

    2011-07-01

    Phylogenetic analysis of partial rpoB gene sequences of type and clinical strains belonging to different 16S rRNA gene-fingerprinting ribogroups within 11 species of enterobacteria of the genera Proteus, Morganella and Providencia was performed and allowed the definition of rpoB clades, supported by high bootstrap values and confirmed by ≥2.5 % nucleotide divergence. None of the resulting clades included strains belonging to different species and the majority of the species were confirmed as discrete and homogeneous. However, more than one distinct rpoB clade could be defined among strains belonging to the species Proteus vulgaris (two clades), Providencia alcalifaciens (two clades) and Providencia rettgeri (three clades), suggesting that some strains represent novel species according to the genotypes outlined by rpoB gene sequence analysis. Percentage differences between the rpoB gene sequence of the type strain of Proteus myxofaciens and other members of the same genus (17.3-18.9 %) were similar to those calculated amongst strains of the genus Providencia (16.4-18.7 %), suggesting a genetic distance at the genus-level between Proteus myxofaciens and the rest of the Proteus-Providencia group. Proteus myxofaciens therefore represents a member of a new genus, for which the name Cosenzaea gen. nov., is proposed.

  4. A rapid method for manual or automated purification of fluorescently labeled nucleic acids for sequencing, genotyping, and microarrays.

    PubMed

    Springer, Amy L; Booth, Lisa R; Braid, Michael D; Houde, Christiane M; Hughes, Karin A; Kaiser, Robert J; Pedrak, Casandra; Spicer, Douglas A; Stolyar, Sergey

    2003-03-01

    Fluorescent dyes provide specific, sensitive, and multiplexed detection of nucleic acids. To maximize sensitivity, fluorescently labeled reaction products (e.g., cycle sequencing or primer extension products) must be purified away from residual dye-labeled precursors. Successful high-throughput analyses require that this purification be reliable, rapid, and amenable to automation. Common methods for purifying reaction products involve several steps and require processes that are not easily automated. Prolinx, Inc. has devel oped RapXtract superparamagnetic separation technology affording rapid and easy-to-perform methods that yield high-quality product and are easily automated. The technology uses superparamagnetic particles that specifically remove unincorporated dye-labeled precursors. These particles are efficiently pelleted in the presence of a magnetic field, making them ideal for purification because of the rapid separations that they allow. RapXtract-purified sequencing reactions yield data with good signal and high Phred quality scores, and they work with various sequencing dye chemistries, including BigDye and near-infrared fluorescence IRDyes. RapXtract technology can also be used to purify dye primer sequencing reactions, primer extension reactions for genotyping analysis, and nucleic acid labeling reactions for microarray hybridization. The ease of use and versatility of RapXtract technology makes it a good choice for manual or automated purification of fluorescently labeled nucleic acids.

  5. Homology of the NH2-terminal amino acid sequences of the heavy and light chains of human monoclonal lupus autoantibodies containing the dominant 16/6 idiotype.

    PubMed Central

    Atkinson, P M; Lampman, G W; Furie, B C; Naparstek, Y; Schwartz, R S; Stollar, B D; Furie, B

    1985-01-01

    The NH2-terminal amino acid sequences have been determined by automated Edman degradation for the heavy and light chains of five monoclonal IgM anti-DNA autoantibodies that were produced by human-human hybridomas derived from lymphocytes of two patients with systemic lupus erythematosus. Four of the antibodies were closely related to the idiotype system 16/6, whereas the fifth antibody was unrelated idiotypically. The light chains of the 16/6 idiotype-positive autoantibodies (HF2-1/13b, HF2-1/17, HF2-18/2, and HF3-16/6) had identical amino acid sequences from residues 1 to 40. Their framework structures were characteristic of VKI light chains. The light chain of the 16/6 idiotype-negative autoantibody HF6-21/28 was characteristic of the VKII subgroup. The heavy chains of the 16/6 idiotype-positive autoantibodies had nearly identical amino acid sequences from residues 1 to 40. The framework structures were characteristic of the VHIII subgroup. In contrast, the GM4672 fusion partner of the hybridoma produced small quantities of an IgG with a VHI heavy chain and a VKI light chain. The heavy chains of the lupus autoantibodies and the light chains of those autoantibodies that were idiotypically related to the 16/6 system had marked sequence homology with WEA, a Waldenstrom IgM that binds to Klebsiella polysaccharides and expresses the 16/6 idiotype. These results indicate a striking homology in the amino termini of the heavy and light chains of the lupus autoantibodies studied and suggest that the V regions of the heavy and light chains of the 16/6 idiotype-positive DNA-binding lupus auto-antibodies are each encoded by a single germ line gene. PMID:3921567

  6. Ice core sulfur and methanesulfonic acid (MSA) records from southern Greenland document North American and European air pollution and suggest a decline in regional biogenic sulfur emissions.

    NASA Astrophysics Data System (ADS)

    Pasteris, D. R.; McConnell, J. R.; Burkhart, J. F.; Saltzman, E. S.

    2014-12-01

    Sulfate aerosols have an important cooling effect on the Earth because they scatter sunlight back to space and form cloud condensation nuclei. However, understanding of the atmospheric sulfur cycle is incomplete, leading to uncertainty in the assessment of past, present and future climate forcing. Here we use annually resolved observations of sulfur and methanesulfonic acid (MSA) concentration in an array of precisely dated Southern Greenland ice cores to assess the history of sulfur pollution emitted from North America and Europe and the history of biogenic sulfate aerosol derived from the North Atlantic Ocean over the last 250 years. The ice core sulfur time series is found to closely track sulfur concentrations in North American and European precipitation since records began in 1965, and also closely tracks estimated sulfur emissions since 1850 within the air mass source region as determined by back trajectory analysis. However, a decline to near-preindustrial sulfur concentrations in the ice cores after 1995 that is not so extensive in the source region emissions indicates that there has been a change in sulfur cycling over the last 150 years. The ice core MSA time series shows a decline of 60% since the 1860s, and is well correlated with declining sea ice concentrations around Greenland, suggesting that the phytoplankton source of biogenic sulfur has declined due to a loss of marginal sea ice zone habitat. Incorporating the implied decrease in biogenic sulfur in our analysis improves the match between the ice core sulfur record and the source region emissions throughout the last 150 years, and solves the problem of the recent return to near-preindustrial levels in the Greenland ice. These findings indicate that the transport efficiency of sulfur air pollution has been relatively stable through the industrial era and that biogenic sulfur emissions in the region have declined.

  7. Cloning and sequencing of the Bet v 1-homologous allergen Fra a 1 in strawberry (Fragaria ananassa) shows the presence of an intron and little variability in amino acid sequence.

    PubMed

    Musidlowska-Persson, Anna; Alm, Rikard; Emanuelsson, Cecilia

    2007-02-01

    The Fra a 1 allergen in strawberry (Fragaria ananassa) is homologous to the major birch pollen allergen Bet v 1, which has numerous isoforms differing in terms of amino acid sequence and immunological impact. To map the extent of sequence differences in the Fra a 1 allergen, PCR cloning and sequencing was applied. Several genomic sequences of Fra a 1, with a length of either 584, 591 or 594 nucleotides, were obtained from three different strawberry varieties. All contained one intron, with the length of either 101 or 110 nucleotides. By sequencing 30 different clones, eight different DNA sequences were obtained, giving in total five potential Fra a 1 protein isoforms, with high sequence similarity (>97% sequence identity) and only seven positions of amino acid variability, which were largely confirmed by mass spectrometry of expressed proteins. We conclude that the sequence variability in the strawberry allergen Fra a 1 is small, within and between strawberry varieties, and that multiple spots, previously detected in 2DE, are presumably due to differences in post-translational modification rather than differences in amino acid sequence. The most abundant Fra a 1 isoform sequence, recombinantly expressed in Escherichia coli after removal of the intron, was recognized by IgE from strawberry allergic patients. It cross-reacted with antibodies to Bet v 1 and the homologous apple allergen Mal d 1 (61 and 78% sequence identity, respectively), and will be used in further analyses of variation in Fra a 1-expression.

  8. Analysis of the complete sequences of two biologically distinct Zucchini yellow mosaic virus isolates further evidences the involvement of a single amino acid in the virus pathogenicity.

    PubMed

    Nováková, S; Svoboda, J; Glasa, M

    2014-01-01

    The complete genome sequences of two Slovak Zucchini yellow mosaic virus isolates (ZYMV-H and ZYMV-SE04T) were determined. These isolates differ significantly in their pathogenicity, producing either severe or very mild symptoms on susceptible cucurbit hosts. The viral genome of both isolates consisted of 9593 nucleotides in size, and contained an open reading frame encoding a single polyprotein of 3080 amino acids. Despite their different biological properties, an extremely high nucleotide identity could be noted (99.8%), resulting in differences of only 5 aa, located in the HC-Pro, P3, and NIb, respectively. In silico analysis including 5 additional fully-sequenced and phylogenetically closely-related isolates known to induce different symptoms in cucurbits was performed. This suggested that the key single mutation responsible for virus pathogenicity is likely located in the N-terminal part of P3, adjacent to the PIPO.

  9. K-Pax2: Bayesian identification of cluster-defining amino acid positions in large sequence datasets

    PubMed Central

    Grad, Yonatan; Cobey, Sarah; Puranen, Juha Santeri; Corander, Jukka

    2015-01-01

    The recent growth in publicly available sequence data has introduced new opportunities for studying microbial evolution and spread. Because the pace of sequence accumulation tends to exceed the pace of experimental studies of protein function and the roles of individual amino acids, statistical tools to identify meaningful patterns in protein diversity are essential. Large sequence alignments from fast-evolving micro-organisms are particularly challenging to dissect using standard tools from phylogenetics and multivariate statistics because biologically relevant functional signals are easily masked by neutral variation and noise. To meet this need, a novel computational method is introduced that is easily executed in parallel using a cluster environment and can handle thousands of sequences with minimal subjective input from the user. The usefulness of this kind of machine learning is demonstrated by applying it to nearly 5000 haemagglutinin sequences of influenza A/H3N2.Antigenic and 3D structural mapping of the results show that the method can recover the major jumps in antigenic phenotype that occurred between 1968 and 2013 and identify specific amino acids associated with these changes. The method is expected to provide a useful tool to uncover patterns of protein evolution. PMID:28348810

  10. Isolation and a partial amino acid sequence of insulin from the islet tissue of cod (Gadus callarias)

    PubMed Central

    Grant, P. T.; Reid, K. B. M.

    1968-01-01

    1. Insulin has been isolated by gel filtration and ion-exchange chromatography from extracts of the discrete islet tissue of cod. The final preparation yielded a single band on electrophoresis at two pH values. The biological potency was 11·5 international units/mg. in mouse-convulsion and other assay procedures. 2. Glycine and methionine were shown to be the N-terminal amino acids of the A and B chains respectively. An estimate of the molecular weight together with amino acid analyses indicated that cod insulin, like the bovine hormone, consists of 51 amino acid residues. In contrast, the amino acid composition differs markedly from bovine insulin. 3. Oxidation of insulin with performic acid yielded the A and B peptide chains, which were separated by ion-exchange chromatography. Sequence studies on smaller peptides isolated from enzymic digests or from dilute acetic acid hydrolysates of the two chains have established the sequential order of 14 of the 21 amino acid residues of the A chain and 25 of the 30 amino acid residues of the B chain. PMID:4866431

  11. Oxygen affinity and amino acid sequence of myoglobins from endothermic and ectothermic fish.

    PubMed

    Marcinek, D J; Bonaventura, J; Wittenberg, J B; Block, B A

    2001-04-01

    Myoglobin (Mb) buffers intracellular O2 and facilitates diffusion of O2 through the cell. These functions of Mb will be most effective when intracellular PO2 is near the partial pressure of oxygen at which Mb is half saturated (P50) of the molecule. We test the hypothesis that Mb oxygen affinity has evolved such that it is conserved when adjusted for body temperature among closely related animals. We measure oxygen P50s tonometrically and oxygen dissociation rate constants with stopped flow and generate amino acid sequence from cDNA of Mbs from fish with different body temperatures. P50s for the endothermic bluefin tuna, skipjack tuna, and blue marlin at 20 degrees C were 0.62 +/- 0.02, 0.59 +/- 0.01, 0.58 +/- 0.04 mmHg, respectively, and were significantly lower than those for ectothermic bonito (1.03 +/- 0.07 mmHg) and mackerel (1.39 +/- 0.03 mmHg). Because the oxygen affinity of Mb decreases with increasing temperature, the above differences in oxygen affinity between endothermic and ectothermic fish are reduced when adjusted for the in vivo muscle temperature of the animal. Oxygen dissociation rate constants at 20 degrees C for the endothermic species ranged from 34.1 to 49.3 s(-1), whereas those for mackerel and bonito were 102 and 62 s(-1), respectively. Correlated with the low oxygen affinity and fast dissociation kinetics of mackerel Mb is a substitution of alanine for proline that would likely result in a more flexible mackerel protein.

  12. Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM

    PubMed Central

    Altermann, Eric; Russell, W. Michael; Azcarate-Peril, M. Andrea; Barrangou, Rodolphe; Buck, B. Logan; McAuliffe, Olivia; Souther, Nicole; Dobson, Alleson; Duong, Tri; Callanan, Michael; Lick, Sonja; Hamrick, Alice; Cano, Raul; Klaenhammer, Todd R.

    2005-01-01

    Lactobacillus acidophilus NCFM is a probiotic bacterium that has been produced commercially since 1972. The complete genome is 1,993,564 nt and devoid of plasmids. The average GC content is 34.71% with 1,864 predicted ORFs, of which 72.5% were functionally classified. Nine phage-related integrases were predicted, but no complete prophages were found. However, three unique regions designated as potential autonomous units (PAUs) were identified. These units resemble a unique structure and bear characteristics of both plasmids and phages. Analysis of the three PAUs revealed the presence of two R/M systems and a prophage maintenance system killer protein. A spacers interspersed direct repeat locus containing 32 nearly perfect 29-bp repeats was discovered and may provide a unique molecular signature for this organism. In silico analyses predicted 17 transposase genes and a chromosomal locus for lactacin B, a class II bacteriocin. Several mucus- and fibronectin-binding proteins, implicated in adhesion to human intestinal cells, were also identified. Gene clusters for transport of a diverse group of carbohydrates, including fructooligosaccharides and raffinose, were present and often accompanied by transcriptional regulators of the lacI family. For protein degradation and peptide utilization, the organism encoded 20 putative peptidases, homologs for PrtP and PrtM, and two complete oligopeptide transport systems. Nine two-component regulatory systems were predicted, some associated with determinants implicated in bacteriocin production and acid tolerance. Collectively, these features within the genome sequence of L. acidophilus are likely to contribute to the organisms' gastric survival and promote interactions with the intestinal mucosa and microbiota. PMID:15671160

  13. Nucleotide and deduced amino acid sequences of a subtilisin-like serine protease from a deep-sea bacterium, Alkalimonas collagenimarina AC40(T).

    PubMed

    Kurata, Atsushi; Uchimura, Kohsuke; Shimamura, Shigeru; Kobayashi, Tohru; Horikoshi, Koki

    2007-11-01

    The acpI gene encoding an alkaline protease (AcpI) from a deep-sea bacterium, Alkalimonas collagenimarina AC40(T), was shotgun-cloned and sequenced. It had a 1,617-bp open reading frame encoding a protein of 538 amino acids. Based on analysis of the deduced amino acid sequence, AcpI is a subtilisin-like serine protease belonging to subtilase family A. It consists of a prepropeptide, a catalytic domain, and a prepeptidase C-terminal domain like other serine proteases from the genera Pseudomonas, Shewanella, Alteromonas, and Xanthomonas. Heterologous expression of the acpI gene in Escherichia coli cells yielded a 28-kDa recombinant AcpI (rAcpI), suggesting that both the prepropeptide and prepeptidase C-terminal domains were cleaved off to give the mature form. Analysis of N-terminal and C-terminal amino acid sequences of purified rAcpI showed that the mature enzyme would be composed of 273 amino acids. The optimal pH and temperature for the caseinolytic activity of the purified rAcpI were 9.0-9.5 and 45 degrees C in 100 mM glycine-NaOH buffer. Calcium ions slightly enhanced the enzyme activity and stability. The enzyme favorably hydrolyzed gelatin, collagen, and casein. AcpI from A. collagenimarina AC40(T) was also purified from culture broth, and its molecular mass was around 28 kDa, indicating that the cleavage manner of the enzyme is similar to that in E. coli cells.

  14. Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

    PubMed Central

    Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

    1986-01-01

    A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461

  15. [Creation of DNA vaccine vector based on codon-optimized gene of rabies virus glycoprotein (G protein) with consensus amino acid sequence].

    PubMed

    Starodubova, E S; Kuzmenko, Y V; Latanova, A A; Preobrazhenskaya, O V; Karpov, V L

    2016-01-01

    An optimized design of the rabies virus glycoprotein (G protein) for use within DNA vaccines has been suggested. The design represents a territorially adapted antigen constructed taking into account glycoprotein amino acid sequences of the rabies viruses registered in the Russian Federation and the vaccine Vnukovo-32 strain. Based on the created consensus amino acid sequence, the nucleotide codon-optimized sequence of this modified glycoprotein was obtained and cloned into the pVAX1 plasmid (a vector of the last generation used in the creation of DNA vaccines). A twofold increase in this gene expression compared to the expression of the Vnukovo-32 strain viral glycoprotein gene in a similar vector was registered in the transfected cell culture. It has been demonstrated that the accumulation of modified G protein exceeds the number of the control protein synthesized using the plasmid with the Vnukovo-32 strain viral glycoprotein gene by 20 times. Thus, the obtained modified rabies virus glycoprotein can be considered to be a promising DNA vaccine antigen.

  16. Peptide Mass Fingerprinting and N-Terminal Amino Acid Sequencing of Glycosylated Cysteine Protease of Euphorbia nivulia Buch.-Ham.

    PubMed Central

    Badgujar, Shamkant B.; Mahajan, Raghunath T.

    2013-01-01

    A new cysteine protease named Nivulian-II has been purified from the latex of Euphorbia nivulia Buch.-Ham. The apparent molecular mass of Nivulian-II is 43670.846 Da (MALDI TOF/MS). Peptide mass fingerprint analysis revealed peptide matches to Maturase K (Q52ZV1_9MAGN) of Banksia quercifolia. The N-terminal sequence (DFPPNTCCCICC) showed partial homology with those of other cysteine proteinases of biological origin. This is the first paper to characterize a Nivulian-II of E. nivulia latex with respect to amino acid sequencing. PMID:23476742

  17. Complete amino acid sequence of Mytilus anterior byssus retractor paramyosin and its putative phosphorylation site.

    PubMed

    Watabe, S; Iwasaki, K; Funabara, D; Hirayama, Y; Nakaya, M; Kikuchi, K

    2000-01-01

    A cDNA encoding the full-length paramyosin molecule was cloned from the mussel Mytilus galloprovincialis, a species closely related to Mytilus edulis. It contained 3,497 nucleotides (nt), with 79 and 826 nt for the 5' and 3' non-coding regions, respectively. The coding region was composed of 2,592 nt for 864 amino acid residues, a size typical of paramyosin. While genomic DNA digests with either HindIII or PstI exhibited a single band when hybridized with a SacI fragment of paramyosin cDNA, the digests with either EcoRV or EcoRI showed two bands, suggesting that the mussel has at least two genes encoding paramyosin. The mRNAs encoding paramyosin were most abundant in muscle tissues from byssus retractor and adductor muscles. Only traces of paramyosin transcripts were found in the tissue of foot, gill, inner mantle, and outer mantle. The same phosphorylatable peptide previously reported for paramyosin from the bivalve Mercenaria mercenaria, Ser-Arg-Ser-Met-Ser(P)-Val-Ser-Arg (Watabe et al. 1989. Comp Biochem Physiol 94B:813-821) was found in the C-terminal non-helical part of this Mytilus paramyosin. We predict that this particular paramyosin has a coiled-coil structure composed of two alpha-helices that show the heptad repeats (a-b-c-d-e-f-g) with further 28-amino acid repeat zones, where a and d tend to be occupied by nonpolar residues.

  18. DNA Sequence and Expression Variation of Hop (Humulus lupulus) Valerophenone Synthase (VPS), a Key Gene in Bitter Acid Biosynthesis

    PubMed Central

    Castro, Consuelo B.; Whittock, Lucy D.; Whittock, Simon P.; Leggett, Grey; Koutoulis, Anthony

    2008-01-01

    Background The hop plant (Humulus lupulus) is a source of many secondary metabolites, with bitter acids essential in the beer brewing industry and others having potential applications for human health. This study investigated variation in DNA sequence and gene expression of valerophenone synthase (VPS), a key gene in the bitter acid biosynthesis pathway of hop. Methods Sequence variation was studied in 12 varieties, and expression was analysed in four of the 12 varieties in a series across the development of the hop cone. Results Nine single nucleotide polymorphisms (SNPs) were detected in VPS, seven of which were synonymous. The two non-synonymous polymorphisms did not appear to be related to typical bitter acid profiles of the varieties studied. However, real-time quantitative reverse-transcription polymerase chain reaction (qRT-PCR) analysis of VPS expression during hop cone development showed a clear link with the bitter acid content. The highest levels of VPS expression were observed in two triploid varieties, ‘Symphony’ and ‘Ember’, which typically have high bitter acid levels. Conclusions In all hop varieties studied, VPS expression was lowest in the leaves and an increase in expression was consistently observed during the early stages of cone development. PMID:18519445

  19. A knowledge engineering approach to recognizing and extracting sequences of nucleic acids from scientific literature.

    PubMed

    García-Remesal, Miguel; Maojo, Victor; Crespo, José

    2010-01-01

    In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. For such set, we achieved 87.76% precision and 97.70% recall respectively. This method can facilitate different research tasks. These include text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences.

  20. Cloning, sequence, and developmental expression of a type 5, tartrate-resistant, acid phosphatase of rat bone.

    PubMed

    Ek-Rylander, B; Bill, P; Norgård, M; Nilsson, S; Andersson, G

    1991-12-25

    Tartrate-resistant acid phosphatase (TRAP) is a characteristic constituent of osteoclasts and some mononuclear preosteoclasts and, therefore, used as a histochemical and biochemical marker for osteoclasts and bone resorption. We now report the isolation of a 1397-base pair (bp) full-length TRAP/tartrate-resistant acid ATPase (TrATPase) cDNA clone from a neonatal rat calvaria lambda gt11 cDNA library. The cDNA clone consists of a 92-bp untranslated 5'-flank, an open reading frame of 981 bp and a 324-bp untranslated 3'-poly(A)-containing region. The deduced protein sequence of 327 amino acids contains a putative cleavable signal sequence of 21 amino acids. The mature polypeptide of 306 amino acids has a calculated Mr of 34,350 Da and a pI of 9.18, and it contains two potential N-glycosylation sites and the lysosomal targeting sequence DKRFQ. At the protein level, the sequence displays 89-94% homology to TRAP enzymes from human placenta, beef spleen, and uteroferrin and identity to the N terminus of purified rat bone TRAP/TrATPase. An N-terminal amino acid segment is strikingly homologous to the corresponding region in lysosomal and prostatic acid phosphatases. The cDNA recognized a 1.5-kilobase mRNA in long bones and calvaria, and in vitro translation using, as template, mRNA transcribed from the full-length insert yielded an immunoprecipitated product of 34 kDa. In neonatal rats, TRAP/TrATPase mRNA was highly expressed in skeletal tissues, with much lower (less than 10%) levels detected in spleen, thymus, liver, skin, brain, kidney, brain, lung, and heart. In situ hybridization demonstrated specific labeling of osteoclasts at endostal surfaces and bone trabeculae of long bones. Thus, despite the apparent similarity of this osteoclastic TRAP/TrATPase with type 5, tartrate-resistant and purple, acid phosphatases expressed in other mammalian tissues, this gene appears to be preferentially expressed at skeletal sites.

  1. Snake venoms. The amino-acid sequence of protein S5C4 from Dendroaspis jamesoni kaimosae (Jameson's mamba) venom.

    PubMed

    Joubert, F J; Strydom, A J; Taljaard, N

    1978-06-01

    A major component (S5C4) was purified from Jameson's mamba by gel filtration on Sephadex G-50 and by ion-exchange chromotography on CM-cellulose. Protein S5C4 contains 60 amino acid residues and is cross-linked by four intrachain disulphide bridges. The complete primary structure of the protein has been elucidated. The toxicities, the immunochemical properties, the sequence and the invariant amino acid residues of protein S5C4 resemble subgroup II of the angusticeps-type proteins.

  2. Plasma Acylcarnitine Profiles Suggest Incomplete Fatty Acid ß-Oxidation and Altered Tricarboxylic Cycle Activity in Type 2 Diabetic African-American Women

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Inefficient muscle long-chain fatty acid (LCFA) combustion is associated with insulin resistance, but molecular links between mitochondrial fat catabolism and insulin action remain controversial. We hypothesized that plasma acylcarnitine profiling would identify distinct metabolite patterns reflect...

  3. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  4. NMR study suggests a major role for Arg111 in maintaining the structure and dynamical properties of type II human cellular retinoic acid binding protein.

    PubMed

    Wang, L; Yan, H

    1998-09-15

    The solution structure of a site-directed mutant of type-II human cellular retinoic acid binding protein (CRABPII) with Arg111 replaced by methionine (R111M) has been determined by NMR spectroscopy. The sequential assignments of the 1H and 15N resonances of apo-R111M were established by multinuclear multidimensional NMR. The solution structure was calculated from 2302 distance restraints and 77 phi dihedral restraints derived from the NMR data. The root-mean-square deviation of the ensemble of 28 refined conformers that represent the structure from the mean coordinate set derived from them was 0.54 +/- 0.26 and 0.98 +/- 0.23 A for the backbone atoms and all heavy atoms, respectively. The solution structure of apo-R111M is similar to that of wild-type apo-CRABPII. However, there are significant conformational differences between the two proteins, localized mainly to three segments (Leu19-Ala36, Glu73-Cys81, and Leu99-Pro105) clustered around the ligand entrance more than 17 A away from the point mutation. In apo-R111M, all the three segments move toward the center of the ligand entrance so that the opening of the ligand-binding pocket in apo-R111M is much smaller than that in wild-type apo-CRABPII. Furthermore, the ligand-binding pocket of apo-R111M, especially the ligand entrance, is much less flexible than that of apo-CRABPII. Surprisingly, apo-R111M is more similar to holo-CRABPII than to apo-CRABPII in both structure and dynamical properties. The conformational and dynamical changes caused by the mutation are similar to those induced by binding of RA, although the magnitudes of the changes caused by the mutation are smaller than those induced by binding of RA. The results suggest that Arg111 plays a critical role in determining the structure and dynamical properties of CRABPII.

  5. Transcriptome sequencing revealed the transcriptional organization at ribosome-mediated attenuation sites in Corynebacterium glutamicum and identified a novel attenuator involved in aromatic amino acid biosynthesis.

    PubMed

    Neshat, Armin; Mentz, Almut; Rückert, Christian; Kalinowski, Jörn

    2014-11-20

    The Gram-positive bacterium Corynebacterium glutamicum belongs to the order Corynebacteriales and is used as a producer of amino acids at industrial scales. Due to its economic importance, gene expression and particularly the regulation of amino acid biosynthesis has been investigated extensively. Applying the high-resolution technique of transcriptome sequencing (RNA-seq), recently a vast amount of data has been generated that was used to comprehensively analyze the C. glutamicum transcriptome. By analyzing RNA-seq data from a small RNA cDNA library of C. glutamicum, short transcripts in the known transcriptional attenuators sites of the trp operon, the ilvBNC operon and the leuA gene were verified. Furthermore, whole transcriptome RNA-seq data were used to elucidate the transcriptional organization of these three amino acid biosynthesis operons. In addition, we discovered and analyzed the novel attenuator aroR, located upstream of the aroF gene (cg1129). The DAHP synthase encoded by aroF catalyzes the first step in aromatic amino acid synthesis. The AroR leader peptide contains the amino acid sequence motif F-Y-F, indicating a regulatory effect by phenylalanine and tyrosine. Analysis by real-time RT-PCR suggests that the attenuator regulates the transcription of aroF in dependence of the cellular amount of tRNA loaded with phenylalanine when comparing a phenylalanine-auxotrophic C. glutamicum mutant fed with limiting and excess amounts of a phenylalanine-containing dipeptide. Additionally, the very interesting finding was made that all analyzed attenuators are leaderless transcripts.

  6. Extended amino acid sequences around the active-site lysine residue of class-I fructose 1,6-bisphosphate aldolases from rabbit muscle, sturgeon muscle, trout muscle and ox liver.

    PubMed Central

    Benfield, P A; Forcina, B G; Gibbons, I; Perham, R N

    1979-01-01

    1. Amino acid sequences covering the region between residues 173 and 248 [adopting the numbering system proposed by Lai, Nakai & Chang (1974) Science 183, 1204-1206] were derived for trout (Salmo trutta) muscle aldolase and for ox liver aldolase. A comparable sequence was derived for residues 180-248 of sturgeon (Acipenser transmontanus) muscle aldolase. The close homology with the rabbit muscle enzyme was used to align the peptides of the other aldolases from which the sequences were derived. The results also allowed a partial sequence for the N-terminal 39 residues for the ox liver enzyme to be deduced. 2. In the light of the strong homology evinced for these enzymes, a re-investigation of the amino acid sequence of rabbit muscle aldolase between residues 181 and 185 was undertaken. This indicated the presence of a hitherto unsuspected -Ile-Val-sequence between residues 181 and 182 and the need to invert the sequence -Glu-Val- to -Val-Glx- at positions 184 and 185. 3. Comparison of the available amino acid sequences of these enzymes suggested an early evolutionary divergence of the genes for muscle and liver aldolases. It was also consistent with other evidence that the central region of the primary structure of these enzymes (which includes the active-site lysine-227) forms part of a conserved folding domain in the protein subunit. 4. Detailed evidence for the amino acid sequences proposed has been deposited as Suy Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1978) 169, 5. PMID:534504

  7. Complete amino acid sequence of an acidic, cardiotoxic phospholipase A2 from the venom of Ophiophagus hannah (King Cobra): a novel cobra venom enzyme with "pancreatic loop".

    PubMed

    Huang, M Z; Gopalakrishnakone, P; Chung, M C; Kini, R M

    1997-02-15

    A phospholipase A2 (OHV A-PLA2) from the venom of Ophiophagus hannah (King cobra) is an acidic protein exhibiting cardiotoxicity, myotoxicity, and antiplatelet activity. The complete amino acid sequence of OHV A-PLA2 has been determined using a combination of Edman degradation and mass spectrometric techniques. OHV A-PLA2 is composed of a single chain of 124 amino acid residues with 14 cysteines and a calculated molecular weight of 13719 Da. It contains the loop of residues (62-66) found in pancreatic PLA2s and hence belongs to class IB enzymes. This pancreatic loop is between two proline residues (Pro 59 and Pro 68) and contains several hydrophilic amino acids (Ser and Asp). This region has high degree of conformational flexibility and is on the surface of the molecule, and hence it may be a potential protein-protein interaction site. A relatively low sequence homology is found between OHV A-PLA2 and other known cardiotoxic PLA2s, and hence a contiguous segment could not be identified as a site responsible for the cardiotoxic activity.

  8. Snake venoms. The amino-acid sequence of trypsin inhibitor E of Dendroaspis polylepis polylepis (Black Mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1978-06-01

    Trypsin inhibitor E from black mamba venom comprises 59 amino acid residues in a single polypeptide chain, cross-linked by three intrachain disulphide bridges. The complete primary structure of inhibitor E was elucidated. The sequence is homologous with trypsin inhibitors from different sources. Unique among this homologous series of proteinase inhibitors, inhibitor E has an affinity for transition metal ions, exemplified here by Cu2 and Co2+.

  9. The amino acid sequence of the zinc-requiring beta-lactamase II from the bacterium Bacillus cereus 569.

    PubMed

    Ambler, R P; Daniel, M; Fleming, J; Hermoso, J M; Pang, C; Waley, S G

    1985-09-23

    The amino acid sequence of the zinc-requiring beta-lactamase II from Bacillus cereus strain 569 has been determined. It consists of a single polypeptide chain of 227 residues. It is the only example so far fully characterized of a class B beta-lactamase, and is structurally and mechanistically distinct from both the widely distributed class A beta-lactamases (such as the Escherichia coli RTEM enzyme) and from the chromosomally encoded class C enzymes from Gram-negative bacteria.

  10. A simple ligation-based method to increase the information density in sequencing reactions used to deconvolute nucleic acid selections

    PubMed Central

    Childs-Disney, Jessica L.; Disney, Matthew D.

    2008-01-01

    Herein, a method is described to increase the information density of sequencing experiments used to deconvolute nucleic acid selections. The method is facile and should be applicable to any selection experiment. A critical feature of this method is the use of biotinylated primers to amplify and encode a BamHI restriction site on both ends of a PCR product. After amplification, the PCR reaction is captured onto streptavidin resin, washed, and digested directly on the resin. Resin-based digestion affords clean product that is devoid of partially digested products and unincorporated PCR primers. The product's complementary ends are annealed and ligated together with T4 DNA ligase. Analysis of ligation products shows formation of concatemers of different length and little detectable monomer. Sequencing results produced data that routinely contained three to four copies of the library. This method allows for more efficient formulation of structure-activity relationships since multiple active sequences are identified from a single clone. PMID:18065718

  11. The isolation, purification and amino-acid sequence of insulin from the teleost fish Cottus scorpius (daddy sculpin).

    PubMed

    Cutfield, J F; Cutfield, S M; Carne, A; Emdin, S O; Falkmer, S

    1986-07-01

    Insulin from the principal islets of the teleost fish, Cottus scorpius (daddy sculpin), has been isolated and sequenced. Purification involved acid/alcohol extraction, gel filtration, and reverse-phase high-performance liquid chromatography to yield nearly 1 mg pure insulin/g wet weight islet tissue. Biological potency was estimated as 40% compared to porcine insulin. The sculpin insulin crystallised in the absence of zinc ions although zinc is known to be present in the islets in significant amounts. Two other hormones, glucagon and pancreatic polypeptide, were copurified with the insulin, and an N-terminal sequence for pancreatic polypeptide was determined. The primary structure of sculpin insulin shows a number of sequence changes unique so far amongst teleost fish. These changes occur at A14 (Arg), A15 (Val), and B2 (Asp). The B chain contains 29 amino acids and there is no N-terminal extension as seen with several other fish. Presumably as a result of the amino acid substitutions, sculpin insulin does not readily form crystals containing zinc-insulin hexamers, despite the presence of the coordinating B10 His.

  12. Sequence of the canine herpesvirus thymidine kinase gene: taxon-preferred amino acid residues in the alphaherpesviral thymidine kinases.

    PubMed

    Rémond, M; Sheldrick, P; Lebreton, F; Foulon, T

    1995-12-01

    Multiple sequence alignments of evolutionarily related proteins are finding increasing use as indicators of critical amino acid residues necessary for structural stability or involved in functional domains responsible for catalytic activities. In the past, a number of alignments have provided such information for the herpesviral thymidine kinases, for which three-dimensional structures are not yet available. We have sequenced the thymidine kinase gene of a canine herpesvirus, and with a multiple alignment have identified amino acids preferentially conserved in either of two taxons, the genera Varicellovirus and Simplexvirus, of the subfamily Alphaherpesvirinae. Since some regions of the thymidine kinases show otherwise elevated levels of substitutional tolerance, these conserved amino acids are candidates for critical residues which have become fixed through selection during the evolutionary divergence of these enzymes. Several pairs with distinctive patterns of distribution among the various viruses occur in or near highly conserved sequence motifs previously proposed to form the catalytic site, and we speculate that they may represent interacting, co-ordinately variable residues.

  13. The amino acid sequence of a carbohydrate-containing immunoglobulin-light-chain-type amyloid-fibril protein.

    PubMed Central

    Tveteraas, T; Sletten, K; Westermark, P

    1985-01-01

    The amino acid sequence of an amyloid-fibril protein Es492 of immunoglobulin-lambda-light-chain origin (AL) was elucidated. The amyloid fibrils were obtained from the spleen of a patient who died from systemic amyloidosis. The amino acid sequence was elucidated from structural studies of peptides derived from digestion of the protein with trypsin, thermolysin, chymotrypsin and Staphylococcus aureus V8 proteinase and from cleavage of the protein with CNBr and BNPS-skatole. A heterogeneity in the length of the polypeptide was seen in the C-terminal region. The protein was by sequence homology to other lambda-chains shown to be of the V lambda II subgroup. Although an extensive homology was seen, some amino acid residues in positions 26, 31, 32, 40, 44, 93, 97, 98 and 99 have not previously been reported in these positions of V lambda II proteins. The significance of these residues in the fibril formation is unclear. The protein was found to contain carbohydrate, with glycosylation sites in two of the hypervariable regions. PMID:3936482

  14. Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis.

    PubMed Central

    Gorbalenya, A E; Koonin, E V; Donchenko, A P; Blinov, V M

    1989-01-01

    Amino acid sequences of 2 giant non-structural polyproteins (F1 and F2) of infectious bronchitis virus (IBV), a member of Coronaviridae, were compared, by computer-assisted methods, to sequences of a number of other positive strand RNA viral and cellular proteins. By this approach, juxtaposed putative RNA-dependent RNA polymerase, nucleic acid binding ("finger"-like) and RNA helicase domains were identified in F2. Together, these domains might constitute the core of the protein complex involved in the primer-dependent transcription, replication and recombination of coronaviruses. In F1, two cysteine protease-like domains and a growth factor-like one were revealed. One of the putative proteases of IBV is similar to 3C proteases of picornaviruses and related enzymes of como- nepo- and potyviruses. Search of IBV F1 and F2 sequences for sites similar to those cleaved by the latter proteases and intercomparison of the surrounding sequence stretches revealed 13 dipeptides Q/S(G) which are probably cleaved by the coronavirus 3C-like protease. Based on these observations, a partial tentative scheme for the functional organization and expression strategy of the non-structural polyproteins of IBV was proposed. It implies that, despite the general similarity to other positive strand RNA viruses, and particularly to potyviruses, coronaviruses possess a number of unique structural and functional features. PMID:2526320

  15. Recognition of 5'-YpG-3' sequences by coupled stacking/hydrogen bonding interactions with amino acid residues.

    PubMed

    Lamoureux, Jason S; Maynes, Jason T; Glover, J N Mark

    2004-01-09

    The combined biochemical and structural study of hundreds of protein-DNA complexes has indicated that sequence-specific interactions are mediated by two mechanisms termed direct and indirect readout. Direct readout involves direct interactions between the protein and base-specific atoms exposed in the major and minor grooves of DNA. For indirect readout, the protein recognizes DNA by sensing conformational variations in the structure dependent on nucleotide sequence, typically through interactions with the phosphodiester backbone. Based on our recent structure of Ndt80 bound to DNA in conjunction with a search of the existing PDB database, we propose a new method of sequence-specific recognition that utilizes both direct and indirect readout. In this mode, a single amino acid side-chain recognizes two consecutive base-pairs. The 3'-base is recognized by canonical direct readout, while the 5'-base is recognized through a variation of indirect readout, whereby the conformational flexibility of the particular dinucleotide step, namely a 5'-pyrimidine-purine-3' step, facilitates its recognition by the amino acid via cation-pi interactions. In most cases, this mode of DNA recognition helps explain the sequence specificity of the protein for its target DNA.

  16. Introduction of Ca(2+)-binding amino-acid sequence into the T4 lysozyme.

    PubMed

    Leontiev, V V; Uversky, V N; Permyakov, E A; Murzin, A G

    1993-03-05

    The 51-62 loop of T4 phage lysozyme was altered by site-directed mutagenesis to obtain maximal homology with the typical EF-hand motif. A Ca(2+)-binding site was designed and created by replacing both Gly-51 and Asn-53 with aspartic acid. The mutant T4 lysozyme (G51D/N53D) was expressed in Escherichia coli. The activity of the G51D/N53D-mutant was about 60% of that of the wild-type protein. This mutant can bind Ca2+ ions specifically, while the effective dissociation constant was essentially greater than that of the EF-hand proteins. Stability of the G51D/N53D-mutant apo-form to urea- or temperature-induced denaturation was the same as that of the wild-type protein. In the presence of Ca2+ ions in solution the stability of the mutant T4 phage lysozyme was less than that of the wild-type protein. It is suggested that the binding of Ca2+ by the mutant is accompanied by the considerable conformational changes in the 'corrected' loop, which can lead to the Ca(2+)-induced destabilization of the protein.

  17. Bioinformatics analysis of the oxidosqualene cyclase gene and the amino acid sequence in mangrove plants

    NASA Astrophysics Data System (ADS)

    Basyuni, M.; Wati, R.

    2017-01-01

    This study described the bioinformatics methods to analyze seven oxidosqualene cyclase (OSC) genes from mangrove plants on DDBJ/EMBL/GenBank as well as predicted the structure, composition, similarity, subcellular localization and phylogenetic. The physical and chemical properties of seven mangrove OSC showed variation among the genes. The percentage of the secondary structure of seven mangrove OSC genes followed the order of a helix > random coil > extended chain structure. The values of chloroplast or signal peptide were too low, indicated that no chloroplast transit peptide or signal peptide of secretion pathway in mangrove OSC genes. The target peptide value of mitochondria varied from 0.163 to 0.430, indicated it was possible to exist. These results suggested the importance of understanding the diversity and functional of properties of the different amino acids in mangrove OSC genes. To clarify the relationship among the mangrove OSC gene, a phylogenetic tree was constructed. The phylogenetic tree shows that there are three clusters, Kandelia KcMS join with Bruguiera BgLUS, Rhizophora RsM1 was close to Bruguiera BgbAS, and Rhizophora RcCAS join with Kandelia KcCAS. The present study, therefore, supported the previous results that plant OSC genes form distinct clusters in the tree.

  18. Meta-Analysis of Global Transcriptomics Suggests that Conserved Genetic Pathways are Responsible for Quercetin and Tannic Acid Mediated Longevity in C. elegans.

    PubMed

    Pietsch, Kerstin; Saul, Nadine; Swain, Suresh C; Menzel, Ralph; Steinberg, Christian E W; Stürzenbaum, Stephen R

    2012-01-01

    Recent research has highlighted that the polyphenols Quercetin and Tannic acid are capable of extending the lifespan of Caenorhabditis elegans. To gain a deep understanding of the underlying molecular genetics, we analyzed the global transcriptional patterns of nematodes exposed to three concentrations of Quercetin or Tannic acid, respectively. By means of an intricate meta-analysis it was possible to compare the transcriptomes of polyphenol exposure to recently published datasets derived from (i) longevity mutants or (ii) infection. This detailed comparative in silico analysis facilitated the identification of compound specific and overlapping transcriptional profiles and allowed the prediction of putative mechanistic models of Quercetin and Tannic acid mediated longevity. Lifespan extension due to Quercetin was predominantly driven by the metabolome, TGF-beta signaling, Insulin-like signaling, and the p38 MAPK pathway and Tannic acid's impact involved, in part, the amino acid metabolism and was modulated by the TGF-beta and the p38 MAPK pathways. DAF-12, which integrates TGF-beta and Insulin-like downstream signaling, and genetic players of the p38 MAPK pathway therefore seem to be crucial regulators for both polyphenols. Taken together, this study underlines how meta-analyses can provide an insight of molecular events that go beyond the traditional categorization into gene ontology-terms and Kyoto encyclopedia of genes and genomes-pathways. It also supports the call to expand the generation of comparative and integrative databases, an effort that is currently still in its infancy.

  19. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance

    PubMed Central

    Baranzoni, Gian Marco; Reichenberger, Erin R.; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  20. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance.

    PubMed

    Baranzoni, Gian Marco; Fratamico, Pina M; Reichenberger, Erin R; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-07-28

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here.

  1. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  2. ALDH1A1 Deficiency in Gorlin Syndrome Suggests a Central Role for Retinoic Acid and ATM Deficits in Radiation Carcinogenesis.

    PubMed

    Weber, Thomas J; Magnaldo, Thierry; Xiong, Yijia

    2014-09-11

    We hypothesize that aldehyde dehydrogenase 1A1 (ALDH1A1) deficiency will result in impaired ataxia-telangiectasia mutated (ATM) activation in a retinoic acid-sensitive fashion. Data supporting this hypothesis include (1) reduced ATM activation in irradiated primary dermal fibroblasts from ALDH1A1-deficient Gorlin syndrome patients (GDFs), relative to ALDH1A1-positive normal human dermal fibroblasts (NHDFs) and (2) increased ATM activation by X-radiation in GDFs pretreated with retinoic acid, however, the impact of donor variability on ATM activation in fibroblasts was not assessed and is a prudent consideration in future studies. Clonogenic survival of irradiated cells showed differential responses to retinoic acid as a function of treatment time. Long-term (5 Day) retinoic acid treatment functioned as a radiosensitizer and was associated with downregulation of ATM protein levels. Short-term (7 h) retinoic acid treatment showed a trend toward increased survival of irradiated cells and did not downregulate ATM protein levels. Using a newly developed IncubATR technology, which defines changes in bulk chemical bond patterns in live cells, we can discriminate between the NHDF and GDF phenotypes, but treatment of GDFs with retinoic acid does not induce reversion of bulk chemical bond patterns associated with GDFs toward the NHDF phenotype. Collectively, our preliminary investigation of the Gorlin phenotype has identified deficient ALDH1A1 expression associated with deficient ATM activation as a possible susceptibility factor that is consistent with the high incidence of spontaneous and radiation-induced carcinogenesis in these patients. The IncubATR technology exhibits sufficient sensitivity to detect phenotypic differences in live cells that may be relevant to radiation health effects.

  3. ALDH1A1 Deficiency in Gorlin Syndrome Suggests a Central Role for Retinoic Acid and ATM Deficits in Radiation Carcinogenesis

    PubMed Central

    Weber, Thomas J.; Magnaldo, Thierry; Xiong, Yijia

    2014-01-01

    We hypothesize that aldehyde dehydrogenase 1A1 (ALDH1A1) deficiency will result in impaired ataxia-telangiectasia mutated (ATM) activation in a retinoic acid-sensitive fashion. Data supporting this hypothesis include (1) reduced ATM activation in irradiated primary dermal fibroblasts from ALDH1A1-deficient Gorlin syndrome patients (GDFs), relative to ALDH1A1-positive normal human dermal fibroblasts (NHDFs) and (2) increased ATM activation by X-radiation in GDFs pretreated with retinoic acid, however, the impact of donor variability on ATM activation in fibroblasts was not assessed and is a prudent consideration in future studies. Clonogenic survival of irradiated cells showed differential responses to retinoic acid as a function of treatment time. Long-term (5 Day) retinoic acid treatment functioned as a radiosensitizer and was associated with downregulation of ATM protein levels. Short-term (7 h) retinoic acid treatment showed a trend toward increased survival of irradiated cells and did not downregulate ATM protein levels. Using a newly developed IncubATR technology, which defines changes in bulk chemical bond patterns in live cells, we can discriminate between the NHDF and GDF phenotypes, but treatment of GDFs with retinoic acid does not induce reversion of bulk chemical bond patterns associated with GDFs toward the NHDF phenotype. Collectively, our preliminary investigation of the Gorlin phenotype has identified deficient ALDH1A1 expression associated with deficient ATM activation as a possible susceptibility factor that is consistent with the high incidence of spontaneous and radiation-induced carcinogenesis in these patients. The IncubATR technology exhibits sufficient sensitivity to detect phenotypic differences in live cells that may be relevant to radiation health effects. PMID:28250390

  4. Sequence heterogeneity of cannabidiolic- and tetrahydrocannabinolic acid-synthase in Cannabis sativa L. and its relationship with chemical phenotype.

    PubMed

    Onofri, Chiara; de Meijer, Etienne P M; Mandolino, Giuseppe

    2015-08-01

    Sequence variants of THCA- and CBDA-synthases were isolated from different Cannabis sativa L. strains expressing various wild-type and mutant chemical phenotypes (chemotypes). Expressed and complete sequences were obtained from mature inflorescences. Each strain was shown to have a different specificity and/or ability to convert the precursor CBGA into CBDA and/or THCA type products. The comparison of the expressed sequences led to the identification of different mutations, all of them due to SNPs. These SNPs were found to relate to the cannabinoid composition of the inflorescence at maturity and are therefore proposed to have a functional significance. The amount of variation was found to be higher within the CBDAS sequence family than in the THCAS family, suggesting a more recent evolution of THCA-forming enzymes from the CBDAS group. We therefore consider CBDAS as the ancestral type of these synthases.

  5. Isolation, characterization, and amino acid sequences of auracyanins, blue copper proteins from the green photosynthetic bacterium Chloroflexus aurantiacus

    NASA Technical Reports Server (NTRS)

    McManus, J. D.; Brune, D. C.; Han, J.; Sanders-Loehr, J.; Meyer, T. E.; Cusanovich, M. A.; Tollin, G.; Blankenship, R. E.

    1992-01-01

    Three small blue copper proteins designated auracyanin A, auracyanin B-1, and auracyanin B-2 have been isolated from the thermophilic green gliding photosynthetic bacterium Chloroflexus aurantiacus. All three auracyanins are peripheral membrane proteins. Auracyanin A was described previously (Trost, J. T., McManus, J. D., Freeman, J. C., Ramakrishna, B. L., and Blankenship, R. E. (1988) Biochemistry 27, 7858-7863) and is not glycosylated. The two B forms are glycoproteins and have almost identical properties to each other, but are distinct from the A form. The sodium dodecyl sulfate-polyacrylamide gel electrophoresis apparent monomer molecular masses are 14 (A), 18 (B-2), and 22 (B-1) kDa. The amino acid sequences of the B forms are presented. All three proteins have similar absorbance, circular dichroism, and resonance Raman spectra, but the electron spin resonance signals are quite different. Laser flash photolysis kinetic analysis of the reactions of the three forms of auracyanin with lumiflavin and flavin mononucleotide semiquinones indicates that the site of electron transfer is negatively charged and has an accessibility similar to that found in other blue copper proteins. Copper analysis indicates that all three proteins contain 1 mol of copper per mol of protein. All three auracyanins exhibit a midpoint redox potential of +240 mV. Light-induced absorbance changes and electron spin resonance signals suggest that auracyanin A may play a role in photosynthetic electron transfer. Kinetic data indicate that all three proteins can donate electrons to cytochrome c-554, the electron donor to the photosynthetic reaction center.

  6. The phosphate clamp: sequence selective nucleic acid binding profiles and conformational induction of endonuclease inhibition by cationic Triplatin complexes

    PubMed Central

    Prisecaru, Andreea; Molphy, Zara; Kipping, Ralph G.; Peterson, Erica J.; Qu, Yun; Kellett, Andrew; Farrell, Nicholas P.

    2014-01-01

    The substitution-inert polynuclear platinum(II) complex (PPC) series, [{trans-Pt(NH3)2(NH2(CH2)nNH3)}2-μ-(trans-Pt(NH3)2(NH2(CH2)nNH2)2}](NO3)8, where n = 5 (AH78P), 6 (AH78 TriplatinNC) and 7 (AH78H), are potent non-covalent DNA binding agents where nucleic acid recognition is achieved through use of the ‘phosphate clamp' where the square-planar tetra-am(m)ine Pt(II) coordination units all form bidentate N–O–N complexes through hydrogen bonding with phosphate oxygens. The modular nature of PPC–DNA interactions results in high affinity for calf thymus DNA (Kapp ∼5 × 107 M−1). The phosphate clamp–DNA interactions result in condensation of superhelical and B-DNA, displacement of intercalated ethidium bromide and facilitate cooperative binding of Hoechst 33258 at the minor groove. The effect of linker chain length on DNA conformational changes was examined and the pentane-bridged complex, AH78P, was optimal for condensing DNA with results in the nanomolar region. Analysis of binding affinity and conformational changes for sequence-specific oligonucleotides by ITC, dialysis, ICP-MS, CD and 2D-1H NMR experiments indicate that two limiting modes of phosphate clamp binding can be distinguished through their conformational changes and strongly suggest that DNA condensation is driven by minor-groove spanning. Triplatin-DNA binding prevents endonuclease activity by type II restriction enzymes BamHI, EcoRI and SalI, and inhibition was confirmed through the development of an on-chip microfluidic protocol. PMID:25414347

  7. Amino acid sequence and some properties of phytolacain G, a cysteine protease from growing fruit of pokeweed, Phytolacca americana.

    PubMed

    Uchikoba, T; Arima, K; Yonezawa, H; Shimada, M; Kaneda, M

    2000-10-18

    A protease, phytolacain G, has been found to appear on CM-Sepharose ion-exchange chromatography of greenish small-size fruits of pokeweed, Phytolacca americana L, from ca. 2 weeks after flowering, and increases during fruit enlargement. Reddish ripe fruit of the pokeweed contained both phytolacain G and R. The molecular mass of phytolacain G was estimated to be 25.5 kDa by SDS-PAGE. Its amino acid sequence was reconstructed by automated sequence analysis of the peptides obtained after cleavage with Achromobacter protease I, chymotrypsin, and cyanogen bromide. The enzyme is composed of 216 amino acid residues, of which it shares 152 identical amino acid residues (70%) with phytolacain R, 126 (58%) with melain G, 108 (50%) with papain, 106 (49%) with actinidain, and 96 (44%) with stem bromelain. The amino acid residues forming the substrate binding S(2) pocket of papain, Tyr67, Pro68, Trp69, Val133, and Phe207, were predicted to be replaced by Trp, Met, His, Ala, and Ser in phytolacain G, respectively. As a consequence of these substitutions, the S(2) pocket is expected to be less hydrophobic in phytolacain G than in papain.

  8. Meta-Analysis of Global Transcriptomics Suggests that Conserved Genetic Pathways are Responsible for Quercetin and Tannic Acid Mediated Longevity in C. elegans

    PubMed Central

    Pietsch, Kerstin; Saul, Nadine; Swain, Suresh C.; Menzel, Ralph; Steinberg, Christian E. W.; Stürzenbaum, Stephen R.

    2012-01-01

    Recent research has highlighted that the polyphenols Quercetin and Tannic acid are capable of extending the lifespan of Caenorhabditis elegans. To gain a deep understanding of the underlying molecular genetics, we analyzed the global transcriptional patterns of nematodes exposed to three concentrations of Quercetin or Tannic acid, respectively. By means of an intricate meta-analysis it was possible to compare the transcriptomes of polyphenol exposure to recently published datasets derived from (i) longevity mutants or (ii) infection. This detailed comparative in silico analysis facilitated the identification of compound specific and overlapping transcriptional profiles and allowed the prediction of putative mechanistic models of Quercetin and Tannic acid mediated longevity. Lifespan extension due to Quercetin was predominantly driven by the metabolome, TGF-beta signaling, Insulin-like signaling, and the p38 MAPK pathway and Tannic acid’s impact involved, in part, the amino acid metabolism and was modulated by the TGF-beta and the p38 MAPK pathways. DAF-12, which integrates TGF-beta and Insulin-like downstream signaling, and genetic players of the p38 MAPK pathway therefore seem to be crucial regulators for both polyphenols. Taken together, this study underlines how meta-analyses can provide an insight of molecular events that go beyond the traditional categorization into gene ontology-terms and Kyoto encyclopedia of genes and genomes-pathways. It also supports the call to expand the generation of comparative and integrative databases, an effort that is currently still in its infancy. PMID:22493606

  9. A case study on the genetic origin of the high oleic acid trait through FAD2-1 DNA sequence variation in safflower (Carthamus tinctorius L.).

    PubMed

    Rapson, Sara; Wu, Man; Okada, Shoko; Das, Alpana; Shrestha, Pushkar; Zhou, Xue-Rong; Wood, Craig; Green, Allan; Singh, Surinder; Liu, Qing

    2015-01-01

    The safflower (Carthamus tinctorius L.) is considered a strongly domesticated species with a long history of cultivation. The hybridization of safflower with its wild relatives has played an important role in the evolution of cultivars and is of particular interest with regards to their production of high quality edible oils. Original safflower varieties were all rich in linoleic acid, while varieties rich in oleic acid have risen to prominence in recent decades. The high oleic acid trait is controlled by a partially recessive allele ol at a single locus OL. The ol allele was found to be a defective microsomal oleate desaturase FAD2-1. Here we present DNA sequence data and Southern blot analysis suggesting that there has been an ancient hybridization and introgression of the FAD2-1 gene into C. tinctorius from its wild relative C. palaestinus. It is from this gene that FAD2-1Δ was derived more recently. Identification and characterization of the genetic origin and diversity of FAD2-1 could aid safflower breeders in reducing population size and generations required for the development of new high oleic acid varieties by using perfect molecular marker-assisted selection.

  10. Modulation of anti-endotoxin property of Temporin L by minor amino acid substitution in identified phenylalanine zipper sequence.

    PubMed

    Srivastava, Saurabh; Kumar, Amit; Tripathi, Amit Kumar; Tandon, Anshika; Ghosh, Jimut Kanti

    2016-11-01

    A 13-residue frog antimicrobial peptide Temporin L (TempL) possesses versatile antimicrobial activities and is considered a lead molecule for the development of new antimicrobial agents. To find out the amino acid sequences that influence the anti-microbial property of TempL, a phenylalanine zipper-like sequence was identified in it which was not reported earlier. Several alanine-substituted analogs and a scrambled peptide having the same composition of TempL were designed for evaluating the role of this motif. To investigate whether leucine residues instead of phenylalanine residues at 'a' and/or 'd' position(s) of the heptad repeat sequence could alter its antimicrobial property, several TempL analogs were synthesized after replacing these phenylalanine residues with leucine residues. Replacing phenylalanine residues with alanine residues in the phenylalanine zipper sequence significantly compromised the anti-endotoxin property of TempL. This is evident from the higher production of tumor necrosis factor-α and interleukin-6 in lipopolysaccharide (LPS)-stimulated rat bone-marrow-derived macrophage cells in the presence of its alanine-substituted analogs than TempL itself. However, replacement of these phenylalanine residues with leucine residues significantly augmented anti-endotoxin property of TempL. A single alanine-substituted TempL analog (F8A-TempL) showed significantly reduced cytotoxicity but retained the antibacterial activity of TempL, while the two single leucine-substituted analogs (F5L-TempL and F8L-TempL), although exhibiting lower cytotoxicity, were able to retain the antibacterial activity of the parent peptide. The results demonstrate how minor amino acid substitutions in the identified phenylalanine zipper sequence in TempL could yield analogs with better antibacterial and/or anti-endotoxin properties with their plausible mechanism of action.

  11. A suggested model for potato MIVOISAP involving functions of central carbohydrate and amino acid metabolism, as well as actin cytoskeleton and endocytosis.

    PubMed

    Ezquer, Ignacio; Li, Jun; Ovecka, Miroslav; Baroja-Fernández, Edurne; Muñoz, Francisco José; Montero, Manuel; Díaz de Cerio, Jessica; Hidalgo, Maite; Sesma, María Teresa; Bahaji, Abdellatif; Etxeberria, Ed; Pozueta-Romero, Javier

    2010-12-01

    We have recently found that microbial species ranging from Gram-negative and Gram-positive bacteria to different fungi emit volatiles that strongly promote starch accumulation in leaves of both mono- and di-cotyledonous plants. Transcriptome and enzyme activity analyses of potato leaves exposed to volatiles emitted by Alternaria alternata revealed that starch over-accumulation was accompanied by enhanced 3-phosphoglycerate to Pi ratio, and changes in functions involved in both central carbohydrate and amino acid metabolism. Exposure to microbial volatiles also promoted changes in the expression of genes that code for enzymes involved in endocytic uptake and traffic of solutes. With the overall data we propose a metabolic model wherein important determinants of accumulation of exceptionally high levels of starch include (a) upregulation of ADPglucose-producing SuSy, starch synthase III and IV, proteins involved in the endocytic uptake and traffic of sucrose, (b) down-regulation of acid invertase, starch breakdown enzymes and proteins involved in internal amino acid provision, and (c) 3-phosphoglycerate-mediated allosteric activation of ADPglucose pyrophosphorylase.

  12. Identification of microRNAs Actively Involved in Fatty Acid Biosynthesis in Developing Brassica napus Seeds Using High-Throughput Sequencing

    PubMed Central

    Wang, Jia; Jian, Hongju; Wang, Tengyue; Wei, Lijuan; Li, Jiana; Li, Chao; Liu, Liezhao

    2016-01-01

    Seed development has a critical role during the spermatophyte life cycle. In Brassica napus, a major oil crop, fatty acids are synthesized and stored in specific tissues during embryogenesis, and understanding the molecular mechanism underlying fatty acid biosynthesis during seed development is an important research goal. In this study, we constructed three small RNA libraries from early seeds at 14, 21, and 28 days after flowering (DAF) and used high-throughput sequencing to examine microRNA (miRNA) expression. A total of 85 known miRNAs from 30 families and 1160 novel miRNAs were identified, of which 24, including 5 known and 19 novel miRNAs, were found to be involved in fatty acid biosynthesis.bna-miR156b, bna-miR156c, bna-miR156g, novel_mir_1706, novel_mir_1407, novel_mir_173, and novel_mir_104 were significantly down-regulated at 21 DAF and 28 DAF, whereas bna-miR159, novel_mir_1081, novel_mir_19 and novel_mir_555 were significantly up-regulated. In addition, we found that some miRNAs regulate functional genes that are directly involved in fatty acid biosynthesis and that other miRNAs regulate the process of fatty acid biosynthesis by acting on a large number of transcription factors. The miRNAs and their corresponding predicted targets were partially validated by quantitative RT-PCR. Our data suggest that diverse and complex miRNAs are involved in the seed development process and that miRNAs play important roles in fatty acid biosynthesis during seed development. PMID:27822220

  13. Deep sequencing of the TCR-β repertoire of human forkhead box protein 3 (FoxP3)(+) and FoxP3(-) T cells suggests that they are completely distinct and non-overlapping.

    PubMed

    Golding, A; Darko, S; Wylie, W H; Douek, D C; Shevach, E M

    2017-04-01

    Maintenance of peripheral tolerance requires a balance between autoreactive conventional T cells (Tconv ) and thymically derived forkhead box protein 3 (FoxP3)(+) regulatory T cells (tTregs ). Considerable controversy exists regarding the similarities/differences in T cell receptor (TCR) repertoires expressed by Tconv and tTregs . We generated highly purified populations of human adult and cord blood Tconv and tTregs based on the differential expression of CD25 and CD127. The purity of the sorted populations was validated by intracellular staining for FoxP3 and Helios. We also purified an overlap group of CD4 T cells from adult donors to ensure that considerable numbers of shared clonotypes could be detected when present. We used deep sequencing of entire TCR-β CDR3 sequences to analyse the TCR repertoire of Tconv and tTregs . Our studies suggest that both neonatal and adult human Tconv and tTreg cells are, in fact, entirely distinct CD4 T cell lineages.

  14. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms

    PubMed Central

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/ PMID:26424080

  15. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    PubMed

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/.

  16. The human erythrocyte anion-transport protein. Partial amino acid sequence, conformation and a possible molecular mechanism for anion exchange.

    PubMed Central

    Brock, C J; Tanner, M J; Kempf, C

    1983-01-01

    The N-terminal 72 residues of an integral membrane fragment, P5, of the human erythrocyte anion-transport protein, which is known to be directly involved in the anion-exchange process, was shown to have the following amino acid sequence: Met-Val-Pro-Lys-Pro-Gln-Gly-Pro-Leu-Pro-Asn-Thr-Ala-Leu-Leu-Ser-Leu-Val-Leu-Met -Ala-Gly-Thr-Phe-Phe-Phe-Ala-Met-Met-Leu-Arg-Lys-Phe-Lys-Asn-Ser-Ser-Tyr-Phe-Pro-Gly-Lys-Leu-Arg-Arg-Val-Ile-Gly-Asp-Phe-Gly-Val-Pro-Ile-Ser-Ile-Leu-Ile-Met-Val-Leu-Val-Asp-Phe-Phe-Ile-Gln-Asp-Thr-Tyr-Thr-Gln- The structure of this fragment was analysed, with account being taken of the constraints that apply to the folding of integral membrane proteins and the topographical locations of various sites in the sequence. It was concluded that this sequence forms two transmembrane alpha-helices. These are probably part of a cluster of amphipathic transmembrane alpha-helices, which could comprise that part of the protein responsible for transport activity. The presently available evidence relating to the anion-exchange process was considered with the structural features noted in this study and a possible molecular mechanism is proposed. In this model the rearrangement of a network of intramembranous charged pairs mediates the translocation of an anion between anion-binding regions at each surface of the membrane, which are composed of clusters of positively charged amino acids. This model imposes a sequential exchange mechanism on the system. Supplementary material, including Tables and Figures describing the compositions of peptides determined by amino acid analysis and sequence studies, quantitative and qualitative data that provide a residue-by-residue justification for the sequence assignment and a description of modifications to and use of the solid-phase sequencer has been deposited as Supplementary Publication SUP 50123 (12 pages) with the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be

  17. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia

    PubMed Central

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O’Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2013-01-01

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project. PMID:25197440

  18. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia.

    PubMed

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  19. Cloning, sequence analysis and expression of the F1F0-ATPase beta-subunit from wine lactic acid bacteria.

    PubMed

    Sievers, Martin; Uermösi, Christina; Fehlmann, Marc; Krieger, Sibylle

    2003-09-01

    The nucleotide sequences of the genes encoding the F1F0-ATPase beta-subunit from Oenococcus oeni, Leuconostoc mesenteroides subsp. mesenteroides, Pediococcus damnosus, Pediococcus parvulus, Lactobacillus brevis and Lactobacillus hilgardii were determined. Their deduced amino acid sequences showed homology values of 79-98%. Data from the alignment and ATPase tree indicated that O. oeni and L. mesenteroides subsp. mesenteroides formed a group well-separated from P. damnosus and P. parvulus and from the group comprises L. brevis and L. hilgardii. The N-terminus of the F1F0-ATPase beta-subunit of O. oeni contains a stretch of additional 38 amino acid residues. The catalytic site of the ATPase beta-subunit of the investigated strains is characterized by the two conserved motifs GGAGVGKT and GERTRE. The amplified atpD coding sequences were inserted into the pCRT7/CT-TOPO vector using TA-cloning strategy and transformed in Escherichia coli. SDS-PAGE and Western blot analyses confirmed that O. oeni has an ATPase beta-subunit protein which is larger in size than the corresponding molecules from the investigated strains.

  20. Whole-Exome Sequencing in a South American Cohort Links ALDH1A3, FOXN1 and Retinoic Acid Regulation Pathways to Autism Spectrum Disorders

    PubMed Central

    Moreno-Ramos, Oscar A.; Olivares, Ana María; Haider, Neena B.; de Autismo, Liga Colombiana; Lattig, María Claudia

    2015-01-01

    Autism spectrum disorders (ASDs) are a range of complex neurodevelopmental conditions principally characterized by dysfunctions linked to mental development. Previous studies have shown that there are more than 1000 genes likely involved in ASD, expressed mainly in brain and highly interconnected among them. We applied whole exome sequencing in Colombian—South American trios. Two missense novel SNVs were found in the same child: ALDH1A3 (RefSeq NM_000693: c.1514T>C (p.I505T)) and FOXN1 (RefSeq NM_003593: c.146C>T (p.S49L)). Gene expression studies reveal that Aldh1a3 and Foxn1 are expressed in ~E13.5 mouse embryonic brain, as well as in adult piriform cortex (PC; ~P30). Conserved Retinoic Acid Response Elements (RAREs) upstream of human ALDH1A3 and FOXN1 and in mouse Aldh1a3 and Foxn1 genes were revealed using bioinformatic approximation. Chromatin immunoprecipitation (ChIP) assay using Retinoid Acid Receptor B (Rarb) as the immunoprecipitation target suggests RA regulation of Aldh1a3 and Foxn1 in mice. Our results frame a possible link of RA regulation in brain to ASD etiology, and a feasible non-additive effect of two apparently unrelated variants in ALDH1A3 and FOXN1 recognizing that every result given by next generation sequencing should be cautiously analyzed, as it might be an incidental finding. PMID:26352270

  1. Whole-Exome Sequencing in a South American Cohort Links ALDH1A3, FOXN1 and Retinoic Acid Regulation Pathways to Autism Spectrum Disorders.

    PubMed

    Moreno-Ramos, Oscar A; Olivares, Ana María; Haider, Neena B; de Autismo, Liga Colombiana; Lattig, María Claudia

    2015-01-01

    Autism spectrum disorders (ASDs) are a range of complex neurodevelopmental conditions principally characterized by dysfunctions linked to mental development. Previous studies have shown that there are more than 1000 genes likely involved in ASD, expressed mainly in brain and highly interconnected among them. We applied whole exome sequencing in Colombian-South American trios. Two missense novel SNVs were found in the same child: ALDH1A3 (RefSeq NM_000693: c.1514T>C (p.I505T)) and FOXN1 (RefSeq NM_003593: c.146C>T (p.S49L)). Gene expression studies reveal that Aldh1a3 and Foxn1 are expressed in ~E13.5 mouse embryonic brain, as well as in adult piriform cortex (PC; ~P30). Conserved Retinoic Acid Response Elements (RAREs) upstream of human ALDH1A3 and FOXN1 and in mouse Aldh1a3 and Foxn1 genes were revealed using bioinformatic approximation. Chromatin immunoprecipitation (ChIP) assay using Retinoid Acid Receptor B (Rarb) as the immunoprecipitation target suggests RA regulation of Aldh1a3 and Foxn1 in mice. Our results frame a possible link of RA regulation in brain to ASD etiology, and a feasible non-additive effect of two apparently unrelated variants in ALDH1A3 and FOXN1 recognizing that every result given by next generation sequencing should be cautiously analyzed, as it might be an incidental finding.

  2. A molecular mechanism realizing sequence-specific recognition of nucleic acids by TDP-43

    PubMed Central

    Furukawa, Yoshiaki; Suzuki, Yoh; Fukuoka, Mami; Nagasawa, Kenichi; Nakagome, Kenta; Shimizu, Hideaki; Mukaiyama, Atsushi; Akiyama, Shuji

    2016-01-01

    TAR DNA-binding protein 43 (TDP-43) is a DNA/RNA-binding protein containing two consecutive RNA recognition motifs (RRM1 and RRM2) in tandem. Functional abnormality of TDP-43 has been proposed to cause neurodegeneration, but it remains obscure how the physiological functions of this protein are regulated. Here, we show distinct roles of RRM1 and RRM2 in the sequence-specific substrate recognition of TDP-43. RRM1 was found to bind a wide spectrum of ssDNA sequences, while no binding was observed between RRM2 and ssDNA. When two RRMs are fused in tandem as in native TDP-43, the fused construct almost exclusively binds ssDNA with a TG-repeat sequence. In contrast, such sequence-specificity was not observed in a simple mixture of RRM1 and RRM2. We thus propose that the spatial arrangement of multiple RRMs in DNA/RNA binding proteins provides steric effects on the substrate-binding site and thereby controls the specificity of its substrate nucleotide sequences. PMID:26838063

  3. Nucleotide sequence and spatial expression pattern of a drought- and abscisic Acid-induced gene of tomato.

    PubMed

    Plant, A L; Cohen, A; Moses, M S; Bray, E A

    1991-11-01

    The nucleotide sequence of le16, a tomato (Lycopersicon esculentum Mill.) gene induced by drought stress and regulated by abscisic acid specifically in aerial vegetative tissue, is presented. The single open reading frame contained within the gene has the capacity to encode a polypeptide of 12.7 kilodaltons and is interrupted by a small intron. The predicted polypeptide is rich in leucine, glycine, and alanine and has an isoelectric point of 8.7. The amino terminus is hydrophobic and characteristic of signal sequences that target polypeptides for export from the cytoplasm. There is homology (47.2% identity) between the amino terminus of the LE 16 polypeptide and the corresponding amino terminal domain of the maize phospholipid transfer protein. le16 was expressed in drought-stressed leaf, petiole, and stem tissue and to a much lower extent in the pericarp of mature green tomato fruit and developing seeds. No expression was detected in the pericarp of red fruit or in drought-stressed roots. Expression of le16 was also induced in leaf tissue by a variety of other abiotic stresses including polyethylene glycol-mediated water deficit, salinity, cold stress, and heat stress. None of these stresses or direct applications of abscisic acid induced the expression of le16 in the roots of the same plants. The unique expression characteristics of this gene indicates that novel regulatory mechanisms, in addition to endogenous abscisic acid, are involved in controlling gene expression.

  4. Characterization of fatty acid-producing wastewater microbial communities using next generation sequencing technologies

    EPA Science Inventory

    While wastewater represents a viable source of bacterial biodiesel production, very little is known on the composition of these microbial communities. We studied the taxonomic diversity and succession of microbial communities in bioreactors accumulating fatty acids using 454-pyro...

  5. A reliable and sensitive bead-based fluorescence assay for identification of nucleic acid sequences

    NASA Astrophysics Data System (ADS)

    Klamp, Tobias; Yahiatène, Idir; Lampe, André; Schüttpelz, Mark; Sauer, Markus

    2011-03-01

    The sensitive and rapid detection of pathogenic DNA is of tremendous importance in the field of diagnostics. We demonstrate the ability of detecting and quantifying single- and double-stranded pathogenic DNA with picomolar sensitivity in a bead-based fluorescence assay. Selecting appropriate capturing and detection sequences enables rapid (2 h) and reliable DNA quantification. We show that synthetic sequences of S. pneumoniae and M. luteus can be quantified in very small sample volumes (20 μL) across a linear detection range over four orders of magnitude from 1 nM to 1 pM, using a miniaturized wide-field fluorescence microscope without amplification steps. The method offers single molecule detection sensitivity without using complex setups and thus volunteers as simple, robust, and reliable method for the sensitive detection of DNA and RNA sequences.

  6. Lactose synthesis in a monotreme, the echidna (Tachyglossus aculeatus): isolation and amino acid sequence of echidna alpha-lactalbumin.

    PubMed

    Messer, M; Griffiths, M; Rismiller, P D; Shaw, D C

    1997-10-01

    alpha-Lactalbumin and lysozyme were each isolated from echidna (Tachyglossus aculeatus) milk by gel permeation and ion exchange chromatography. The alpha-lactalbumin modified the action of echidna milk galactosyltransferase to promote the synthesis of lactose but had very little effect on bovine galactosyltransferase. Echidna alpha-lactalbumin is a glycosylated protein with an apparent molecular weight of 20,000 (SDS-PAGE) whose concentration in the milk is very low compared with the concentrations of alpha-lactalbumin in the milk of other species. Its amino acid sequence is more similar to that of another monotreme, the platypus (Ornithorhynchus anatinus), than to the sequences of eutherian or marsupial alpha-lactalbumins. Echidna milk lysozyme, even at high concentrations, did not promote the synthesis of lactose by either echidna or bovine galactosyltransferase. We conclude that lactose synthesis in the echidna occurs by the same mechanism as that found in the platypus and other mammals.

  7. Effects of the amino acid sequence on thermal conduction through β-sheet crystals of natural silk protein.

    PubMed

    Zhang, Lin; Bai, Zhitong; Ban, Heng; Liu, Ling

    2015-11-21

    Recent experiments have discovered very different thermal conductivities between the spider silk and the silkworm silk. Decoding the molecular mechanisms underpinning the distinct thermal properties may guide the rational design of synthetic silk materials and other biomaterials for multifunctionality and tunable properties. However, such an understanding is lacking, mainly due to the complex structure and phonon physics associated with the silk materials. Here, using non-equilibrium molecular dynamics, we demonstrate that the amino acid sequence plays a key role in the thermal conduction process through β-sheets, essential building blocks of natural silks and a variety of other biomaterials. Three representative β-sheet types, i.e. poly-A, poly-(GA), and poly-G, are shown to have distinct structural features and phonon dynamics leading to different thermal conductivities. A fundamental understanding of the sequence effects may stimulate the design and engineering of polymers and biopolymers for desired thermal properties.

  8. The Role of HIV-1 gp41 Glycoprotein in Infectious Tropism Inferred from Physico-Chemical Properties of its Amino Acid Sequence

    NASA Astrophysics Data System (ADS)

    Figueroa, E.; Villarreal, C.; Huerta, L.; Cocho, G.

    2006-09-01

    We performed a statistical analysis of the amino acid sequence of the gp41 ectodomain of the Human Immunodeficiency Virus type 1. We found strong correlations between physicochemical properties of highly variable residues and the viral infectious tropism.

  9. New Insights into Poly(Lactic-co-glycolic acid) Microstructure: Using Repeating Sequence Copolymers to Decipher Complex NMR and Thermal Behavior

    PubMed Central

    Stayshich, Ryan M.; Meyer, Tara Y.

    2012-01-01

    Sequence, which Nature uses to spectacular advantage, has not been fully exploited in synthetic copolymers. To investigate the effect of sequence and stereosequence on the physical properties of copolymers a family of complex isotactic, syndiotactic and atactic repeating sequence poly(lactic-co-glycolic acid) copolymers (RSC PLGAs) were prepared and their NMR and thermal behavior was studied. The unique suitability of polymers prepared from the bioassimilable lactic and glycolic acid monomers for biomedical applications makes them ideal candidates for this type of sequence engineering. Polymers with repeating units of LG, GLG and LLG (L = lactic, G = glycolic) with controlled and varied tacticities were synthesized by assembly of sequence specific, stereopure dimeric, trimeric and hexameric segmer units. Specifically labeled deuterated lactic and glycolic acid segmers were likewise prepared and polymerized. Molecular weights for the copolymers ranged from Mn = 12-40 kDa by size exclusion chromatography in THF. Although the effects of sequence-influenced solution conformation were visible in all resonances of the 1H and 13C NMR spectra, the diastereotopic methylene resonances in the 1H NMR (CDCl3) for the glycolic units of the copolymers proved most sensitive. An octad level of resolution, which corresponds to an astounding 31-atom distance between the most separated stereocenters, was observed in some mixed sequence polymers. Importantly, the level of sensitivity of a particular NMR resonance to small differences in sequence was found to depend on the sequence itself. Thermal properties were also correlated with sequence. PMID:20681726

  10. Amino acid sequence of myoglobin from the chiton Liolophura japonica and a phylogenetic tree for molluscan globins.

    PubMed

    Suzuki, T; Furukohri, T; Okamoto, S

    1993-02-01

    Myoglobin was isolated from the radular muscle of the chiton Liolophura japonica, a primitive archigastropodic mollusc. Liolophura contains three monomeric myoglobins (I, II, and III), and the complete amino acid sequence of myoglobin I has been determined. It is composed of 145 amino acid residues, and the molecular mass was calculated to be 16,070 D. The E7 distal histidine, which is replaced by valine or glutamine in several molluscan globins, is conserved in Liolophura myoglobin. The autoxidation rate at physiological conditions indicated that Liolophura oxymyoglobin is fairly stable when compared with other molluscan myoglobins. The amino acid sequence of Liolophura myoglobin shows low homology (11-21%) with molluscan dimeric myoglobins and hemoglobins, but shows higher homology (26-29%) with monomeric myoglobins from the gastropodic molluscs Aplysia, Dolabella, and Bursatella. A phylogenetic tree was constructed from 19 molluscan globin sequences. The tree separated them into two distinct clusters, a cluster for muscle myoglobins and a cluster for erythrocyte or gill hemoglobins. The myoglobin cluster is divided further into two subclusters, corresponding to monomeric and dimeric myoglobins, respectively. Liolophura myoglobin was placed on the branch of monomeric myoglobin lineage, showing that it diverged earlier from other monomeric myoglobins. The hemoglobin cluster is also divided into two subclusters. One cluster contains homodimeric, heterodimeric, tetrameric, and didomain chains of erythrocyte hemoglobins of the blood clams Anadara, Scapharca, and Barbatia. Of special interest is the other subcluster. It consists of three hemoglobin chains derived from the bacterial symbiontharboring clams Calyptogena and Lucina, in which hemoglobins are supposed to play an important role in maintaining the symbiosis with sulfide bacteria.

  11. Amino acid sequences of two novel long-chain neurotoxins from the venom of the sea snake Laticauda colubrina.

    PubMed

    Kim, H S; Tamiya, N

    1982-11-01

    From the venom of a population of the sea snake Laticauda colubrina from the Solomon Islands, a neurotoxic component, Laticauda colubrina a (toxin Lc a), was isolated in 16.6% (A280) yield. Similarly, from the venom of a population of L. colubrina from the Philippines, a neurotoxic component, Laticauda colubrina b (toxin Lc b), was obtained in 10.0% (A280) yield. The LD50 values of these toxins were 0.12 microgram/g body wt. on intramuscular injection in mice. Toxins Lc a and Lc b were each composed of molecules containing 69 amino acid residues with eight half-cystine residues. The complete amino acid sequences of these two toxins were elucidated. Toxins Lc a and Lc b are different from each other at five positions of their sequences, namely at positions 31 (Phe/Ser), 32 (Leu/Ile), 33 (Lys/Arg), 50 (Pro/Arg) and 53 (Asp/His) (residues in parentheses give the residues in toxins Lc a and Lc b respectively). Toxins Lc a and Lc b have a novel structure in that they have only four disulphide bridges, although the whole amino acid sequences are homologous to those of other known long-chain neurotoxins. It is remarkable that toxins Lc a and Lc b are not coexistent at the detection error of 6% of the other toxin. Populations of Laticauda colubrina from the Solomon Islands and from the Philippines have either toxin Lc a or toxin Lc b and not both of them.

  12. Rapid Nucleic Acid Sequencing Methods--Alternative Approaches to Facilitating Learning.

    ERIC Educational Resources Information Center

    Bryce, Charles F. A.

    1982-01-01

    Because advanced students had difficulty in interpreting cleavage patterns obtained by gel electrophoresis related to rapid sequencing techniques for DNA and RNA, several formats were developed to aid in understanding this topic. Formats included print, print plus scrambled print, interactive computer-based instruction, and high-resolution…

  13. Nucleotide sequence of a lysine transfer ribonucleic Acid from bakers' yeast.

    PubMed

    Madison, J T; Boguslawski, S J; Teetor, G H

    1972-05-12

    The nucleotide sequence of one of the two major lysine transfer RNA's from bakers' yeast has been determined. Its structure is compared to that of a lysine tRNA from a haploid yeast. A total of 21 nucleotides differ in the two molecules. Only the T-psi-C-G (thymidine-pseudouridine-cytidine-guanosine) loop and its supporting stem are identical.

  14. Amorphous/nanocrystalline silicon biosensor for the specific identification of unamplified nucleic acid sequences using gold nanoparticle probes

    NASA Astrophysics Data System (ADS)

    Martins, Rodrigo; Baptista, Pedro; Raniero, Leandro; Doria, Gonçalo; Silva, Leonardo; Franco, Ricardo; Fortunato, Elvira

    2007-01-01

    Amorphous/nanocrystalline silicon pi 'ii'n devices fabricated on micromachined glass substrates are integrated with oligonucleotide-derivatized gold nanoparticles for a colorimetric detection method. The method enables the specific detection and quantification of unamplified nucleic acid sequences (DNA and RNA) without the need to functionalize the glass surface, allowing for resolution of single nucleotide differences between DNA and RNA sequences—single nucleotide polymorphism and mutation detection. The detector's substrate is glass and the sample is directly applied on the back side of the biosensor, ensuring a direct optical coupling of the assays with a concomitant maximum photon capture and the possibility to reuse the sensor.

  15. A possible general mechanism for ultrasound-assisted extraction (UAE) suggested from the results of UAE of chlorogenic acid from Cynara scolymus L. (artichoke) leaves.

    PubMed

    Saleh, I A; Vinatoru, M; Mason, T J; Abdel-Azim, N S; Aboutabl, E A; Hammouda, F M

    2016-07-01

    The use of ultrasound-assisted extraction (UAE) for the extraction of chlorogenic acid (CA) from Cynara scolymus L., (artichoke) leaves using 80% methanol at room temperature over 15 min gave a significant increase in yield (up to a 50%) compared with maceration at room temperature and close to that obtained by boiling over the same time period. A note of caution is introduced when comparing UAE with Soxhlet extraction because, in the latter case, the liquid entering the Soxhlet extractor is more concentrated in methanol (nearly 100%) that the solvent in the reservoir (80% methanol) due to fractionation during distillation. The mechanism of UAE is discussed in terms of the effects of cavitation on the swelling index, solvent diffusion and the removal of a stagnant layer of solvent surrounding the plant material.

  16. Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles

    PubMed Central

    Rodrigue, Nicolas; Philippe, Hervé; Lartillot, Nicolas

    2010-01-01

    Modeling the interplay between mutation and selection at the molecular level is key to evolutionary studies. To this end, codon-based evolutionary models have been proposed as pertinent means of studying long-range evolutionary patterns and are widely used. However, these approaches have not yet consolidated results from amino acid level phylogenetic studies showing that selection acting on proteins displays strong site-specific effects, which translate into heterogeneous amino acid propensities across the columns of alignments; related codon-level studies have instead focused on either modeling a single selective context for all codon columns, or a separate selective context for each codon column, with the former strategy deemed too simplistic and the latter deemed overparameterized. Here, we integrate recent developments in nonparametric statistical approaches to propose a probabilistic model that accounts for the heterogeneity of amino acid fitness profiles across the coding positions of a gene. We apply the model to a dozen real protein-coding gene alignments and find it to produce biologically plausible inferences, for instance, as pertaining to site-specific amino acid constraints, as well as distributions of scaled selection coefficients. In their account of mutational features as well as the heterogeneous regimes of selection at the amino acid level, the modeling approaches studied here can form a backdrop for several extensions, accounting for other selective features, for variable population size, or for subtleties of mutational features, all with parameterizations couched within population-genetic theory. PMID:20176949

  17. Amino acid sequence of a neurotoxic phospholipase A2 enzyme from common death adder (Acanthophis antracticus) venom.

    PubMed

    van der Weyden, L; Hains, P; Broady, K; Shaw, D; Milburn, P

    2001-02-01

    The amino acid sequence of the first neurotoxic phospholipase A2, acanthoxin A1, purified from the venom of the Common death adder (Acanthophis antarcticus) was determined. Acanthoxin A1 shows high homology with other Australian elapid PLA2 neurotoxins, in particular Acanthin-I and -II, also from Death adder, Pseudexin A from the Red-bellied black snake (Pseudechis porphyriacus), and Pa-12a and Pa-9c from the King brown snake (Pseudechis australis). Acanthoxin A1 is a single-chain 118 amino acid residue PLA2, including 14 half cystine residues and the essential residues forming the ubiquitous calcium binding pocket and catalytic site. Critical analysis of the residues hypothesized to be important for neurotoxicity is presented.

  18. An Interpretation of the Ancestral Codon from Miller’s Amino Acids and Nucleotide Correlations in Modern Coding Sequences

    PubMed Central

    Carels, Nicolas; de Leon, Miguel Ponce

    2015-01-01

    Purine bias, which is usually referred to as an “ancestral codon”, is known to result in short-range correlations between nucleotides in coding sequences, and it is common in all species. We demonstrate that RWY is a more appropriate pattern than the classical RNY, and purine bias (Rrr) is the product of a network of nucleotide compensations induced by functional constraints on the physicochemical properties of proteins. Through deductions from universal correlation properties, we also demonstrate that amino acids from Miller’s spark discharge experiment are compatible with functional primeval proteins at the dawn of living cell radiation on earth. These amino acids match the hydropathy and secondary structures of modern proteins. PMID:25922573

  19. Deduced amino acid sequence, functional expression, and unique enzymatic properties of the form I and form II ribulose bisphosphate carboxylase/oxygenase from the chemoautotrophic bacterium Thiobacillus denitrificans.

    PubMed

    Hernandez, J M; Baker, S H; Lorbach, S C; Shively, J M; Tabita, F R

    1996-01-01

    The cbbL cbbS and cbbM genes of Thiobacillus denitrificans, encoding form I and form II ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO), respectively, were found to complement a RubisCO-negative mutant of Rhodobacter sphaeroides to autotrophic growth. Endogenous T. denitrificans promoters were shown to function in R. sphaeroides, resulting in high levels of cbbL cbbS and cbbM expression in the R. sphaeroides host. This expression system provided high levels of both T. denitrificans enzymes, each of which was highly purified. The deduced amino acid sequence of the form I enzyme indicated that the large subunit was closely homologous to previously sequenced form I RubisCO enzymes from sulfur-oxidizing bacteria. The form I T. denitrificans enzyme possessed a very low substrate specificity factor and did not exhibit fallover, and yet this enzyme showed a poor ability to recover from incubation with ribulose 1,5-bisphosphate. The deduced amino acid sequence of the form II T. denitrificans enzyme resembled those of other form II RubisCO enzymes. The substrate specificity factor was characteristically low, and the lack of fallover and the inhibition by ribulose 1,5-bisphosphate were similar to those of form II RubisCO obtained from nonsulfur purple bacteria. Both form I and form II RubisCO from T. denitrificans possessed high KCO2 values, suggesting that this organism might suffer in environments containing low levels of dissolved CO2. These studies present the initial description of the kinetic properties of form I and form II RubisCO from a chemoautotrophic bacterium that synthesizes both types of enzyme.

  20. Deduced amino acid sequence, functional expression, and unique enzymatic properties of the form I and form II ribulose bisphosphate carboxylase/oxygenase from the chemoautotrophic bacterium Thiobacillus denitrificans.

    PubMed Central

    Hernandez, J M; Baker, S H; Lorbach, S C; Shively, J M; Tabita, F R

    1996-01-01

    The cbbL cbbS and cbbM genes of Thiobacillus denitrificans, encoding form I and form II ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO), respectively, were found to complement a RubisCO-negative mutant of Rhodobacter sphaeroides to autotrophic growth. Endogenous T. denitrificans promoters were shown to function in R. sphaeroides, resulting in high levels of cbbL cbbS and cbbM expression in the R. sphaeroides host. This expression system provided high levels of both T. denitrificans enzymes, each of which was highly purified. The deduced amino acid sequence of the form I enzyme indicated that the large subunit was closely homologous to previously sequenced form I RubisCO enzymes from sulfur-oxidizing bacteria. The form I T. denitrificans enzyme possessed a very low substrate specificity factor and did not exhibit fallover, and yet this enzyme showed a poor ability to recover from incubation with ribulose 1,5-bisphosphate. The deduced amino acid sequence of the form II T. denitrificans enzyme resembled those of other form II RubisCO enzymes. The substrate specificity factor was characteristically low, and the lack of fallover and the inhibition by ribulose 1,5-bisphosphate were similar to those of form II RubisCO obtained from nonsulfur purple bacteria. Both form I and form II RubisCO from T. denitrificans possessed high KCO2 values, suggesting that this organism might suffer in environments containing low levels of dissolved CO2. These studies present the initial description of the kinetic properties of form I and form II RubisCO from a chemoautotrophic bacterium that synthesizes both types of enzyme. PMID:8550452

  1. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    SciTech Connect

    Xie, Gary; Dalin, Eileen; Tice, Hope; Chertkov, Olga; Land, Miriam L

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer-ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi-cellulose. This bacterium is also considered as a potential probiotic. Complete genome squence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  2. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    SciTech Connect

    Rhee, Mun Su; Moritz, Brelan E.; Xie, Gary; Glavina Del Rio, Tijana; Dalin, Eileen; Tice, Hope; Bruce, David; Goodwin, Lynne A.; Chertkov, Olga; Brettin, Thomas S; Han, Cliff; Detter, J. Chris; Pitluck, Sam; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, Keelnathan T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer- ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this spo- rogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attrac- tive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi- cellulose. This bacterium is also considered as a potential probiotic. Complete genome se- quence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  3. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... submissions in computer readable form. (a) The computer readable form required by § 1.821(e) shall meet the following requirements: (1) The computer readable form shall contain a single “Sequence Listing” as either...

  4. Protein ordered sequences are formed by random joining of amino acids in protein 0(th)-order structure, followed by evolutionary process.

    PubMed

    Ikehara, Kenji

    2014-12-01

    Only random processes should occur on the primitive Earth. In contrast, many ordered sequences are synthesized according to genetic information on the present Earth. In this communication, I have proposed an idea that protein 0(th)-order structures or specific amino acid compositions would mediate the transfer from random process to formation of ordered sequences, after formation of double-stranded genes.

  5. Biochemical characterization of the murine S100A9 (MRP14) protein suggests that it is functionally equivalent to its human counterpart despite its low degree of sequence homology.

    PubMed

    Nacken, W; Sopalla, C; Pröpper, C; Sorg, C; Kerkhoff, C

    2000-01-01

    Due to the low degree of sequence similarity it has been speculated that murine and human S100A9 (MRP14), an inflammatory marker protein belonging to the S100 protein family, may have different cellular functions in mouse and man. The present study was undertaken to investigate the murine S100A9 protein (mS100A9) biochemically. We demonstrate that in murine peripheral CD11b+ cells up to 20% of the protein of the cytosolic fraction consists of mS100A9 and that several minor mS100A9 isoforms are present. Cell fractionation experiments with CD11b+ murine leukocytes showed that mS100A9 is found in the cytosol as well as in the insoluble fraction. Transient expression of a green fluorescence protein-mS100A9 fusion in mammalian cells revealed that mS100A9 is localized in neither the nucleus nor the vesicles. Recombinantly expressed murine S100A9 interacts in vitro with murine and human S100A8 in an in vitro glutathione S-transferase pull-down assay. Homodimerization was not observed. For further biochemical analysis the myeloid 32D cell line is presented as a suitable model, to study murine myeloid expressed S100 proteins. Both murine S100A9 and its dimerization partner mS100A8 are expressed at the onset of granulocyte-colony stimulating factor induced myeloid differentiation. Substantial amounts of this complex are constitutively secreted by granulocytic 32D cells into the medium. In summary, these data suggest, that the human and murine S100A9 may share a higher degree of functional homology than of sequence similarity.

  6. From Amino Acid to Glucosinolate Biosynthesis: Protein Sequence Changes in the Evolution of Methylthioalkylmalate Synthase in Arabidopsis[W][OA

    PubMed Central

    de Kraker, Jan-Willem; Gershenzon, Jonathan

    2011-01-01

    Methylthioalkylmalate synthase (MAM) catalyzes the committed step in the side chain elongation of Met, yielding important precursors for glucosinolate biosynthesis in Arabidopsis thaliana and other Brassicaceae species. MAM is believed to have evolved from isopropylmalate synthase (IPMS), an enzyme involved in Leu biosynthesis, based on phylogenetic analyses and an overlap of catalytic abilities. Here, we investigated the changes in protein structure that have occurred during the recruitment of IPMS from amino acid to glucosinolate metabolism. The major sequence difference between IPMS and MAM is the absence of 120 amino acids at the C-terminal end of MAM that constitute a regulatory domain for Leu-mediated feedback inhibition. Truncation of this domain in Arabidopsis IPMS2 results in loss of Leu feedback inhibition and quaternary structure, two features common to MAM enzymes, plus an 8.4-fold increase in the kcat/Km for a MAM substrate. Additional exchange of two amino acids in the active site resulted in a MAM-like enzyme that had little residual IPMS activity. Hence, combination of the loss of the regulatory domain and a few additional amino acid exchanges can explain the evolution of MAM from IPMS during its recruitment from primary to secondary metabolism. PMID:21205930

  7. Structural, Biochemical, and Phylogenetic Analyses Suggest That Indole-3-Acetic Acid Methyltransferase Is an Evolutionarily Ancient Member of the SABATH Family

    SciTech Connect

    Zhao,N.; Ferrer, J.; Ross, J.; Guan, J.; Yang, Y.; Pichersky, E.; Noel, J.; Chen, F.

    2008-01-01

    The plant SABATH protein family encompasses a group of related small-molecule methyltransferases (MTs) that catalyze the S-adenosyl-L-methionine-dependent methylation of natural chemicals encompassing widely divergent structures. Indole-3-acetic acid (IAA) methyltransferase (IAMT) is a member of the SABATH family that modulates IAA homeostasis in plant tissues through methylation of IAA's free carboxyl group. The crystal structure of Arabidopsis (Arabidopsis thaliana) IAMT (AtIAMT1) was determined and refined to 2.75 Angstroms resolution. The overall tertiary and quaternary structures closely resemble the two-domain bilobed monomer and the dimeric arrangement, respectively, previously observed for the related salicylic acid carboxyl methyltransferase from Clarkia breweri (CbSAMT). To further our understanding of the biological function and evolution of SABATHs, especially of IAMT, we analyzed the SABATH gene family in the rice (Oryza sativa) genome. Forty-one OsSABATH genes were identified. Expression analysis showed that more than one-half of the OsSABATH genes were transcribed in one or multiple organs. The OsSABATH gene most similar to AtIAMT1 is OsSABATH4. Escherichia coli-expressed OsSABATH4 protein displayed the highest level of catalytic activity toward IAA and was therefore named OsIAMT1. OsIAMT1 exhibited kinetic properties similar to AtIAMT1 and poplar IAMT (PtIAMT1). Structural modeling of OsIAMT1 and PtIAMT1 using the experimentally determined structure of AtIAMT1 reported here as a template revealed conserved structural features of IAMTs within the active-site cavity that are divergent from functionally distinct members of the SABATH family, such as CbSAMT. Phylogenetic analysis revealed that IAMTs from Arabidopsis, rice, and poplar (Populus spp.) form a monophyletic group. Thus, structural, biochemical, and phylogenetic evidence supports the hypothesis that IAMT is an evolutionarily ancient member of the SABATH family likely to play a critical role in

  8. Hybridization probe for femtomolar quantification of selected nucleic acid sequences on a disposable electrode.

    PubMed

    Jenkins, Daniel M; Chami, Bilal; Kreuzer, Matthias; Presting, Gernot; Alvarez, Anne M; Liaw, Bor Yann

    2006-04-01

    Mixed monolayers of electroactive hybridization probes on gold surfaces of a disposable electrode were investigated as a technology for simple, sensitive, selective, and rapid gene identification. Hybridization to the ferrocene-labeled hairpin probes reproducibly diminished cyclic redox currents, presumably due to a displacement of the label from the electrode. Observed peak current densities were roughly 1000x greater than those observed in previous studies, such that results could easily be interpreted without the use of algorithms to correct for background polarization currents. Probes were sensitive to hybridization with a number of oligonucleotide sequences with varying homology, but target oligonucleotides could be distinguished from competing nontarget sequences based on unique "melting" profiles from the probe. Detection limits were demonstrated down to nearly 100 fM, which may be low enough to identify certain genetic conditions or infections without amplification. This technology has rich potential for use in field devices for gene identification as well as in gene microarrays.

  9. The amino acid sequence of Neurospora NADP-specific glutamate dehydrogenase. The tryptic peptides.

    PubMed Central

    Wootton, J C; Taylor, J G; Jackson, A A; Chambers, G K; Fincham, J R

    1975-01-01

    The NADP-specific glutamate dehydrogenase of Neurospora crassa was digested with trypsin, and peptides accounting for 441 out of the 452 residues of the polypeptide chain were isolated and substantially sequenced. Additional experimental detail has been deposited as Supplementary Publication SUP 50052 (11 pages) with the British Library (Lending Division), Boston Spa, Wetherby, W. Yorkshire LS23 7BQ, U.K., from whom copies may be obtained under the terms given in Biochem J. (1975) 145, 5. PMID:1000

  10. Analys. DNA: a computer program for nucleic acid sequence data processing.

    PubMed

    Amthauer, R; Araya, A

    1984-09-01

    A computer program written in BASIC language is described. The program allows processing and analysis of DNA data and has been designed to be used by persons with little or no computer experience. The operator using different options can search for direct homologies with varying degrees of matching, generate complementary strands, find restriction sites, invert the polarity of the sequence and edit a print-out.

  11. Heavy-atom Database System: a tool for the preparation of heavy-atom derivatives of protein crystals based on amino-acid sequence and crystallization conditions.

    PubMed

    Sugahara, Michihiro; Asada, Yukuhiko; Ayama, Haruhiko; Ukawa, Hisashi; Taka, Hideyuki; Kunishima, Naoki

    2005-09-01

    Heavy-atom Database System (HATODAS) is a WWW-based tool designed to assist the heavy-atom derivatization of proteins. The conventional procedure for the preparation of derivatives is usually a time-consuming 'trial-and-error' process. The present program provides a solution for this problem using a database of known heavy-atom derivatives. A database search suggests potential heavy-atom reagents for any target protein based on its amino-acid sequence and crystallization conditions. A mining of the database identified 93 preferred motifs for heavy-atom binding. The motifs are observed frequently at the actual heavy-atom-binding sites encountered in the process of structure determination.

  12. Complete genome sequence of Bacillus methanolicus MGA3, a thermotolerant amino acid producing methylotroph.

    PubMed

    Irla, Marta; Neshat, Armin; Winkler, Anika; Albersmeier, Andreas; Heggeset, Tonje M B; Brautaset, Trygve; Kalinowski, Jörn; Wendisch, Volker F; Rückert, Christian

    2014-10-20

    Bacillus methanolicus MGA3 was isolated from freshwater marsh soil and characterised as a thermotolerant and methylotrophic L-glutamate producer. The complete genome consists of a circular chromosome and the two plasmids pBM19 and pBM69. It includes genomic information about C1 metabolism and amino acid biosynthetic pathways.

  13. Size and distribution of polyadenylic acid sequences in Drosophila polytene DNA and RNA.

    PubMed

    Alonso, C; Pages, M; García, M L

    1977-12-02

    [3H]Poly(U) hybridizes very rapidly to polytene DNA from Drosophila hydei. When hybridization is performed at 30 degrees C in 2 X SSC to a large excess of DNA, 95% of the poly(U) becomes ribonuclease resistant. Also, complementary RNA transcribed in vitro from polytene DNA hybridizes to poly(U). 023--0.25% of the DNA is composed of (dA)-rich sequences and 0.23--0.31% of cRNA hybridizes to [3H]poly(U). The length of the (dA)-rich sequences on the DNA and cRNA is 40 nucleotides. The Tm values of these hybrids formed between DNA or cRNA-poly(U) is 45 degrees C. The poly(A) fragments from cytoplasmic RNA ranged from 80 to 170 nucleotides in lenght, and migrated in polyacrilamide gels as a broad peak. The average sizes of the poly(A) fragments from the poly(A)-containing RNA transcribed by nuclei isolated from salivary glands in vivo or in vitro were 40, 70, 170 and 70 nucleotides, respectively. Hybridization in situ of [3H]-poly(U) to chromosome squashes indicated that the (dA)-rich sequences are randomly distributed over the whole genome.

  14. Predicting Protein–Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids

    PubMed Central

    Kuo, Tzu-Hao; Li, Kuo-Bin

    2016-01-01

    Information about the interface sites of Protein–Protein Interactions (PPIs) is useful for many biological research works. However, despite the advancement of experimental techniques, the identification of PPI sites still remains as a challenging task. Using a statistical learning technique, we proposed a computational tool for predicting PPI interaction sites. As an alternative to similar approaches requiring structural information, the proposed method takes all of the input from protein sequences. In addition to typical sequence features, our method takes into consideration that interaction sites are not randomly distributed over the protein sequence. We characterized this positional preference using protein complexes with known structures, proposed a numerical index to estimate the propensity and then incorporated the index into a learning system. The resulting predictor, without using structural information, yields an area under the ROC curve (AUC) of 0.675, recall of 0.597, precision of 0.311 and accuracy of 0.583 on a ten-fold cross-validation experiment. This performance is comparable to the previous approach in which structural information was used. Upon introducing the B-factor data to our predictor, we demonstrated that the AUC can be further improved to 0.750. The tool is accessible at http://bsaltools.ym.edu.tw/predppis. PMID:27792167

  15. Purification and N-terminal amino acid sequence comparisons of structural proteins from retrovirus-D/Washington and Mason-Pfizer monkey virus.

    PubMed Central

    Henderson, L E; Sowder, R; Smythers, G; Benveniste, R E; Oroszlan, S

    1985-01-01

    A new D-type retrovirus originally designated SAIDS-D/Washington and here referred to as retrovirus-D/Washington (R-D/W) was recently isolated at the University of Washington Primate Center, Seattle, Wash., from a rhesus monkey with an acquired immunodeficiency syndrome and retroperitoneal fibromatosis. To better establish the relationship of this new D-type virus to the prototype D-type virus, Mason-Pfizer monkey virus (MPMV), we have purified and compared six structural proteins from each virus. The proteins purified from each D-type retrovirus include p4, p10, p12, p14, p27, and a phosphoprotein designated pp18 for MPMV and pp20 for R-D/W. Amino acid analysis and N-terminal amino acid sequence analysis show that the p4, p12, p14, and p27 proteins of R-D/W are distinct from the homologous proteins of MPMV but that these proteins from the two different viruses share a high degree of amino acid sequence homology. The p10 proteins from the two viruses have similar amino acid compositions, and both are blocked to N-terminal Edman degradation. The phosphoproteins from the two viruses each contain phosphoserine but are different from each other in amino acid composition, molecular weight, and N-terminal amino acid sequence. The data thus show that each of the R-D/W proteins examined is distinguishable from its MPMV homolog and that a major difference between these two D-type retroviruses is found in the viral phosphoproteins. The N-terminal amino acid sequences of D-type retroviral proteins were used to search for sequence homologies between D-type and other retroviral amino acid sequences. An unexpected amino acid sequence homology was found between R-D/W pp20 (a gag protein) and a 28-residue segment of the env precursor polyprotein of Rous sarcoma virus. The N-terminal amino acid sequences of the D-type major gag protein (p27) and the nucleic acid-binding protein (p14) show only limited amino acid sequence homology to functionally homologous proteins of C

  16. Anti-inflammation activities of mycosporine-like amino acids (MAAs) in response to UV radiation suggest potential anti-skin aging activity.

    PubMed

    Suh, Sung-Suk; Hwang, Jinik; Park, Mirye; Seo, Hyo Hyun; Kim, Hyoung-Shik; Lee, Jeong Hun; Moh, Sang Hyun; Lee, Taek-Kyun

    2014-10-14

    Certain photosynthetic marine organisms have evolved mechanisms to counteract UV-radiation by synthesizing UV-absorbing compounds, such as mycosporine-like amino acids (MAAs). In this study, MAAs were separated from the extracts of marine green alga Chlamydomonas hedleyi using HPLC and were identified as porphyra-334, shinorine, and mycosporine-glycine (mycosporine-Gly), based on their retention times and maximum absorption wavelengths. Furthermore, their structures were confirmed by triple quadrupole MS/MS. Their roles as UV-absorbing compounds were investigated in the human fibroblast cell line HaCaT by analyzing the expression levels of genes associated with antioxidant activity, inflammation, and skin aging in response to UV irradiation. The mycosporine-Gly extract, but not the other MAAs, had strong antioxidant activity in the 2,2-diphenyl-1-picryhydrazyl (DPPH) assay. Furthermore, treatment with mycosporine-Gly resulted in a significant decrease in COX-2 mRNA levels, which are typically increased in response to inflammation in the skin, in a concentration-dependent manner. Additionally, in the presence of MAAs, the UV-suppressed genes, procollagen C proteinase enhancer (PCOLCE) and elastin, which are related to skin aging, had increased expression levels equal to those in UV-mock treated cells. Interestingly, the increased expression of involucrin after UV exposure was suppressed by treatment with the MAAs mycosporine-Gly and shinorine, but not porphyra-334. This is the first report investigating the biological activities of microalgae-derived MAAs in human cells.

  17. Sequence-selective recognition of double-stranded RNA and enhanced cellular uptake of cationic nucleobase and backbone-modified peptide nucleic acids.

    PubMed

    Hnedzko, Dziyana; McGee, Dennis W; Karamitas, Yannis A; Rozners, Eriks

    2017-01-01

    Sequence-selective recognition of complex RNAs in live cells could find broad applications in biology, biomedical research, and biotechnology. However, specific recognition of structured RNA is challenging, and generally applicable and effective methods are lacking. Recently, we found that peptide nucleic acids (PNAs) were unusually well-suited ligands for recognition of double-stranded RNAs. Herein, we report that 2-aminopyridine (M) modified PNAs and their conjugates with lysine and arginine tripeptides form strong (Ka = 9.4 to 17 × 10(7) M(-1)) and sequence-selective triple helices with RNA hairpins at physiological pH and salt concentration. The affinity of PNA-peptide conjugates for the matched RNA hairpins was unusually high compared to the much lower affinity for DNA hairpins of the same sequence (Ka = 0.05 to 1.1 × 10(7) M(-1)). The binding of double-stranded RNA by M-modified PNA-peptide conjugates was a relatively fast process (kon = 2.9 × 10(4) M(-1) sec(-1)) compared to the notoriously slow triple helix formation by oligodeoxynucleotides (kon ∼ 10(3) M(-1) sec(-1)). M-modified PNA-peptide conjugates were not cytotoxic and were efficiently delivered in the cytosol of HEK293 cells at 10 µM. Surprisingly, M-modified PNAs without peptide conjugation were also taken up by HEK293 cells, which, to the best of our knowledge, is the first example of heterocyclic base modification that enhances the cellular uptake of PNA. Our results suggest that M-modified PNA-peptide conjugates are promising probes for sequence-selective recognition of double-stranded RNA in live cells and other biological systems.

  18. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2232 from Karijini National Park, Australia

    PubMed Central

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O’Hara, Graham; Goodwin, Lynne; Han, James; Reddy, Tatiparthi; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2013-01-01

    Burkholderia sp. strain WSM2232 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod that was trapped in 2001 from acidic soil collected from Karijini National Park (Australia) using Gastrolobium capitatum as a host. WSM2232 was effective in nitrogen fixation with G. capitatum but subsequently lost symbiotic competence during long-term storage. Here we describe the features of Burkholderia sp. strain WSM2232, together with genome sequence information and its annotation. The 7,208,311 bp standard-draft genome is arranged into 72 scaffolds of 72 contigs containing 6,322 protein-coding genes and 61 RNA-only encoding genes. The loss of symbiotic capability can now be attributed to the loss of nodulation and nitrogen fixation genes from the genome. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project. PMID:25197442

  19. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2232 from Karijini National Park, Australia.

    PubMed

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Reddy, Tatiparthi; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2232 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod that was trapped in 2001 from acidic soil collected from Karijini National Park (Australia) using Gastrolobium capitatum as a host. WSM2232 was effective in nitrogen fixation with G. capitatum but subsequently lost symbiotic competence during long-term storage. Here we describe the features of Burkholderia sp. strain WSM2232, together with genome sequence information and its annotation. The 7,208,311 bp standard-draft genome is arranged into 72 scaffolds of 72 contigs containing 6,322 protein-coding genes and 61 RNA-only encoding genes. The loss of symbiotic capability can now be attributed to the loss of nodulation and nitrogen fixation genes from the genome. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  20. An amphipathic trans-acting phosphorothioate DNA element delivers uncharged PNA and PMO nucleic acid sequences in mammalian cells

    PubMed Central

    Jain, Harsh V.; Beaucage, Serge L.

    2016-01-01

    An innovative approach to the delivery of uncharged peptide nucleic acids (PNA) and phosphorodiamidate morpholino (PMO) oligomers in mammalian cells is described and consists of extending the sequence of those oligomers with a short PNA-polyA or PMO-polyA tail. Recognition of the polyA-tailed PNA or PMO oligomers by an amphipathic trans-acting polythymidylic thiophosphate triester element (dTtaPS) results in efficient internalization of those oligomers in several cell lines. Our findings indicate that cellular uptake of the oligomers occurs through an energy-dependent mechanism and macropinocytosis appears to be the predo-minant endocytic pathway used for internalization. The functionality of the internalized oligomers is demonstrated by alternate splicing of the pre-mRNA encoding luciferase in HeLa pLuc 705 cells. Amphipathic phosphorothioate DNA elements may represent a unique class of cellular transporters for robust delivery of uncharged nucleic acid sequences in live mammalian cells. PMID:27516815

  1. Complete Genome Sequence of Moraxella osloensis Strain KMC41, a Producer of 4-Methyl-3-Hexenoic Acid, a Major Malodor Compound in Laundry

    PubMed Central

    Hirakawa, Hideki; Morita, Yuji; Tomida, Junko; Sato, Jun; Matsumura, Yuta; Mitani, Asako; Niwano, Yu; Takeuchi, Kohei; Kubota, Hiromi; Kawamura, Yoshiaki

    2016-01-01

    We report the complete genome sequence of Moraxella osloensis strain KMC41, isolated from laundry with malodor. The KMC41 genome comprises a 2,445,556-bp chromosome and three plasmids. A fatty acid desaturase and at least four β-oxidation-related genes putatively associated with 4-methyl-3-hexenoic acid generation were detected in the KMC41 chromosome. PMID:27445387

  2. Genome Sequence of the Thermophilic Strain Bacillus coagulans2-6, an Efficient Producer of High-Optical-Purity l-Lactic Acid

    PubMed Central

    Su, Fei; Yu, Bo; Sun, Jibin; Ou, Hong-Yu; Zhao, Bo; Wang, Limin; Qin, Jiayang; Tang, Hongzhi; Tao, Fei; Jarek, Michael; Scharfe, Maren; Ma, Cuiqing; Ma, Yanhe; Xu, Ping

    2011-01-01

    Bacillus coagulans2-6 is an efficient producer of lactic acid. The genome of B. coagulans2-6 has the smallest genome among the members of the genus Bacillusknown to date. The frameshift mutation at the start of the d-lactate dehydrogenase sequence might be responsible for the production of high-optical-purity l-lactic acid. PMID:21705584

  3. Genome sequence of the thermophilic strain Bacillus coagulans 2-6, an efficient producer of high-optical-purity L-lactic acid.

    PubMed

    Su, Fei; Yu, Bo; Sun, Jibin; Ou, Hong-Yu; Zhao, Bo; Wang, Limin; Qin, Jiayang; Tang, Hongzhi; Tao, Fei; Jarek, Michael; Scharfe, Maren; Ma, Cuiqing; Ma, Yanhe; Xu, Ping

    2011-09-01

    Bacillus coagulans 2-6 is an efficient producer of lactic acid. The genome of B. coagulans 2-6 has the smallest genome among the members of the genus Bacillus known to date. The frameshift mutation at the start of the d-lactate dehydrogenase sequence might be responsible for the production of high-optical-purity l-lactic acid.

  4. Snake venom toxins. The amino acid sequence of toxin Vi2, a homologue of pancreatic trypsin inhibitor, from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Strydom, D J

    1977-04-25

    The amino acid sequence of venom component Vi2, a protein of low toxicity from Dendroaspis polylepis polylepis venom was determined by automatic sequence analysis in combination with sequence studies on tryptic peptides. This protein, the most retarded fraction of this venom on a cation-exchange resin, is a homologue of bovine pancreatic trypsin inhibitor consisting of a single chain of 57 amino acid residues containing six half-cystine residues. The active site lysyl residue of bovine trypsin inhibitor is conserved in Vi2 although large differences are found in the rest of the molecule.

  5. Developmental variation and amino acid sequences of cytochromes c of the fruit fly Drosophila melanogaster and the flesh fly Boettcherisca peregrina.

    PubMed

    Inoue, S; Inoue, H; Hiroyoshi, T; Matsubara, H; Yamanaka, T

    1986-10-01

    The amino acid sequences of cytochromes c purified from the fruit fly Drosophila melanogaster and the flesh fly Boettcherisca peregrina were determined. In contrast with the case of the housefly, isocytochromes c were not detected in these flies at any developmental stage. The sequence of fruit fly cytochrome c differed from that reported previously but was identical with that predicted from the nucleotide sequence of the fruit fly cytochrome c gene (DC4) (Limbach, K.J. & Wu, R. (1985) Nucl. Acids Res. 13, 631-644). Isocytochrome c of the fruit fly, reported to be encoded by the DC3 gene, was not detected as a functional cytochrome c molecule.

  6. Detection of DBD-carbamoyl amino acids in amino acid sequence and D/L configuration determination of peptides with fluorogenic Edman reagent 7-[(N,N-dimethylamino)sulfonyl]-2,1,3-benzoxadiazol-4-yl isothiocyanate.

    PubMed

    Huang, Y; Matsunaga, H; Toriba, A; Santa, T; Fukushima, T; Imai, K

    1999-06-01

    A method for amino acid sequence and D/L configuration identification of peptides by using fluorogenic Edman reagent 7-[(N, N-dimethylamino)sulfonyl]-2,1,3-benzoxadiazol-4-yl isothiocyanate (DBD-NCS) has been developed. This method was based on the Edman degradation principle with some modifications. A peptide or protein was coupled with DBD-NCS under basic conditions and then cyclized/cleaved to produce DBD-thiazolinone (TZ) derivative by BF3, a Lewis acid, which could significantly suppress the amino acid racemization. The liberated DBD-TZ amino acid was hydrolyzed to DBD-thiocarbamoyl (TC) amino acid under a weakly acidic condition and then oxidized by NaNO2/H+ to DBD-carbamoyl (CA) amino acid which was a stable and had a strong fluorescence intensity. The individual DBD-CA amino acids were separated on a reversed-phase high-performance liquid chromatography (RP-HPLC) for amino acid sequencing and their enantiomers were resolved on a chiral stationary-phase HPLC for identifying their D/L configurations. Combination of the two HPLC systems, the amino acid sequence and D/L configuration of peptides could be determined. This method will be useful for searching D-amino-acid-containing peptides in animals.

  7. The amino acid sequence of protein AA from a burro (Equus asinus).

    PubMed

    Sletten, Knut; Johnson, Kenneth H; Westermark, Per

    2003-09-01

    The primary structure of amyloid fibril protein AA of a burro has been determined by Edman degradation. The 80 amino acid residue long protein shows strong resemblance to that of other mammalian AA-proteins and differs from equine protein AA at 5 positions: Burro/horse positions 20 (Q/N), 44 (R,Q, K/K,Q), 59 (G,L/G,A), 61 (Q/E) and 65 (N/R).

  8. Complete genome sequence of probiotic Bacillus coagulans HM-08: A potential lactic acid producer.

    PubMed

    Yao, Guoqiang; Gao, Pengfei; Zhang, Wenyi

    2016-06-20

    Bacillus coagulans HM-08 is a commercialized probiotic strain in China. Its genome contains a 3.62Mb circular chromosome with an average GC content of 46.3%. In silico analysis revealed the presence of one xyl operon as well as several other genes that are correlated to xylose utilization. The genetic information provided here may help to expand its future biotechnology potential in lactic acid production.

  9. Amino acid sequence and posttranslational modifications of human factor VII sub a from plasma and transfected baby hamster kidney cells

    SciTech Connect

    Thim, L.; Bjoern, S.; Christensen, M.; Nicolaisen, E.M.; Lund-Hansen, T.; Pedersen, A.H.; Hedner, U. )

    1988-10-04

    Blood coagulation factor VII is a vitamin K dependent glycoprotein which in its activated form, factor VII{sub a}, participates in the coagulation process by activating factor X and/or factor IX in the presence of Ca{sup 2+} and tissue factor. Three types of potential posttranslational modifications exist in the human factor VII{sub a} molecule, namely, 10 {gamma}-carboxylated, N-terminally located glutamic acid residues, 1 {beta}-hydroxylated aspartic acid residue, and 2 N-glycosylated asparagine residues. In the present study, the amino acid sequence and posttranslational modifications of recombinant factor VII{sub a} as purified from the culture medium of a transfected baby hamster kidney cell line have been compared to human plasma factor VII{sub a}. By use of HPLC, amino acid analysis, peptide mapping, and automated Edman degradation, the protein backbone of recombinant factor VII{sub a} was found to be identical with human factor VII{sub a}. Asparagine residues 145 and 322 were found to be fully N-glycosylated in human plasma factor VII{sub a}. In the recombinant factor VII{sub a}, asparagine residue 322 was fully glycosylated whereas asparagine residue 145 was only partially (approximately 66%) glycosylated. Besides minor differences in the sialic acid and fucose contents, the overall carbohydrate compositions were nearly identical in recombinant factor VII{sub a} and human plasma factor VII{sub a}. These results show that factor VII{sub a} as produced in the transfected baby hamster kidney cells is very similar to human plasma factor VII{sub a} and that this cell line thus might represent an alternative source for human factor VII{sub a}.

  10. Cloning and nucleotide sequencing of a novel 7 beta-(4-carboxybutanamido)cephalosporanic acid acylase gene of Bacillus laterosporus and its expression in Escherichia coli and Bacillus subtilis.

    PubMed

    Aramori, I; F