Science.gov

Sample records for acid sequence divergence

  1. Inferences from protein and nucleic acid sequences - Early molecular evolution, divergence of kingdoms and rates of change

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.; Barker, W. C.; Mclaughlin, P. J.

    1974-01-01

    Description of new sensitive, objective methods for establishing the probable common ancestry of very distantly related sequences and the quantitative evolutionary change which has taken place. These methods are applied to four families of proteins and nucleic acids and evolutionary trees will be derived where possible. Of the three families containing duplications of genetic material, two are nucleic acids: transfer RNA and 5S ribosomal RNA. Both of these structures are functional in the synthesis of coded proteins, and prototypes must have been present in the cell at the inception of the fundamental coding process that all living things share. There are many types of tRNA which recognize the various nucleotide triplets and the 20 amino acids. These types are thought to have arisen as a result of many gene duplications. Relationships among these types are discussed. The 5S ribosomal RNA, presently functional in both eukaryotes and prokaryotes, is very likely descended from an early form incorporating almost a complete duplication of genetic material. The amount of evolution in the various lines can again be compared. The other two families containing duplications are proteins; ferredoxin and cytochrome c.

  2. Variation in seed fatty acid composition and sequence divergence in the FAD2 gene coding region between wild and cultivated sesame.

    PubMed

    Chen, Zhenbang; Tonnis, Brandon; Morris, Brad; Wang, Richard B; Zhang, Amy L; Pinnow, David; Wang, Ming Li

    2014-12-03

    Sesame germplasm harbors genetic diversity which can be useful for sesame improvement in breeding programs. Seven accessions with different levels of oleic acid were selected from the entire USDA sesame germplasm collection (1232 accessions) and planted for morphological observation and re-examination of fatty acid composition. The coding region of the FAD2 gene for fatty acid desaturase (FAD) in these accessions was also sequenced. Cultivated sesame accessions flowered and matured earlier than the wild species. The cultivated sesame seeds contained a significantly higher percentage of oleic acid (40.4%) than the seeds of the wild species (26.1%). Nucleotide polymorphisms were identified in the FAD2 gene coding region between wild and cultivated species. Some nucleotide polymorphisms led to amino acid changes, one of which was located in the enzyme active site and may contribute to the altered fatty acid composition. Based on the morphology observation, chemical analysis, and sequence analysis, it was determined that two accessions were misnamed and need to be reclassified. The results obtained from this study are useful for sesame improvement in molecular breeding programs.

  3. Variation in Seed Fatty Acid Composition, and Sequence Divergence in the FAD2 Gene Coding Region between Wild and Cultivated Sesame

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Sesame germplasm harbors genetic diversity which can be useful for sesame improvement in breeding programs. Seven accessions with different levels of oleic acid were selected from the entire USDA sesame germplasm collection (1232 accessions) and planted for morphological observation and re-examinati...

  4. Mitochondrial DNA Sequence Divergence among Lycopersicon and Related Solanum Species

    PubMed Central

    McClean, Phillip E.; Hanson, Maureen R.

    1986-01-01

    Sequence divergence among the mitochondrial (mt) DNA of nine Lycopersicon and two closely related Solanum species was estimated using the shared fragment method. A portion of each mt genome was highlighted by probing total DNA with a series of plasmid clones containing mt-specific DNA fragments from Lycopersicon pennellii. A total of 660 fragments were compared. As calculated by the shared fragment method, sequence divergence among the mtDNAs ranged from 0.4% for the L. esculentum-L. esculentum var. cerasiforme pair to 2.7% for the Solanum rickii-L. pimpinellifolium and L. cheesmanii-L. chilense pairs. The mtDNA divergence is higher than that reported for Lycopersicon chloroplast (cp) DNA, which indicates that the DNAs of the two plant organelles are evolving at different rates. The percentages of shared fragments were used to construct a phenogram that illustrates the present-day relationships of the mtDNAs. The mtDNA-derived phenogram places L. hirsutum closer to L. esculentum than taxonomic and cpDNA comparisons. Further, the recent assignment of L. pennellii to the genus Lycopersicon is supported by the mtDNA analysis. PMID:17246320

  5. AlignMiner: a Web-based tool for detection of divergent regions in multiple sequence alignments of conserved sequences

    PubMed Central

    2010-01-01

    Background Multiple sequence alignments are used to study gene or protein function, phylogenetic relations, genome evolution hypotheses and even gene polymorphisms. Virtually without exception, all available tools focus on conserved segments or residues. Small divergent regions, however, are biologically important for specific quantitative polymerase chain reaction, genotyping, molecular markers and preparation of specific antibodies, and yet have received little attention. As a consequence, they must be selected empirically by the researcher. AlignMiner has been developed to fill this gap in bioinformatic analyses. Results AlignMiner is a Web-based application for detection of conserved and divergent regions in alignments of conserved sequences, focusing particularly on divergence. It accepts alignments (protein or nucleic acid) obtained using any of a variety of algorithms, which does not appear to have a significant impact on the final results. AlignMiner uses different scoring methods for assessing conserved/divergent regions, Entropy being the method that provides the highest number of regions with the greatest length, and Weighted being the most restrictive. Conserved/divergent regions can be generated either with respect to the consensus sequence or to one master sequence. The resulting data are presented in a graphical interface developed in AJAX, which provides remarkable user interaction capabilities. Users do not need to wait until execution is complete and can.even inspect their results on a different computer. Data can be downloaded onto a user disk, in standard formats. In silico and experimental proof-of-concept cases have shown that AlignMiner can be successfully used to designing specific polymerase chain reaction primers as well as potential epitopes for antibodies. Primer design is assisted by a module that deploys several oligonucleotide parameters for designing primers "on the fly". Conclusions AlignMiner can be used to reliably detect

  6. PhyPA: Phylogenetic method with pairwise sequence alignment outperforms likelihood methods in phylogenetics involving highly diverged sequences.

    PubMed

    Xia, Xuhua

    2016-09-01

    While pairwise sequence alignment (PSA) by dynamic programming is guaranteed to generate one of the optimal alignments, multiple sequence alignment (MSA) of highly divergent sequences often results in poorly aligned sequences, plaguing all subsequent phylogenetic analysis. One way to avoid this problem is to use only PSA to reconstruct phylogenetic trees, which can only be done with distance-based methods. I compared the accuracy of this new computational approach (named PhyPA for phylogenetics by pairwise alignment) against the maximum likelihood method using MSA (the ML+MSA approach), based on nucleotide, amino acid and codon sequences simulated with different topologies and tree lengths. I present a surprising discovery that the fast PhyPA method consistently outperforms the slow ML+MSA approach for highly diverged sequences even when all optimization options were turned on for the ML+MSA approach. Only when sequences are not highly diverged (i.e., when a reliable MSA can be obtained) does the ML+MSA approach outperforms PhyPA. The true topologies are always recovered by ML with the true alignment from the simulation. However, with MSA derived from alignment programs such as MAFFT or MUSCLE, the recovered topology consistently has higher likelihood than that for the true topology. Thus, the failure to recover the true topology by the ML+MSA is not because of insufficient search of tree space, but by the distortion of phylogenetic signal by MSA methods. I have implemented in DAMBE PhyPA and two approaches making use of multi-gene data sets to derive phylogenetic support for subtrees equivalent to resampling techniques such as bootstrapping and jackknifing.

  7. Sequence of inequalities among fuzzy mean difference divergence measures and their applications.

    PubMed

    Tomar, Vijay Prakash; Ohlan, Anshu

    2014-01-01

    This paper presents a sequence of fuzzy mean difference divergence measures. The validity of these fuzzy mean difference divergence measures is proved axiomatically. In addition, it introduces a sequence of inequalities among some of these fuzzy mean difference divergence measures. The applications of proposed fuzzy mean difference divergence measures in the context of pattern recognition have been presented using a numerical example. It is shown that the proposed fuzzy mean difference divergence measures are well suited to use with linguistic variables. Finally, on establishing inequalities, we find that our proposed measures are computationally much more efficient.

  8. Extraordinary Sequence Divergence at Tsga8, an X-linked Gene Involved in Mouse Spermiogenesis

    PubMed Central

    Good, Jeffrey M.; Vanderpool, Dan; Smith, Kimberly L.; Nachman, Michael W.

    2011-01-01

    The X chromosome plays an important role in both adaptive evolution and speciation. We used a molecular evolutionary screen of X-linked genes potentially involved in reproductive isolation in mice to identify putative targets of recurrent positive selection. We then sequenced five very rapidly evolving genes within and between several closely related species of mice in the genus Mus. All five genes were involved in male reproduction and four of the genes showed evidence of recurrent positive selection. The most remarkable evolutionary patterns were found at Testis-specific gene a8 (Tsga8), a spermatogenesis-specific gene expressed during postmeiotic chromatin condensation and nuclear transformation. Tsga8 was characterized by extremely high levels of insertion–deletion variation of an alanine-rich repetitive motif in natural populations of Mus domesticus and M. musculus, differing in length from the reference mouse genome by up to 89 amino acids (27% of the total protein length). This population-level variation was coupled with striking divergence in protein sequence and length between closely related mouse species. Although no clear orthologs had previously been described for Tsga8 in other mammalian species, we have identified a highly divergent hypothetical gene on the rat X chromosome that shares clear orthology with the 5′ and 3′ ends of Tsga8. Further inspection of this ortholog verified that it is expressed in rat testis and shares remarkable similarity with mouse Tsga8 across several general features of the protein sequence despite no conservation of nucleotide sequence across over 60% of the rat-coding domain. Overall, Tsga8 appears to be one of the most rapidly evolving genes to have been described in rodents. We discuss the potential evolutionary causes and functional implications of this extraordinary divergence and the possible contribution of Tsga8 and the other four genes we examined to reproductive isolation in mice. PMID:21186189

  9. Site-specific structural constraints on protein sequence evolutionary divergence: local packing density versus solvent exposure.

    PubMed

    Yeh, So-Wei; Liu, Jen-Wei; Yu, Sung-Huan; Shih, Chien-Hua; Hwang, Jenn-Kang; Echave, Julian

    2014-01-01

    Protein sequences evolve under selection pressures imposed by functional and biophysical requirements, resulting in site-dependent rates of amino acid substitution. Relative solvent accessibility (RSA) and local packing density (LPD) have emerged as the best candidates to quantify structural constraint. Recent research assumes that RSA is the main determinant of sequence divergence. However, it is not yet clear which is the best predictor of substitution rates. To address this issue, we compared RSA and LPD with site-specific rates of evolution for a diverse data set of enzymes. In contrast with recent studies, we found that LPD measures correlate better than RSA with evolutionary rate. Moreover, the independent contribution of RSA is minor. Taking into account that LPD is related to backbone flexibility, we put forward the possibility that the rate of evolution of a site is determined by the ease with which the backbone deforms to accommodate mutations.

  10. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  11. Molecular evolution and sequence divergence of plant chalcone synthase and chalcone synthase-Like genes.

    PubMed

    Han, Yingying; Zhao, Wenwen; Wang, Zhicui; Zhu, Jingying; Liu, Qisong

    2014-06-01

    Plant chalcone synthase (CHS) and CHS-Like (CHSL) proteins are polyketide synthases. In this study, we evaluated the molecular evolution of this gene family using representative types of CHSL genes, including stilbene synthase (STS), 2-pyrone synthase (2-PS), bibenzyl synthase (BBS), acridone synthase (ACS), biphenyl synthase (BIS), benzalacetone synthase, coumaroyl triacetic acid synthase (CTAS), and benzophenone synthase (BPS), along with their CHS homologs from the same species of both angiosperms and gymnosperms. A cDNA-based phylogeny indicated that CHSLs had diverse evolutionary patterns. STS, ACS, and 2-PS clustered with CHSs from the same species (late diverged pattern), while CTAS, BBS, BPS, and BIS were distant from their CHS homologs (early diverged pattern). The amino-acid phylogeny suggested that CHS and CHSL proteins formed clades according to enzyme function. The CHSs and CHSLs from Polygonaceae and Arachis had unique evolutionary histories. Synonymous mutation rates were lower in late diverged CHSLs than in early diverged ones, indicating that gene duplications occurred more recently in late diverged CHSLs than in early diverged ones. Relative rate tests proved that late diverged CHSLs had unequal rates to CHSs from the same species when using fatty acid synthase, which evolved from the common ancestor with the CHS superfamily, as the outgroup, while the early diverged lineages had equal rates. This indicated that late diverged CHSLs experienced more frequent mutation than early diverged CHSLs after gene duplication, allowing obtaining new functions in relatively short period of time.

  12. Characterization of the sets of divergence for sequences of operators with the localization property

    SciTech Connect

    Karagulyan, Grigorii A

    2011-01-31

    General theorems characterizing the sets of divergence for sequences of operators with the localization property are established and then used to obtain a complete characterization of the sets of divergence for Fourier series and their Cesaro means in classical orthonormal systems. Bibliography: 28 titles.

  13. Characterization of the sets of divergence for sequences of operators with the localization property

    NASA Astrophysics Data System (ADS)

    Karagulyan, Grigorii A.

    2011-01-01

    General theorems characterizing the sets of divergence for sequences of operators with the localization property are established and then used to obtain a complete characterization of the sets of divergence for Fourier series and their Cesàro means in classical orthonormal systems. Bibliography: 28 titles.

  14. Mitochondrial DNA Sequence Divergence among Meloidogyne incognita, Romanomermis culicivorax, Ascaris suum, and Caenorhabditis elegans

    PubMed Central

    Powers, T. O.; Harris, T. S.; Hyman, B. C.

    1993-01-01

    Mitochondrial DNA sequences were obtained from the NADH dehydrogenase subunit 3 (ND3), large rRNA, and cytochrome b genes from Meloidogyne incognita and Romanomermis culicivorax. Both species show considerable genetic distance within these same genes when compared with Caenorhabditis elegans or Ascaris suum, two species previously analyzed. Caenorhabditis, Ascaris, and Meloidogyne were selected as representatives of three subclasses in the nematode class Secernentea: Rhabditia, Spiruria, and Diplogasteria, respectively. Romanomermis served as a representative out-group of the class Adenophorea. The divergence between the phytoparasitic lineage (represented by Meloidogyne) and the three other species is so great that virtually every variable position in these genes appears to have accumulated multiple mutations, obscuring the phylogenetic information obtainable from these comparisons. The 39 and 42% amino acid similarity between the M. incognita and C. elegans ND3 and cytochrome b coding sequences, respectively, are approximately the same as those of C. elegans-mouse comparisons for the same genes (26 and 44%). This discovery calls into question the feasibility of employing cloned C. elegans probes as reagents to isolate phytoparasitic nematode genes. The genetic distance between the phytoparasitic nematode lineage and C. elegans markedly contrasts with the 79% amino acid similarity between C. elegans and A. suum for the same sequences. The molecular data suggest that Caenorhabditis and Ascaris belong to the same subclass. PMID:19279810

  15. Functionally conserved enhancers with divergent sequences in distant vertebrates

    DOE PAGES

    Yang, Song; Oksenberg, Nir; Takayama, Sachiko; ...

    2015-10-30

    To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.

  16. Structure and sequence divergence of two archaebacterial genes

    SciTech Connect

    Cue, D.; Beckler, G.S.; Reeve, J.N.; Konisky, J.

    1985-06-01

    The DNA sequences of a region that includes the hisA gene of two related methanogenic archaebacteria, Methanococcus voltae and Methanococcus vannielii, have been compared. Both organisms show a similar genome organization in this region, displaying three open reading frames (ORFs) separated by regions of very high A+T content. Two of the ORFs, including ORFHisA, show significant DNA sequence homology. As might be expected for organisms having a genome that is A+T-rich, there is a high preference for A and U as the third base in codons. A ribosome binding site, G-G-T-G, is located 6 base pairs preceding the ATG translation initiation sequence of both hisA genes. The sequences upstream of the two hisA genes show only limited sequence homology. The M. voltae intergenic region contains four tandemly arranged repetitions of an 11-base-pair sequence, whereas the M. vannielii sequence contains both direct and inverted repetitive sequences. Based on the degree of hisA sequence homology, the authors conclude that M. voltae and M. vannielii are less closely related taxonomically than are members of the enteric group of eubacteria.

  17. Comparative Species Divergence across Eight Triplets of Spiny Lizards (Sceloporus) Using Genomic Sequence Data

    PubMed Central

    Leaché, Adam D.; Harris, Rebecca B.; Maliska, Max E.; Linkem, Charles W.

    2013-01-01

    Species divergence is typically thought to occur in the absence of gene flow, but many empirical studies are discovering that gene flow may be more pervasive during species formation. Although many examples of divergence with gene flow have been identified, few clades have been investigated in a comparative manner, and fewer have been studied using genome-wide sequence data. We contrast species divergence genetic histories across eight triplets of North American Sceloporus lizards using a maximum likelihood implementation of the isolation–migration (IM) model. Gene flow at the time of species divergence is modeled indirectly as variation in species divergence time across the genome or explicitly using a migration rate parameter. Likelihood ratio tests (LRTs) are used to test the null model of no gene flow at speciation against these two alternative gene flow models. We also use the Akaike information criterion to rank the models. Hundreds of loci are needed for the LRTs to have statistical power, and we use genome sequencing of reduced representation libraries to obtain DNA sequence alignments at many loci (between 340 and 3,478; mean = 1,678) for each triplet. We find that current species distributions are a poor predictor of whether a species pair diverged with gene flow. Interrogating the genome using the triplet method expedites the comparative study of species divergence history and the estimation of genetic parameters associated with speciation. PMID:24259316

  18. Complete genome sequences of two highly divergent Japanese isolates of Plantago asiatica mosaic virus.

    PubMed

    Komatsu, Ken; Yamashita, Kazuo; Sugawara, Kota; Verbeek, Martin; Fujita, Naoko; Hanada, Kaoru; Uehara-Ichiki, Tamaki; Fuji, Shin-Ichi

    2017-02-01

    Plantago asiatica mosaic virus (PlAMV) is a member of the genus Potexvirus and has an exceptionally wide host range. It causes severe damage to lilies. Here we report on the complete nucleotide sequences of two new Japanese PlAMV isolates, one from the eudicot weed Viola grypoceras (PlAMV-Vi), and the other from the eudicot shrub Nandina domestica Thunb. (PlAMV-NJ). Their genomes contain five open reading frames (ORFs), which is characteristic of potexviruses. Surprisingly, the isolates showed only 76.0-78.0 % sequence identity with each other and with other PlAMV isolates, including isolates from Japanese lily and American nandina. Amino acid alignments of the replicase coding region encoded by ORF1 showed that the regions between the methyltransferase and helicase domains were less conserved than other regions, with several insertions and/or deletions. Phylogenetic analyses of the full-length nucleotide sequences revealed a moderate correlation between phylogenetic clustering and the original host plants of the PlAMV isolates. This study revealed the presence of two highly divergent PlAMV isolates in Japan.

  19. High speed nucleic acid sequencing

    SciTech Connect

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  20. To the beat of a different drum: determinants implicated in the asymmetric sequence divergence of Caenorhabditis elegans paralogs

    PubMed Central

    2013-01-01

    Background Gene duplicates often exhibit asymmetric rates of molecular evolution in their early evolutionary existence. This asymmetry in rates is thought to signify the maintenance of the ancestral function by one copy and the removal of functional constraint on the other copy, enabling it to embark on a novel evolutionary trajectory. Here I focused on a large population of evolutionarily young gene duplicates (KS ≤ 0.14) in the Caenorhabditis elegans genome in order to conduct the first combined analysis of four predictors (evolutionary age, chromosomal location, structural resemblance between duplicates, and duplication span) which may be implicated in the asymmetric sequence divergence of paralogs at the nucleotide and amino acid level. In addition, I investigate if either paralog is equally likely to embark on a trajectory of accelerated sequence evolution or whether the derived paralog is more likely to exhibit faster sequence evolution. Results Three predictors (evolutionary age of duplicates, chromosomal location and duplication span) serve as major determinants of sequence asymmetry between C. elegans paralogs. Paralogs diverge asymmetrically in sequence with increasing evolutionary age, the relocation of one copy to a different chromosome and attenuated duplication spans that likely fail to capture the entire ancestral repertoire of coding sequence and regulatory elements. Furthermore, for paralogs residing on the same chromosome, opposite transcriptional orientation and increased genomic distance do not increase sequence asymmetry between paralogs. For a subset of duplicate pairs wherein the ancestral versus derived paralog could be distinguished, the derived paralogs are more likely to evolve at accelerated rates. Conclusions This genome-wide study of evolutionarily young duplicates stemming primarily from DNA-mediated small-scale duplication events demonstrates that genomic relocation to a new chromosome has important consequences for asymmetric

  1. Simple sequence repeat variations expedite phage divergence: Mechanisms of indels and gene mutations.

    PubMed

    Lin, Tiao-Yin

    2016-07-01

    Phages are the most abundant biological entities and influence prokaryotic communities on Earth. Comparing closely related genomes sheds light on molecular events shaping phage evolution. Simple sequence repeat (SSR) variations impart over half of the genomic changes between T7M and T3, indicating an important role of SSRs in accelerating phage genetic divergence. Differences in coding and noncoding regions of phages infecting different hosts, coliphages T7M and T3, Yersinia phage ϕYeO3-12, and Salmonella phage ϕSG-JL2, frequently arise from SSR variations. Such variations modify noncoding and coding regions; the latter efficiently changes multiple amino acids, thereby hastening protein evolution. Four classes of events are found to drive SSR variations: insertion/deletion of SSR units, expansion/contraction of SSRs without alteration of genome length, changes of repeat motifs, and generation/loss of repeats. The categorization demonstrates the ways SSRs mutate in genomes during phage evolution. Indels are common constituents of genome variations and human diseases, yet, how they occur without preexisting repeat sequence is less understood. Non-repeat-unit-based misalignment-elongation (NRUBME) is proposed to be one mechanism for indels without adjacent repeats. NRUBME or consecutive NRUBME may also change repeat motifs or generate new repeats. NRUBME invoking a non-Watson-Crick base pair explains insertions that initiate mononucleotide repeats. Furthermore, NRUBME successfully interprets many inexplicable human di- to tetranucleotide repeat generations. This study provides the first evidence of SSR variations expediting phage divergence, and enables insights into the events and mechanisms of genome evolution. NRUBME allows us to emulate natural evolution to design indels for various applications.

  2. Expression Divergence Is Correlated with Sequence Evolution but Not Positive Selection in Conifers.

    PubMed

    Hodgins, Kathryn A; Yeaman, Sam; Nurkowski, Kristin A; Rieseberg, Loren H; Aitken, Sally N

    2016-06-01

    The evolutionary and genomic determinants of sequence evolution in conifers are poorly understood, and previous studies have found only limited evidence for positive selection. Using RNAseq data, we compared gene expression profiles to patterns of divergence and polymorphism in 44 seedlings of lodgepole pine (Pinus contorta) and 39 seedlings of interior spruce (Picea glauca × engelmannii) to elucidate the evolutionary forces that shape their genomes and their plastic responses to abiotic stress. We found that rapidly diverging genes tend to have greater expression divergence, lower expression levels, reduced levels of synonymous site diversity, and longer proteins than slowly diverging genes. Similar patterns were identified for the untranslated regions, but with some exceptions. We found evidence that genes with low expression levels had a larger fraction of nearly neutral sites, suggesting a primary role for negative selection in determining the association between evolutionary rate and expression level. There was limited evidence for differences in the rate of positive selection among genes with divergent versus conserved expression profiles and some evidence supporting relaxed selection in genes diverging in expression between the species. Finally, we identified a small number of genes that showed evidence of site-specific positive selection using divergence data alone. However, estimates of the proportion of sites fixed by positive selection (α) were in the range of other plant species with large effective population sizes suggesting relatively high rates of adaptive divergence among conifers.

  3. Conservation patterns in different functional sequence categoriesof divergent Drosophila species

    SciTech Connect

    Papatsenko, Dmitri; Kislyuk, Andrey; Levine, Michael; Dubchak, Inna

    2005-10-01

    We have explored the distributions of fully conservedungapped blocks in genome-wide pairwise alignments of recently completedspecies of Drosophila: D.yakuba, D.ananassae, D.pseudoobscura, D.virilisand D.mojavensis. Based on these distributions we have found that nearlyevery functional sequence category possesses its own distinctiveconservation pattern, sometimes independent of the overall sequenceconservation level. In the coding and regulatory regions, the ungappedblocks were longer than in introns, UTRs and non-functional sequences. Atthe same time, the blocks in the coding regions carried 3N+2 signaturecharacteristic to synonymic substitutions in the 3rd codon positions.Larger block sizes in transcription regulatory regions can be explainedby the presence of conserved arrays of binding sites for transcriptionfactors. We also have shown that the longest ungapped blocks, or'ultraconserved' sequences, are associated with specific gene groups,including those encoding ion channels and components of the cytoskeleton.We discussed how restrained conservation patterns may help in mappingfunctional sequence categories and improving genomeannotation.

  4. Genomic islands of divergence in hybridizing Heliconius butterflies identified by large-scale targeted sequencing

    PubMed Central

    Nadeau, Nicola J.; Whibley, Annabel; Jones, Robert T.; Davey, John W.; Dasmahapatra, Kanchon K.; Baxter, Simon W.; Quail, Michael A.; Joron, Mathieu; ffrench-Constant, Richard H.; Blaxter, Mark L.; Mallet, James; Jiggins, Chris D.

    2012-01-01

    Heliconius butterflies represent a recent radiation of species, in which wing pattern divergence has been implicated in speciation. Several loci that control wing pattern phenotypes have been mapped and two were identified through sequencing. These same gene regions play a role in adaptation across the whole Heliconius radiation. Previous studies of population genetic patterns at these regions have sequenced small amplicons. Here, we use targeted next-generation sequence capture to survey patterns of divergence across these entire regions in divergent geographical races and species of Heliconius. This technique was successful both within and between species for obtaining high coverage of almost all coding regions and sufficient coverage of non-coding regions to perform population genetic analyses. We find major peaks of elevated population differentiation between races across hybrid zones, which indicate regions under strong divergent selection. These ‘islands’ of divergence appear to be more extensive between closely related species, but there is less clear evidence for such islands between more distantly related species at two further points along the ‘speciation continuum’. We also sequence fosmid clones across these regions in different Heliconius melpomene races. We find no major structural rearrangements but many relatively large (greater than 1 kb) insertion/deletion events (including gain/loss of transposable elements) that are variable between races. PMID:22201164

  5. Patterns of Y and X chromosome DNA sequence divergence during the Felidae radiation.

    PubMed Central

    Pecon Slattery, J; O'Brien, S J

    1998-01-01

    The 37 species of modern cats have evolved from approximately eight phylogenetic lineages within the past 10 to 15 million years. The Felidae family has been described with multiple measures of morphologic and molecular evolutionary methods that serve as a framework for tracking gene divergence during brief evolutionary periods. In this report, we compare the mode and tempo of evolution of noncoding sequences of a large intron within Zfy (783 bp) and Zfx (854 bp), homologous genes located on the felid Y and X chromosomes, respectively. Zfy sequence variation evolves at about twice the rate of Zfx, and both gene intron sequences track feline hierarchical topologies accurately. As homoplasies are infrequent in patterns of nucleotide substitution, the Y chromosome sequence displays a remarkable degree of phylogenetic consistency among cat species and provides a highly informative glimpse of divergence of sex chromosome sequences in Felidae. PMID:9539439

  6. Intermediate divergence levels maximize the strength of structure-sequence correlations in enzymes and viral proteins.

    PubMed

    Jackson, Eleisha L; Shahmoradi, Amir; Spielman, Stephanie J; Jack, Benjamin R; Wilke, Claus O

    2016-07-01

    Structural properties such as solvent accessibility and contact number predict site-specific sequence variability in many proteins. However, the strength and significance of these structure-sequence relationships vary widely among different proteins, with absolute correlation strengths ranging from 0 to 0.8. In particular, two recent works have made contradictory observations. Yeh et al. (Mol. Biol. Evol. 31:135-139, 2014) found that both relative solvent accessibility (RSA) and weighted contact number (WCN) are good predictors of sitewise evolutionary rate in enzymes, with WCN clearly out-performing RSA. Shahmoradi et al. (J. Mol. Evol. 79:130-142, 2014) considered these same predictors (as well as others) in viral proteins and found much weaker correlations and no clear advantage of WCN over RSA. Because these two studies had substantial methodological differences, however, a direct comparison of their results is not possible. Here, we reanalyze the datasets of the two studies with one uniform analysis pipeline, and we find that many apparent discrepancies between the two analyses can be attributed to the extent of sequence divergence in individual alignments. Specifically, the alignments of the enzyme dataset are much more diverged than those of the virus dataset, and proteins with higher divergence exhibit, on average, stronger structure-sequence correlations. However, the highest structure-sequence correlations are observed at intermediate divergence levels, where both highly conserved and highly variable sites are present in the same alignment.

  7. Genomic divergences among cattle, dog and human estimated from large-scale alignments of genomic sequences

    PubMed Central

    Liu, George E; Matukumalli, Lakshmi K; Sonstegard, Tad S; Shade, Larry L; Van Tassell, Curtis P

    2006-01-01

    Background Approximately 11 Mb of finished high quality genomic sequences were sampled from cattle, dog and human to estimate genomic divergences and their regional variation among these lineages. Results Optimal three-way multi-species global sequence alignments for 84 cattle clones or loci (each >50 kb of genomic sequence) were constructed using the human and dog genome assemblies as references. Genomic divergences and substitution rates were examined for each clone and for various sequence classes under different functional constraints. Analysis of these alignments revealed that the overall genomic divergences are relatively constant (0.32–0.37 change/site) for pairwise comparisons among cattle, dog and human; however substitution rates vary across genomic regions and among different sequence classes. A neutral mutation rate (2.0–2.2 × 10(-9) change/site/year) was derived from ancestral repetitive sequences, whereas the substitution rate in coding sequences (1.1 × 10(-9) change/site/year) was approximately half of the overall rate (1.9–2.0 × 10(-9) change/site/year). Relative rate tests also indicated that cattle have a significantly faster rate of substitution as compared to dog and that this difference is about 6%. Conclusion This analysis provides a large-scale and unbiased assessment of genomic divergences and regional variation of substitution rates among cattle, dog and human. It is expected that these data will serve as a baseline for future mammalian molecular evolution studies. PMID:16759380

  8. Phylogenetic relationships and divergence dates of the whole mitochondrial genome sequences among three gibbon genera.

    PubMed

    Matsudaira, Kazunari; Ishida, Takafumi

    2010-05-01

    Phylogenetic relationships of gibbons (4 genera, 12 species) deduced from short/partial mitochondrial DNA sequences were with low resolution and inconsistent. Their divergence over the short period of time may require longer sequences for the higher resolution. To solve the problems, we newly sequenced the whole mitochondrial genome of Hylobates agilis, Hylobates pileatus, Nomascus sp. and Symphalangus syndactylus. A highly resolved phylogenetic tree was obtained for the mitochondrial genome in the neighbor-joining, maximum-parsimony and maximum-likelihood analyses. The divergence dates of mitochondrial genome were estimated by Bayesian method; Nomascus and the other two genera to 8.0 million years ago (MYA), Symphalangus and Hylobates to 7.0 MYA, Hylobates pileatus and the other two Hylobates species to 3.9 MYA and Hylobates lar and Hylobates agilis to 3.3 MYA. Our results support the generic rank of Nomascus, Symphalangus and Hylobates, and proposed a new scenario that the Hylobates gibbons diverged in the Pliocene. This study shows the potential of the whole mitochondrial genome analyses to reveal detailed divergence history of gibbons.

  9. Using multi-locus allelic sequence data to estimate genetic divergence among four Lilium (Liliaceae) cultivars.

    PubMed

    Shahin, Arwa; Smulders, Marinus J M; van Tuyl, Jaap M; Arens, Paul; Bakker, Freek T

    2014-01-01

    Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among gene tree conflicts). Twenty six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium.

  10. The impact of sequence divergence and DNA mismatch repair on homeologous recombination in Arabidopsis.

    PubMed

    Li, Liangliang; Jean, Martine; Belzile, François

    2006-03-01

    We examined the effects of substrate divergence and DNA mismatch repair (MMR) on recombination in Arabidopsis thaliana. Relative to the frequency observed in plants with a homologous construct (0% divergence), recombination was decreased 4.1-, 9.6-, 11.7- or 20.3-fold, respectively, in lines with constructs containing 0.5%, 2%, 4% or 9% divergence between the recombination substrates. To evaluate the contribution of the MMR system in this decrease, 12 independent reporter lines (two or three lines per reporter construct) were crossed to an AtMSH2 T-DNA insertional mutant. We examined the recombination frequency in progeny homozygous for a reporter T-DNA and homozygous either for the wild type or the mutant allele of AtMSH2. The loss of MMR activity led to a two- to ninefold increase in homeologous recombination and the size of the increase did not seem to correlate with the amount of divergence. Inversely, complementation of the insertional mutant with a wild-type cDNA of AtMSH2 reduced recombination. Our results demonstrate clearly that sequence divergence can dramatically reduce the recombination frequency in plants and that the MMR system plays a part in this decrease.

  11. Finding a (pine) needle in a haystack: chloroplast genome sequence divergence in rare and widespread pines.

    PubMed

    Whittall, J B; Syring, J; Parks, M; Buenrostro, J; Dick, C; Liston, A; Cronn, R

    2010-03-01

    Critical to conservation efforts and other investigations at low taxonomic levels, DNA sequence data offer important insights into the distinctiveness, biogeographic partitioning and evolutionary histories of species. The resolving power of DNA sequences is often limited by insufficient variability at the intraspecific level. This is particularly true of studies involving plant organelles, as the conservative mutation rate of chloroplasts and mitochondria makes it difficult to detect polymorphisms necessary to track genealogical relationships among individuals, populations and closely related taxa, through space and time. Massively parallel sequencing (MPS) makes it possible to acquire entire organelle genome sequences to identify cryptic variation that would be difficult to detect otherwise. We are using MPS to evaluate intraspecific chloroplast-level divergence across biogeographic boundaries in narrowly endemic and widespread species of Pinus. We focus on one of the world's rarest pines - Torrey pine (Pinus torreyana) - due to its conservation interest and because it provides a marked contrast to more widespread pine species. Detailed analysis of nearly 90% ( approximately 105 000 bp each) of these chloroplast genomes shows that mainland and island populations of Torrey pine differ at five sites in their plastome, with the differences fixed between populations. This is an exceptionally low level of divergence (1 polymorphism/ approximately 21 kb), yet it is comparable to intraspecific divergence present in widespread pine species and species complexes. Population-level organelle genome sequencing offers new vistas into the timing and magnitude of divergence within species, and is certain to provide greater insight into pollen dispersal, migration patterns and evolutionary dynamics in plants.

  12. Phylogenetic analyses of complete mitochondrial genome sequences suggest a basal divergence of the enigmatic rodent Anomalurus

    PubMed Central

    Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano

    2007-01-01

    Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612

  13. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.

  14. Discovery of a divergent HPIV4 from respiratory secretions using second and third generation metagenomic sequencing.

    PubMed

    Alquezar-Planas, David E; Mourier, Tobias; Bruhn, Christian A W; Hansen, Anders J; Vitcetz, Sarah Nathalie; Mørk, Søren; Gorodkin, Jan; Nielsen, Hanne Abel; Guo, Yan; Sethuraman, Anand; Paxinos, Ellen E; Shan, Tongling; Delwart, Eric L; Nielsen, Lars P

    2013-01-01

    Molecular detection of viruses has been aided by high-throughput sequencing, permitting the genomic characterization of emerging strains. In this study, we comprehensively screened 500 respiratory secretions from children with upper and/or lower respiratory tract infections for viral pathogens. The viruses detected are described, including a divergent human parainfluenza virus type 4 from GS FLX pyrosequencing of 92 specimens. Complete full-genome characterization of the virus followed, using Single Molecule, Real-Time (SMRT) sequencing. Subsequent "primer walking" combined with Sanger sequencing validated the RS platform's utility in viral sequencing from complex clinical samples. Comparative genomics reveals the divergent strain clusters with the only completely sequenced HPIV4a subtype. However, it also exhibits various structural features present in one of the HPIV4b reference strains, opening questions regarding their lifecycle and evolutionary relationships among these viruses. Clinical data from patients infected with the strain, as well as viral prevalence estimates using real-time PCR, is also described.

  15. Inferring the evolutionary histories of divergences in Hylobates and Nomascus gibbons through multilocus sequence data

    PubMed Central

    2013-01-01

    Background Gibbons (Hylobatidae) are the most diverse group of living apes. They exist as geographically-contiguous species which diverged more rapidly than did their close relatives, the great apes (Hominidae). Of the four extant gibbon genera, the evolutionary histories of two polyspecific genera, Hylobates and Nomascus, have been the particular focus of research but the DNA sequence data used was largely derived from the maternally inherited mitochondrial DNA (mtDNA) locus. Results To investigate the evolutionary relationships and divergence processes of gibbon species, particularly those of the Hylobates genus, we produced and analyzed a total of 11.5 kb DNA of sequence at 14 biparentally inherited autosomal loci. We find that on average gibbon genera have a high average sequence diversity but a lower degree of genetic differentiation as compared to great ape genera. Our multilocus species tree features H. pileatus in a basal position and a grouping of the four Sundaic island species (H. agilis, H. klossii, H. moloch and H. muelleri). We conducted pairwise comparisons based on an isolation-with-migration (IM) model and detect signals of asymmetric gene flow between H. lar and H. moloch, between H. agilis and H. muelleri, and between N. leucogenys and N. siki. Conclusions Our multilocus analyses provide inferences of gibbon evolutionary histories complementary to those based on single gene data. The results of IM analyses suggest that the divergence processes of gibbons may be accompanied by gene flow. Future studies using analyses of multi-population model with samples of known provenance for Hylobates and Nomascus species would expand the understanding of histories of gene flow during divergences for these two gibbon genera. PMID:23586586

  16. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  17. Using comparative genomic hybridization to survey genomic sequence divergence across species: a proof-of-concept from Drosophila

    PubMed Central

    2010-01-01

    Background Genome-wide analysis of sequence divergence among species offers profound insights into the evolutionary processes that shape lineages. When full-genome sequencing is not feasible for a broad comparative study, we propose the use of array-based comparative genomic hybridization (aCGH) in order to identify orthologous genes with high sequence divergence. Here we discuss experimental design, statistical power, success rate, sources of variation and potential confounding factors. We used a spotted PCR product microarray platform from Drosophila melanogaster to assess sequence divergence on a gene-by-gene basis in three fully sequenced heterologous species (D. sechellia, D. simulans, and D. yakuba). Because complete genome assemblies are available for these species this study presents a powerful test for the use of aCGH as a tool to measure sequence divergence. Results We found a consistent and linear relationship between hybridization ratio and sequence divergence of the sample to the platform species. At higher levels of sequence divergence (< 92% sequence identity to D. melanogaster) ~84% of features had significantly less hybridization to the array in the heterologous species than the platform species, and thus could be identified as "diverged". At lower levels of divergence (≥ 97% identity), only 13% of genes were identified as diverged. While ~40% of the variation in hybridization ratio can be accounted for by variation in sequence identity of the heterologous sample relative to D. melanogaster, other individual characteristics of the DNA sequences, such as GC content, also contribute to variation in hybridization ratio, as does technical variation. Conclusions Here we demonstrate that aCGH can accurately be used as a proxy to estimate genome-wide divergence, thus providing an efficient way to evaluate how evolutionary processes and genomic architecture can shape species diversity in non-model systems. Given the increased number of species for which

  18. Sequence divergence and chromosomal rearrangements during the evolution of human pseudoautosomal genes and their mouse homologs

    SciTech Connect

    Ellison, J.; Li, X.; Francke, U.

    1994-09-01

    The pseudoautosomal region (PAR) is an area of sequence identity between the X and Y chromosomes and is important for mediating X-Y pairing during male meiosis. Of the seven genes assigned to the human PAR, none of the mouse homologs have been isolated by a cross-hybridization strategy. Two of these homologs, Csfgmra and II3ra, have been isolated using a functional assay for the gene products. These genes are quite different in sequence from their human homologs, showing only 60-70% sequence similarity. The Csfgmra gene has been found to further differ from its human homolog in being isolated not on the sex chromosomes, but on a mouse autosome (chromosome 19). Using a mouse-hamster somatic cell hybrid mapping panel, we have mapped the II3ra gene to yet another mouse autosome, chromosome 14. Attempts to clone the mouse homolog of the ANT3 locus resulted in the isolation of two related genes, Ant1 and Ant2, but failed to yield the Ant3 gene. Southern blot analysis of the ANT/Ant genes showed the Ant1 and Ant2 sequences to be well-conserved among all of a dozen mammals tested. In contrast, the ANT3 gene only showed hybridization to non-rodent mammals, suggesting it is either greatly divergent or has been deleted in the rodent lineage. Similar experiments with other human pseudoautosomal probes likewise showed a lack of hybridization to rodent sequences. The results show a definite trend of extensive divergence of pseudoautosomal sequences in addition to chromosomal rearrangements involving X;autosome translocations and perhaps gene deletions. Such observations have interesting implications regarding the evolution of this important region of the sex chromosomes.

  19. Drosophila bloom helicase maintains genome integrity by inhibiting recombination between divergent DNA sequences.

    PubMed

    Kappeler, Michael; Kranz, Elisabeth; Woolcock, Katrina; Georgiev, Oleg; Schaffner, Walter

    2008-12-01

    DNA double strand breaks (DSB) can be repaired either via a sequence independent joining of DNA ends or via homologous recombination. We established a detection system in Drosophila melanogaster to investigate the impact of sequence constraints on the usage of the homology based DSB repair via single strand annealing (SSA), which leads to recombination between direct repeats with concomitant loss of one repeat copy. First of all, we find the SSA frequency to be inversely proportional to the spacer length between the repeats, for spacers up to 2.4 kb in length. We further show that SSA between divergent repeats (homeologous SSA) is suppressed in cell cultures and in vivo in a sensitive manner, recognizing sequence divergences smaller than 0.5%. Finally, we demonstrate that the suppression of homeologous SSA depends on the Bloom helicase (Blm), encoded by the Drosophila gene mus309. Suppression of homeologous recombination is a novel function of Blm in ensuring genomic integrity, not described to date in mammalian systems. Unexpectedly, distinct from its function in Saccharomyces cerevisiae, the mismatch repair factor Msh2 encoded by spel1 does not suppress homeologous SSA in Drosophila.

  20. Complete genome sequence of a divergent strain of Japanese yam mosaic virus from China

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A novel strain of Japanese yam mosaic virus (JYMV-CN) was identified in a yam plant with foliar mottle symptoms in China. The complete genomic sequence of JYMV-CN was determined. Its genomic sequence of 9701 nucleotides encodes a polyprotein of 3247 amino acids. Its organization was virtually identi...

  1. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criteria for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  2. The complete amino acid sequence of prochymosin.

    PubMed Central

    Foltmann, B; Pedersen, V B; Jacobsen, H; Kauffman, D; Wybrandt, G

    1977-01-01

    The total sequence of 365 amino acid residues in bovine prochymosin is presented. Alignment with the amino acid sequence of porcine pepsinogen shows that 204 amino acid residues are common to the two zymogens. Further comparison and alignment with the amino acid sequence of penicillopepsin shows that 66 residues are located at identical positions in all three proteases. The three enzymes belong to a large group of proteases with two aspartate residues in the active center. This group forms a family derived from one common ancestor. PMID:329280

  3. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  4. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  5. Detecting Functional Divergence after Gene Duplication through Evolutionary Changes in Posttranslational Regulatory Sequences

    PubMed Central

    Nguyen Ba, Alex N.; Strome, Bob; Hua, Jun Jie; Desmond, Jonathan; Gagnon-Arsenault, Isabelle; Weiss, Eric L.; Landry, Christian R.; Moses, Alan M.

    2014-01-01

    Gene duplication is an important evolutionary mechanism that can result in functional divergence in paralogs due to neo-functionalization or sub-functionalization. Consistent with functional divergence after gene duplication, recent studies have shown accelerated evolution in retained paralogs. However, little is known in general about the impact of this accelerated evolution on the molecular functions of retained paralogs. For example, do new functions typically involve changes in enzymatic activities, or changes in protein regulation? Here we study the evolution of posttranslational regulation by examining the evolution of important regulatory sequences (short linear motifs) in retained duplicates created by the whole-genome duplication in budding yeast. To do so, we identified short linear motifs whose evolutionary constraint has relaxed after gene duplication with a likelihood-ratio test that can account for heterogeneity in the evolutionary process by using a non-central chi-squared null distribution. We find that short linear motifs are more likely to show changes in evolutionary constraints in retained duplicates compared to single-copy genes. We examine changes in constraints on known regulatory sequences and show that for the Rck1/Rck2, Fkh1/Fkh2, Ace2/Swi5 paralogs, they are associated with previously characterized differences in posttranslational regulation. Finally, we experimentally confirm our prediction that for the Ace2/Swi5 paralogs, Cbk1 regulated localization was lost along the lineage leading to SWI5 after gene duplication. Our analysis suggests that changes in posttranslational regulation mediated by short regulatory motifs systematically contribute to functional divergence after gene duplication. PMID:25474245

  6. Detecting functional divergence after gene duplication through evolutionary changes in posttranslational regulatory sequences.

    PubMed

    Nguyen Ba, Alex N; Strome, Bob; Hua, Jun Jie; Desmond, Jonathan; Gagnon-Arsenault, Isabelle; Weiss, Eric L; Landry, Christian R; Moses, Alan M

    2014-12-01

    Gene duplication is an important evolutionary mechanism that can result in functional divergence in paralogs due to neo-functionalization or sub-functionalization. Consistent with functional divergence after gene duplication, recent studies have shown accelerated evolution in retained paralogs. However, little is known in general about the impact of this accelerated evolution on the molecular functions of retained paralogs. For example, do new functions typically involve changes in enzymatic activities, or changes in protein regulation? Here we study the evolution of posttranslational regulation by examining the evolution of important regulatory sequences (short linear motifs) in retained duplicates created by the whole-genome duplication in budding yeast. To do so, we identified short linear motifs whose evolutionary constraint has relaxed after gene duplication with a likelihood-ratio test that can account for heterogeneity in the evolutionary process by using a non-central chi-squared null distribution. We find that short linear motifs are more likely to show changes in evolutionary constraints in retained duplicates compared to single-copy genes. We examine changes in constraints on known regulatory sequences and show that for the Rck1/Rck2, Fkh1/Fkh2, Ace2/Swi5 paralogs, they are associated with previously characterized differences in posttranslational regulation. Finally, we experimentally confirm our prediction that for the Ace2/Swi5 paralogs, Cbk1 regulated localization was lost along the lineage leading to SWI5 after gene duplication. Our analysis suggests that changes in posttranslational regulation mediated by short regulatory motifs systematically contribute to functional divergence after gene duplication.

  7. Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters.

    PubMed

    Core, Leighton J; Waterfall, Joshua J; Lis, John T

    2008-12-19

    RNA polymerases are highly regulated molecular machines. We present a method (global run-on sequencing, GRO-seq) that maps the position, amount, and orientation of transcriptionally engaged RNA polymerases genome-wide. In this method, nuclear run-on RNA molecules are subjected to large-scale parallel sequencing and mapped to the genome. We show that peaks of promoter-proximal polymerase reside on approximately 30% of human genes, transcription extends beyond pre-messenger RNA 3' cleavage, and antisense transcription is prevalent. Additionally, most promoters have an engaged polymerase upstream and in an orientation opposite to the annotated gene. This divergent polymerase is associated with active genes but does not elongate effectively beyond the promoter. These results imply that the interplay between polymerases and regulators over broad promoter regions dictates the orientation and efficiency of productive transcription.

  8. Biophysical Models of Protein Evolution: Understanding the Patterns of Evolutionary Sequence Divergence.

    PubMed

    Echave, Julian; Wilke, Claus O

    2017-03-15

    For decades, rates of protein evolution have been interpreted in terms of the vague concept of functional importance. Slowly evolving proteins or sites within proteins were assumed to be more functionally important and thus subject to stronger selection pressure. More recently, biophysical models of protein evolution, which combine evolutionary theory with protein biophysics, have completely revolutionized our view of the forces that shape sequence divergence. Slowly evolving proteins have been found to evolve slowly because of selection against toxic misfolding and misinteractions, linking their rate of evolution primarily to their abundance. Similarly, most slowly evolving sites in proteins are not directly involved in function, but mutating these sites has a large impact on protein structure and stability. In this article, we review the studies in the emerging field of biophysical protein evolution that have shaped our current understanding of sequence divergence patterns. We also propose future research directions to develop this nascent field. Expected final online publication date for the Annual Review of Biophysics Volume 46 is May 20, 2017. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

  9. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

    PubMed

    Gordon, Kacy L; Arthur, Robert K; Ruvinsky, Ilya

    2015-05-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.

  10. Comparative performance of double-digest RAD sequencing across divergent arachnid lineages.

    PubMed

    Burns, Mercedes; Starrett, James; Derkarabetian, Shahan; Richart, Casey H; Cabrero, Allan; Hedin, Marshal

    2016-07-25

    Next-generation sequencing technologies now allow researchers of non-model systems to perform genome-based studies without the requirement of a (often unavailable) closely related genomic reference. We evaluated the role of restriction endonuclease (RE) selection in double-digest restriction-site-associated DNA sequencing (ddRADseq) by generating reduced representation genome-wide data using four different RE combinations. Our expectation was that RE selections targeting longer, more complex restriction sites would recover fewer loci than RE with shorter, less complex sites. We sequenced a diverse sample of non-model arachnids, including five congeneric pairs of harvestmen (Opiliones) and four pairs of spiders (Araneae). Sample pairs consisted of either conspecifics or closely related congeneric taxa, and in total 26 sample pair analyses were tested. Sequence demultiplexing, read clustering and variant calling were performed in the pyRAD program. The 6-base pair cutter EcoRI combined with methylated site-specific 4-base pair cutter MspI produced, on average, the greatest numbers of intra-individual loci and shared loci per sample pair. As expected, the number of shared loci recovered for a sample pair covaried with the degree of genetic divergence, estimated with cytochrome oxidase I sequences, although this relationship was non-linear. Our comparative results will prove useful in guiding protocol selection for ddRADseq experiments on many arachnid taxa where reference genomes, even from closely related species, are unavailable.

  11. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

    PubMed Central

    Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

    2015-01-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

  12. Substitution patterns are GC-biased in divergent sequences across the metazoans.

    PubMed

    Capra, John A; Pollard, Katherine S

    2011-01-01

    The fastest-evolving regions in the human and chimpanzee genomes show a remarkable excess of weak (A,T) to strong (G,C) nucleotide substitutions since divergence from their common ancestor. We investigated the phylogenetic extent and possible causes of this weak to strong (W → S) bias in divergent sequences (BDS) using recently sequenced genomes and recombination maps from eight trios of eukaryotic species. To quantify evidence for BDS, we inferred substitution histories using an efficient maximum likelihood approach with a context-dependent evolutionary model. We then annotated all lineage-specific substitutions in terms of W → S bias and density on the chromosomes. Finally, we used the inferred substitutions to calculate a BDS score-a log odds ratio between substitution type and density-and assessed its statistical significance with Fisher's exact test. Applying this approach, we found significant BDS in the coding and noncoding sequence of human, mouse, dog, stickleback, fruit fly, and worm. We also observed a significant lack of W → S BDS in chicken and yeast. The BDS score varies between species and across the chromosomes within each species. It is most strongly correlated with different genomic features in different species, but a strong correlation with recombination rates is found in several species. Our results demonstrate that a W → S substitution bias in fast-evolving sequences is a widespread phenomenon. The patterns of BDS observed suggest that a recombination-associated process, such as GC-biased gene conversion, is involved in the production of the bias in many species, but the strength of the BDS likely depends on many factors, including genome stability, variability in recombination rate over time and across the genome, the frequency of meiosis, and the amount of outcrossing in each species.

  13. Oil palm genome sequence reveals divergence of interfertile species in Old and New worlds.

    PubMed

    Singh, Rajinder; Ong-Abdullah, Meilina; Low, Eng-Ti Leslie; Manaf, Mohamad Arif Abdul; Rosli, Rozana; Nookiah, Rajanaidu; Ooi, Leslie Cheng-Li; Ooi, Siew-Eng; Chan, Kuang-Lim; Halim, Mohd Amin; Azizi, Norazah; Nagappan, Jayanthi; Bacher, Blaire; Lakey, Nathan; Smith, Steven W; He, Dong; Hogan, Michael; Budiman, Muhammad A; Lee, Ernest K; DeSalle, Rob; Kudrna, David; Goicoechea, Jose Luis; Wing, Rod A; Wilson, Richard K; Fulton, Robert S; Ordway, Jared M; Martienssen, Robert A; Sambanthamurthi, Ravigadevi

    2013-08-15

    Oil palm is the most productive oil-bearing crop. Although it is planted on only 5% of the total world vegetable oil acreage, palm oil accounts for 33% of vegetable oil and 45% of edible oil worldwide, but increased cultivation competes with dwindling rainforest reserves. We report the 1.8-gigabase (Gb) genome sequence of the African oil palm Elaeis guineensis, the predominant source of worldwide oil production. A total of 1.535 Gb of assembled sequence and transcriptome data from 30 tissue types were used to predict at least 34,802 genes, including oil biosynthesis genes and homologues of WRINKLED1 (WRI1), and other transcriptional regulators, which are highly expressed in the kernel. We also report the draft sequence of the South American oil palm Elaeis oleifera, which has the same number of chromosomes (2n = 32) and produces fertile interspecific hybrids with E. guineensis but seems to have diverged in the New World. Segmental duplications of chromosome arms define the palaeotetraploid origin of palm trees. The oil palm sequence enables the discovery of genes for important traits as well as somaclonal epigenetic alterations that restrict the use of clones in commercial plantings, and should therefore help to achieve sustainability for biofuels and edible oils, reducing the rainforest footprint of this tropical plantation crop.

  14. Transcriptome-wide comparison of sequence variation in divergent ecotypes of kokanee salmon

    PubMed Central

    2013-01-01

    Background High throughput next-generation sequencing technology has enabled the collection of genome-wide sequence data and revolutionized single nucleotide polymorphism (SNP) discovery in a broad range of species. When analyzed within a population genomics framework, SNP-based genotypic data may be used to investigate questions of evolutionary, ecological, and conservation significance in natural populations of non-model organisms. Kokanee salmon are recently diverged freshwater populations of sockeye salmon (Oncorhynchus nerka) that exhibit reproductive ecotypes (stream-spawning and shore-spawning) in lakes throughout western North America and northeast Asia. Current conservation and management strategies may treat these ecotypes as discrete stocks, however their recent divergence and low levels of gene flow make in-season genetic stock identification a challenge. The development of genome-wide SNP markers is an essential step towards fine-scale stock identification, and may enable a direct investigation of the genetic basis of ecotype divergence. Results We used pooled cDNA samples from both ecotypes of kokanee to generate 750 million base pairs of transcriptome sequence data. These raw data were assembled into 11,074 high coverage contigs from which we identified 32,699 novel single nucleotide polymorphisms. A subset of these putative SNPs was validated using high-resolution melt analysis and Sanger resequencing to genotype independent samples of kokanee and anadromous sockeye salmon. We also identified a number of contigs that were composed entirely of reads from a single ecotype, which may indicate regions of differential gene expression between the two reproductive ecotypes. In addition, we found some evidence for greater pathogen load among the kokanee sampled in stream-spawning habitats, suggesting a possible evolutionary advantage to shore-spawning that warrants further study. Conclusions This study provides novel genomic resources to support population

  15. Assessing divergence time of Spirulida and Sepiida (Cephalopoda) based on hemocyanin sequences.

    PubMed

    Warnke, Kerstin Martina; Meyer, Achim; Ebner, Bettina; Lieb, Bernhard

    2011-02-01

    The phylogenetic position of the mesopelagic decabrachian cephalopod Spirula is still a matter of debate. Since hemocyanin has successfully been used to calibrate a molecular clock for many molluscan species, a molecular clock was calculated based on this gene with special attention to the cephalopod genera Spirula and Sepia. The obtained partial sequence comprising ca., one third (3567 bp) of the complete gene is similar to that of Sepia officinalis. The molecular clock was calibrated using the splits of Gastropoda-Cephalopoda (ca. 550 ± 50 mya) and Heterobranchia-Vetigastropoda (ca. 380 ± 10 mya). The resulting hemocyanin-based molecular clock is stable, and the estimated divergence time of Spirulida and Sepiida, some 150 ± 30 million years ago, can be deemed reliable.

  16. Nuclear ribosomal DNA internal transcribed spacer 1 (ITS1) in Picea (Pinaceae): sequence divergence and structure.

    PubMed

    Campbell, Christopher S; Wright, Wesley A; Cox, Margaret; Vining, Thomas F; Major, C Smoot; Arsenault, Matthew P

    2005-04-01

    The nrDNA ITS1 of Picea is 2747-3271 bp, the longest known of all plants. We obtained 24 cloned ITS1 sequences from six individuals of Picea glehnii, Picea mariana, Picea orientalis, and Picea rubens. Mean sequence divergence within these individuals (0.018+/-0.009) is more than half that between the species (0.031+/-0.011) and may be maintained against concerted evolution by separation of Picea 18S-26S rDNA repeats on multiple chromosomes. Picea ITS1 contains three subrepeats with a motif (5'-GGCCACCCTAGTC) that is conserved across Pinaceae. Two subrepeats are tandem, remote from the third, and more closely related and significantly more similar to one another than either is to the third subrepeat. This correlation between similarity and proximity may be the result of subrepeat duplication or concerted evolution within rDNA repeats. In inferred secondary structures, subrepeats generally form long hairpins, with a portion of the Pinaceae conserved motif in the terminal loop, and tandem subrepeats pair with one another over most of their length. Coalescence of ITS1 sequences occurs in P. orientalis but not in the other species.

  17. Acid stress mediated adaptive divergence in ion channel function during embryogenesis in Rana arvalis

    PubMed Central

    Shu, Longfei; Laurila, Anssi; Räsänen, Katja

    2015-01-01

    Ion channels and pumps are responsible for ion flux in cells, and are key mechanisms mediating cellular function. Many environmental stressors, such as salinity and acidification, are known to severely disrupt ionic balance of organisms thereby challenging fitness of natural populations. Although ion channels can have several vital functions during early life-stages (e.g. embryogenesis), it is currently not known i) how developing embryos maintain proper intracellular conditions when exposed to environmental stress and ii) to what extent environmental stress can drive intra-specific divergence in ion channels. Here we studied the moor frog, Rana arvalis, from three divergent populations to investigate the role of different ion channels and pumps for embryonic survival under acid stress (pH 4 vs 7.5) and whether populations adapted to contrasting acidities differ in the relative role of different ion channel/pumps. We found that ion channels that mediate Ca2+ influx are essential for embryonic survival under acidic pH, and, intriguingly, that populations differ in calcium channel function. Our results suggest that adaptive divergence in embryonic acid stress tolerance of amphibians may in part be mediated by Ca2+ balance. We suggest that ion flux may mediate adaptive divergence of natural populations at early life-stages in the face of environmental stress. PMID:26381453

  18. Three multidomain esterases from the cellulolytic rumen anaerobe Ruminococcus flavefaciens 17 that carry divergent dockerin sequences.

    PubMed

    Aurilia, V; Martin, J C; McCrae, S I; Scott, K P; Rincon, M T; Flint, H J

    2000-06-01

    Three enzymes carrying esterase domains have been identified in the rumen cellulolytic anaerobe Ruminococcus flavefaciens 17. The newly characterized CesA gene product (768 amino acids) includes an N-terminal acetylesterase domain and an unidentified C-terminal domain, while the previously characterized XynB enzyme (781 amino acids) includes an internal acetylesterase domain in addition to its N-terminal xylanase catalytic domain. A third gene, xynE, is predicted to encode a multidomain enzyme of 792 amino acids including a family 11 xylanase domain and a C-terminal esterase domain. The esterase domains from CesA and XynB share significant sequence identity (44%) and belong to carbohydrate esterase family 3; both domains are shown here to be capable of deacetylating acetylated xylans, but no evidence was found for ferulic acid esterase activity. The esterase domain of XynE, however, shares 42% amino acid identity with a family 1 phenolic acid esterase domain identified from Clostridum thermocellum XynZ. XynB, XynE and CesA all contain dockerin-like regions in addition to their catalytic domains, suggesting that these enzymes form part of a cellulosome-like multienzyme complex. The dockerin sequences of CesA and XynE differ significantly from those previously described in R. flavefaciens polysaccharidases, including XynB, suggesting that they might represent distinct dockerin specificities.

  19. The Complete Chloroplast Genome Sequences of Three Veroniceae Species (Plantaginaceae): Comparative Analysis and Highly Divergent Regions

    PubMed Central

    Choi, Kyoung Su; Chung, Myong Gi; Park, SeonJoo

    2016-01-01

    Previous studies of Veronica and related genera were weakly supported by molecular and paraphyletic taxa. Here, we report the complete chloroplast genome sequence of Veronica nakaiana and the related species Veronica persica and Veronicastrum sibiricum. The chloroplast genome length of V. nakaiana, V. persica, and V. sibiricum ranged from 150,198 bp to 152,930 bp. A total of 112 genes comprising 79 protein coding genes, 29 tRNA genes, and 4 rRNA genes were observed in three chloroplast genomes. The total number of SSRs was 48, 51, and 53 in V. nakaiana, V. persica, and V. sibiricum, respectively. Two SSRs (10 bp of AT and 12 bp of AATA) were observed in the same regions (rpoC2 and ndhD) in three chloroplast genomes. A comparison of coding genes and non-coding regions between V. nakaiana and V. persica revealed divergent sites, with the greatest variation occurring petD-rpoA region. The complete chloroplast genome sequence information regarding the three Veroniceae will be helpful for elucidating Veroniceae phylogenetic relationships. PMID:27047524

  20. Impaired nuclear import of mammalian Dlx4 proteins as a consequence of rapid sequence divergence

    SciTech Connect

    Coubrough, Melissa L.; Bendall, Andrew J. . E-mail: abendall@uoguelph.ca

    2006-11-15

    Dlx genes encode a developmentally important family of transcription factors with a variety of functions and sites of action during vertebrate embryogenesis. The murine Dlx4 gene is an enigmatic member of the family; little is known about the normal developmental function(s) of Dlx4. Here, we show that Dlx4 is expressed in the murine placenta and in a trophoblast cell line where the protein localizes to both the nucleus and cytoplasm. Despite the presence of several leucine/valine-rich motifs that match known nuclear export sequences, cytoplasmic Dlx4 is not due to CRM-1-mediated nuclear export. Rather, nuclear import of Dlx4 is compromised by specific residues that flank the nuclear localization signal. One of these residues represents a novel conserved feature of the Dlx4 protein in placental mammals, and the second represents novel variation within mouse Dlx4 isoforms. Comparison of orthologous protein sequences reveals a particularly high rate of non-synonymous change in the coding regions of mammalian Dlx4 genes. Since impaired nuclear localization is unlikely to enhance the function of a nuclear transcription factor, these data point to reduced selection pressure as the basis for the rapid divergence of the Dlx4 gene within the mammalian clade.

  1. Direct sequencing of mitochondrial DNA detects highly divergent haplotypes in blue marlin (Makaira nigricans).

    PubMed

    Finnerty, J R; Block, B A

    1992-06-01

    We were able to differentiate between species of billfish (Istiophoridae family) and to detect considerable intraspecific variation in the blue marlin (Makaira nigricans) by directly sequencing a polymerase chain reaction (PCR)-amplified, 612-bp fragment of the mitochondrial cytochrome b gene. Thirteen variable nucleotide sites separated blue marlin (n = 26) into 7 genotypes. On average, these genotypes differed by 5.7 base substitutions. A smaller sample of swordfish from an equally broad geographic distribution displayed relatively little intraspecific variation, with an average of 1.3 substitutions separating different genotypes. A cladistic analysis of blue marlin cytochrome b variants indicates two major divergent evolutionary lines within the species. The frequencies of these two major evolutionary lines differ significantly between Atlantic and Pacific ocean basins. This finding is important given that the Atlantic stocks of blue marlin are considered endangered. Migration from the Pacific can help replenish the numbers of blue marlin in the Atlantic, but the loss of certain mitochondrial DNA haplotypes in the Atlantic due to overfishing probably could not be remedied by an influx of Pacific fish because of their absence in the Pacific population. Fishery management strategies should attempt to preserve the genetic diversity within the species. The detection of DNA sequence polymorphism indicates the utility of PCR technology in pelagic fishery genetics.

  2. [Molecular phylogeny and the time of divergence of minges (Chironomidae, Nematocera, Diptera) inferred from a partial nucleotide sequence of the cytochrome oxidase I gene (COI)].

    PubMed

    Demin, A G; Polukonova, N V; Miuge, N S

    2011-10-01

    This is the first study to infer the phylogenetic structure of minges of the subfamily Chironominae from the amino acid sequence of cytochrome oxidase I (COI). The subdivision of Chironominae into two tribes, Chironomini and Tanytarsini, has been confirmed. The segregation of the genera Pseudochironomus and Riethia into a separate tribe has not been confirmed. Stenochironomus gibbus forms a branch considerably deviating from the subfamily Chironominae. The genus Micropsectra is formed by a large polyphyletic cluster that also includes the genera Virgotanytharsus, Reotanytharsus, Kenopsectra, and Parapsectra. Tanytarsus is the basal genus of the tribe Tanytarsini. The times of divergence of the main taxa of Chironominae have been estimated. The calculated time of divergence of the genus Chironomus disproves the assumption that it is phylogenetically old.

  3. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  4. Analysis of cloned cDNA and genomic sequences for phytochrome: complete amino acid sequences for two gene products expressed in etiolated Avena.

    PubMed Central

    Hershey, H P; Barker, R F; Idler, K B; Lissemore, J L; Quail, P H

    1985-01-01

    Cloned cDNA and genomic sequences have been analyzed to deduce the amino acid sequence of phytochrome from etiolated Avena. Restriction endonuclease site polymorphism between clones indicates that at least four phytochrome genes are expressed in this tissue. Sequence analysis of two complete and one partial coding region shows approximately 98% homology at both the nucleotide and amino acid levels, with the majority of amino acid changes being conservative. High sequence homology is also found in the 5'-untranslated region but significant divergence occurs in the 3'-untranslated region. The phytochrome polypeptides are 1128 amino acid residues long corresponding to a molecular mass of 125 kdaltons. The known protein sequence at the chromophore attachment site occurs only once in the polypeptide, establishing that phytochrome has a single chromophore per monomer covalently linked to Cys-321. Computer analyses of the amino acid sequences have provided predictions regarding a number of structural features of the phytochrome molecule. PMID:3001642

  5. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  6. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  7. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  8. Functional Divergence of APETALA1 and FRUITFULL is due to Changes in both Regulation and Coding Sequence

    PubMed Central

    McCarthy, Elizabeth W.; Mohamed, Abeer; Litt, Amy

    2015-01-01

    Gene duplications are prevalent in plants, and functional divergence subsequent to duplication may be linked with the occurrence of novel phenotypes in plant evolution. Here, we examine the functional divergence of Arabidopsis thaliana APETALA1 (AP1) and FRUITFULL (FUL), which arose via a duplication correlated with the origin of the core eudicots. Both AP1 and FUL play a role in floral meristem identity, but AP1 is required for the formation of sepals and petals whereas FUL is involved in cauline leaf and fruit development. AP1 and FUL are expressed in mutually exclusive domains but also differ in sequence, with unique conserved motifs in the C-terminal domains of the proteins that suggest functional differentiation. To determine whether the functional divergence of AP1 and FUL is due to changes in regulation or changes in coding sequence, we performed promoter swap experiments, in which FUL was expressed in the AP1 domain in the ap1 mutant and vice versa. Our results show that FUL can partially substitute for AP1, and AP1 can partially substitute for FUL; thus, the functional divergence between AP1 and FUL is due to changes in both regulation and coding sequence. We also mutated AP1 and FUL conserved motifs to determine if they are required for protein function and tested the ability of these mutated proteins to interact in yeast with known partners. We found that these motifs appear to play at best a minor role in protein function and dimerization capability, despite being strongly conserved. Our results suggest that the functional differentiation of these two paralogous key transcriptional regulators involves both differences in regulation and in sequence; however, sequence changes in the form of unique conserved motifs do not explain the differences observed. PMID:26697035

  9. Demographic Divergence History of Pied Flycatcher and Collared Flycatcher Inferred from Whole-Genome Re-sequencing Data

    PubMed Central

    Nadachowska-Brzyska, Krystyna; Burri, Reto; Olason, Pall I.; Kawakami, Takeshi; Smeds, Linnéa; Ellegren, Hans

    2013-01-01

    Profound knowledge of demographic history is a prerequisite for the understanding and inference of processes involved in the evolution of population differentiation and speciation. Together with new coalescent-based methods, the recent availability of genome-wide data enables investigation of differentiation and divergence processes at unprecedented depth. We combined two powerful approaches, full Approximate Bayesian Computation analysis (ABC) and pairwise sequentially Markovian coalescent modeling (PSMC), to reconstruct the demographic history of the split between two avian speciation model species, the pied flycatcher and collared flycatcher. Using whole-genome re-sequencing data from 20 individuals, we investigated 15 demographic models including different levels and patterns of gene flow, and changes in effective population size over time. ABC provided high support for recent (mode 0.3 my, range <0.7 my) species divergence, declines in effective population size of both species since their initial divergence, and unidirectional recent gene flow from pied flycatcher into collared flycatcher. The estimated divergence time and population size changes, supported by PSMC results, suggest that the ancestral species persisted through one of the glacial periods of middle Pleistocene and then split into two large populations that first increased in size before going through severe bottlenecks and expanding into their current ranges. Secondary contact appears to have been established after the last glacial maximum. The severity of the bottlenecks at the last glacial maximum is indicated by the discrepancy between current effective population sizes (20,000–80,000) and census sizes (5–50 million birds) of the two species. The recent divergence time challenges the supposition that avian speciation is a relatively slow process with extended times for intrinsic postzygotic reproductive barriers to evolve. Our study emphasizes the importance of using genome-wide data to

  10. Structural basis for the divergence of substrate specificity and biological function within HAD phosphatases in lipopolysaccharide and sialic acid biosynthesis.

    PubMed

    Daughtry, Kelly D; Huang, Hua; Malashkevich, Vladimir; Patskovsky, Yury; Liu, Weifeng; Ramagopal, Udupi; Sauder, J Michael; Burley, Stephen K; Almo, Steven C; Dunaway-Mariano, Debra; Allen, Karen N

    2013-08-13

    The haloacid dehalogenase enzyme superfamily (HADSF) is largely composed of phosphatases that have been particularly successful at adaptating to novel biological functions relative to members of other phosphatase families. Herein, we examine the structural basis for the divergence of function in two bacterial homologues: 2-keto-3-deoxy-d-manno-octulosonate 8-phosphate phosphohydrolase (KDO8P phosphatase, KDO8PP) and 2-keto-3-deoxy-9-O-phosphonononic acid phosphohydrolase (KDN9P phosphatase, KDN9PP). KDO8PP and KDN9PP catalyze the final step in KDO and KDN synthesis, respectively, prior to transfer to CMP to form the activated sugar nucleotide. KDO8PP and KDN9PP orthologs derived from an evolutionarily diverse collection of bacterial species were subjected to steady-state kinetic analysis to determine their specificities toward catalyzed KDO8P and KDN9P hydrolysis. Although each enzyme was more active with its biological substrate, the degree of selectivity (as defined by the ratio of kcat/Km for KDO8P vs KDN9P) varied significantly. High-resolution X-ray structure determination of Haemophilus influenzae KDO8PP bound to KDO/VO3(-) and Bacteriodes thetaiotaomicron KDN9PP bound to KDN/VO3(-) revealed the substrate-binding residues. The structures of the KDO8PP and KDN9PP orthologs were also determined to reveal the differences in their active-site structures that underlie the variation in substrate preference. Bioinformatic analysis was carried out to define the sequence divergence among KDN9PP and KDO8PP orthologs. The KDN9PP orthologs were found to exist as single-domain proteins or fused with the pathway nucleotidyl transferases; the fusion of KDO8PP with the transferase is rare. The KDO8PP and KDN9PP orthologs share a stringently conserved Arg residue that forms a salt bridge with the substrate carboxylate group. The split of the KDN9PP lineage from the KDO8PP orthologs is easily tracked by the acquisition of a Glu/Lys pair that supports KDN9P binding. Moreover

  11. Divergent Evolution of Nuclear Localization Signal Sequences in Herpesvirus Terminase Subunits.

    PubMed

    Sankhala, Rajeshwer S; Lokareddy, Ravi K; Cingolani, Gino

    2016-05-20

    The tripartite terminase complex of herpesviruses assembles in the cytoplasm of infected cells and exploits the host nuclear import machinery to gain access to the nucleus, where capsid assembly and genome-packaging occur. Here we analyzed the structure and conservation of nuclear localization signal (NLS) sequences previously identified in herpes simplex virus 1 (HSV-1) large terminase and human cytomegalovirus (HCMV) small terminase. We found a monopartite NLS at the N terminus of large terminase, flanking the ATPase domain, that is conserved only in α-herpesviruses. In contrast, small terminase exposes a classical NLS at the far C terminus of its helical structure that is conserved only in two genera of the β-subfamily and absent in α- and γ-herpesviruses. In addition, we predicted a classical NLS in the third terminase subunit that is partially conserved among herpesviruses. Bioinformatic analysis revealed that both location and potency of NLSs in terminase subunits evolved more rapidly than the rest of the amino acid sequence despite the selective pressure to keep terminase gene products active and localized in the nucleus. We propose that swapping NLSs among terminase subunits is a regulatory mechanism that allows different herpesviruses to regulate the kinetics of terminase nuclear import, reflecting a mechanism of virus:host adaptation.

  12. Divergent functional profiles of acidic and basic phospholipases A2 in the venom of the snake Porthidium lansbergii lansbergii.

    PubMed

    Jiménez-Charris, Eliécer; Montealegre-Sánchez, Leonel; Solano-Redondo, Luis; Castro-Herrera, Fernando; Fierro-Pérez, Leonardo; Lomonte, Bruno

    2016-09-01

    The Lansberg's hognose pitviper, Porthidium lansbergii lansbergii, inhabits northern Colombia. A recent proteomic characterization of its venom (J. Proteomics [2015] 114, 287-299) revealed the presence of phospholipases A2 (PLA2) accounting for 16.2% of its proteins. The two most abundant PLA2s were biochemically and functionally characterized. Pllans-I is a basic, dimeric enzyme with a monomer mass of 14,136 Da, while Pllans-II is an acidic, monomeric enzyme of 13,901 Da. Both have Asp49 in their partial amino acid sequences and, accordingly, are catalytically active upon natural or synthetic substrates. Nevertheless, these two enzymes differ markedly in their bioactivities. Pllans-I induces myonecrosis, edema, and is lethal by intracerebro-ventricular injection in mice, as well as cytolytic and anticoagulant in vitro. In contrast, Pllans-II is devoid of these effects, except for the induction of a moderate edema. In spite of lacking myotoxicity, Pllans-II enhances the muscle damaging action of Pllans-I in vivo. Altogether, results further illustrate the divergent functional profiles of basic and acidic PLA2s in viperid venoms, and suggest that Pllans-I plays a myotoxic role in envenomings by P. l. lansbergii, whereas Pllans-II, apparently devoid of toxicity, enhances muscle damage caused by Pllans-I.

  13. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  14. Divergence of reiterated sequences in a series of genital isolates of herpes simplex virus type 1 from individual patients.

    PubMed

    Umene, Kenichi; Kawana, Takashi

    2003-04-01

    Both serotypes of herpes simplex virus (HSV), HSV-1 and HSV-2, are aetiological agents of genital herpes, although genital herpes caused by HSV-1 recurs less frequently. The HSV-1 genome contains a number of short, tandemly repeated sequences, and some reiterated sequences can serve as sensitive markers for the differentiation of HSV-1 strains. In the present study, variation in reiterations (assumed to be due to different copy numbers of tandemly repeated sequences) was examined in HSV-1 isolates from genital lesions from the same individual. Six sets (three primary-recurrence sets and three multiple-recurrence sets) of HSV-1 isolates were analysed: the primary-recurrence set consisted of two isolates (one isolated at a primary episode and the other at a recurrent episode) from the same individual; the multiple-recurrence set consisted of plural isolates from different episodes of recurrence in the same individual. Variations in length of the major DNA fragment, containing reiteration I (within the a sequence) and/or reiteration IV (within introns of genes US1 and US12), were detected between isolates of each multiple-recurrence set, but not of the primary-recurrence set. Thus, HSV-1 isolates of multiple-recurrence sets are assumed to have diverged more widely within each set than those of primary-recurrence sets, probably because of more rounds of virus DNA replication. This divergence of reiterations seems to indicate a forward step in the division of HSV-1 from a common ancestor into different lineages.

  15. Evolutionary divergence of intrinsic and trans-regulated nucleosome positioning sequences reveals plastic rules for chromatin organization

    PubMed Central

    Tsankov, Alex; Yanagisawa, Yoshimi; Rhind, Nicholas; Regev, Aviv; Rando, Oliver J.

    2011-01-01

    The packaging of eukaryotic genomes into nuclesomes plays critical roles in chromatin organization and gene regulation. Studies in Saccharomyces cerevisiae indicate that nucleosome occupancy is partially encoded by intrinsic antinucleosomal DNA sequences, such as poly(A) sequences, as well as by binding sites for trans-acting factors that can evict nucleosomes, such as Reb1 and the Rsc3/30 complex. Here, we use genome-wide nucleosome occupancy maps in 13 Ascomycota fungi to discover large-scale evolutionary reprogramming of both intrinsic and trans determinants of chromatin structure. We find that poly(G)s act as intrinsic antinucleosomal sequences, comparable to the known function of poly(A)s, but that the abundance of poly(G)s has diverged greatly between species, obscuring their antinucleosomal effect in low-poly(G) species such as S. cerevisiae. We also develop a computational method that uses nucleosome occupancy maps for discovering trans-acting general regulatory factor (GRF) binding sites. Our approach reveals that the specific sequences bound by GRFs have diverged substantially across evolution, corresponding to a number of major evolutionary transitions in the repertoire of GRFs. We experimentally validate a proposed evolutionary transition from Cbf1 as a major GRF in pre-whole-genome duplication (WGD) yeasts to Reb1 in post-WGD yeasts. We further show that the mating type switch-activating protein Sap1 is a GRF in S. pombe, demonstrating the general applicability of our approach. Our results reveal that the underlying mechanisms that determine in vivo chromatin organization have diverged and that comparative genomics can help discover new determinants of chromatin organization. PMID:21914852

  16. Characterization and phylogenetic analysis of α-gliadin gene sequences reveals significant genomic divergence in Triticeae species.

    PubMed

    Li, Guang-Rong; Lang, Tao; Yang, En-Nian; Liu, Cheng; Yang, Zu-Jun

    2014-12-01

    Although the unique properties of wheat α-gliadin gene family are well characterized, little is known about the evolution and genomic divergence of α-gliadin gene family within the Triticeae. We isolated a total of 203 α-gliadin gene sequences from 11 representative diploid and polyploid Triticeae species, and found 108 sequences putatively functional. Our results indicate that α-gliadin genes may have possibly originated from wild Secale species, where the sequences contain the shortest repetitive domains and display minimum variation. A miniature inverted-repeat transposable element insertion is reported for the first time in α-gliadin gene sequence of Thinopyrum intermedium in this study, indicating that the transposable element might have contributed to the diversification of α-gliadin genes family among Triticeae genomes. The phylogenetic analyses revealed that the α-gliadin gene sequences of Dasypyrum, Australopyrum, Lophopyrum, Eremopyrum and Pseudoroengeria species have amplified several times. A search for four typical toxic epitopes for celiac disease within the Triticeae α-gliadin gene sequences showed that the α-gliadins of wild Secale, Australopyrum and Agropyron genomes lack all four epitopes, while other Triticeae species have accumulated these epitopes, suggesting that the evolution of these toxic epitopes sequences occurred during the course of speciation, domestication or polyploidization of Triticeae.

  17. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  18. Chironomus riparius (Diptera) genome sequencing reveals the impact of minisatellite transposable elements on population divergence.

    PubMed

    Oppold, Ann-Marie; Schmidt, Hanno; Rose, Marcel; Hellmann, Sören Lukas; Dolze, Florian; Ripp, Fabian; Weich, Bettina; Schmidt-Ott, Urs; Schmidt, Erwin; Kofler, Robert; Hankeln, Thomas; Pfenninger, Markus

    2017-03-18

    Active transposable elements (TEs) may result in divergent genomic insertion and abundance patterns among conspecific populations. Upon secondary contact, such divergent genetic backgrounds can theoretically give rise to classical Dobzhansky-Muller incompatibilities (DMI), thus contributing to the evolution of endogenous genetic barriers and eventually cause population divergence. We investigated differential TE abundance among conspecific populations of the non-biting midge Chironomus riparius and evaluated their potential role in causing endogenous genetic incompatibilities between these populations. We focussed on a Chironomus-specific TE, the minisatellite-like Cla-element, whose activity is associated with speciation in the genus. Using a newly generated and annotated draft genome for a genomic study with five natural C. riparius populations, we found highly population-specific TE insertion patterns with many private insertions. A significant correlation of the pairwise FST estimated from genome-wide single nucleotide polymorphisms (SNPs) and the FST estimated from TEs, is consistent with drift as the major force driving TE population differentiation. However, the significantly higher Cla-element FST level due to a high proportion of differentially fixed Cla-element insertions also indicates selection against segregating (i.e. heterozygous) insertions. With reciprocal crossing experiments and fluorescent in-situ hybridisation of Cla-elements to polytene chromosomes, we documented phenotypic effects on female fertility and chromosomal mispairings. We propose that the inferred negative selection on heterozygous Cla-element insertions may cause endogenous genetic barriers and therefore acts as DMI among C. riparius populations. The intrinsic genomic turnover exerted by TEs may thus have a direct impact on population divergence that is operationally different from drift and local adaptation. This article is protected by copyright. All rights reserved.

  19. Genetic Divergence between Freshwater and Marine Morphs of Alewife (Alosa pseudoharengus): A ‘Next-Generation’ Sequencing Analysis

    PubMed Central

    Czesny, Sergiusz; Epifanio, John; Michalak, Pawel

    2012-01-01

    Alewife Alosa pseudoharengus, a small clupeid fish native to Atlantic Ocean, has recently (∼150 years ago) invaded the North American Great Lakes and despite challenges of freshwater environment its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, or fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. A question arises to what extent these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and Atlantic Ocean. Population specific single nucleotide polymorphisms were rare but interestingly occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes. PMID:22438868

  20. Genetic divergence between freshwater and marine morphs of alewife (Alosa pseudoharengus): a 'next-generation' sequencing analysis.

    PubMed

    Czesny, Sergiusz; Epifanio, John; Michalak, Pawel

    2012-01-01

    Alewife Alosa pseudoharengus, a small clupeid fish native to Atlantic Ocean, has recently (∼150 years ago) invaded the North American Great Lakes and despite challenges of freshwater environment its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, or fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. A question arises to what extent these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and Atlantic Ocean. Population specific single nucleotide polymorphisms were rare but interestingly occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes.

  1. Phylogenetics of neotropical Platymiscium (Leguminosae: Dalbergieae): systematics, divergence times, and biogeography inferred from nuclear ribosomal and plastid DNA sequence data.

    PubMed

    Saslis-Lagoudakis, Charilaos; Chase, Mark W; Robinson, Daniel N; Russell, Stephen J; Klitgaard, Bente B

    2008-10-01

    Platymiscium is a neotropical legume genus of forest trees in the Pterocarpus clade of the pantropical "dalbergioid" clade. It comprises 19 species (29 taxa), distributed from Mexico to southern Brazil. This study presents a molecular phylogenetic analysis of Platymiscium and allies inferred from nuclear ribosomal (nrITS) and plastid (trnL, trnL-F and matK) DNA sequence data using parsimony and Bayesian methods. Divergence times are estimated using a Bayesian method assuming a relaxed molecular clock (multidivtime). Within the Pterocarpus clade, new sister relationships are recovered: Pterocarpus + Etaballia, Inocarpus + Tipuana and Paramachaerium + Maraniona. Our results support monophyly of Platymiscium, which is resolved into three major clades, each with distinct geographic ranges and ecological preferences. Diversification in Platymiscium has been driven by habitat fragmentation, invasion of novel geographic regions, and ecological diversification, revealing general patterns of diversification in the neotropics. We hypothesize that Platymiscium arose in dry habitats of South America and radiated northward. The Amazon basin was invaded twice both within the last 5.6 My and Central America twice before the closure of the Isthmus of Panama. Divergence times of the P. pubescens complex, restricted to seasonally dry tropical forests of South America, support pre-Pleistocene divergence in this biome.

  2. Mining environmental high-throughput sequence data sets to identify divergent amplicon clusters for phylogenetic reconstruction and morphotype visualization.

    PubMed

    Gimmler, Anna; Stoeck, Thorsten

    2015-08-01

    Environmental high-throughput sequencing (envHTS) is a very powerful tool, which in protistan ecology is predominantly used for the exploration of diversity and its geographic and local patterns. We here used a pyrosequenced V4-SSU rDNA data set from a solar saltern pond as test case to exploit such massive protistan amplicon data sets beyond this descriptive purpose. Therefore, we combined a Swarm-based blastn network including 11 579 ciliate V4 amplicons to identify divergent amplicon clusters with targeted polymerase chain reaction (PCR) primer design for full-length small subunit of the ribosomal DNA retrieval and probe design for fluorescence in situ hybridization (FISH). This powerful strategy allows to benefit from envHTS data sets to (i) reveal the phylogenetic position of the taxon behind divergent amplicons; (ii) improve phylogenetic resolution and evolutionary history of specific taxon groups; (iii) solidly assess an amplicons (species') degree of similarity to its closest described relative; (iv) visualize the morphotype behind a divergent amplicons cluster; (v) rapidly FISH screen many environmental samples for geographic/habitat distribution and abundances of the respective organism and (vi) to monitor the success of enrichment strategies in live samples for cultivation and isolation of the respective organisms.

  3. More reliable estimates of divergence times in Pan using complete mtDNA sequences and accounting for population structure.

    PubMed

    Stone, Anne C; Battistuzzi, Fabia U; Kubatko, Laura S; Perry, George H; Trudeau, Evan; Lin, Hsiuman; Kumar, Sudhir

    2010-10-27

    Here, we report the sequencing and analysis of eight complete mitochondrial genomes of chimpanzees (Pan troglodytes) from each of the three established subspecies (P. t. troglodytes, P. t. schweinfurthii and P. t. verus) and the proposed fourth subspecies (P. t. ellioti). Our population genetic analyses are consistent with neutral patterns of evolution that have been shaped by demography. The high levels of mtDNA diversity in western chimpanzees are unlike those seen at nuclear loci, which may reflect a demographic history of greater female to male effective population sizes possibly owing to the characteristics of the founding population. By using relaxed-clock methods, we have inferred a timetree of chimpanzee species and subspecies. The absolute divergence times vary based on the methods and calibration used, but relative divergence times show extensive uniformity. Overall, mtDNA produces consistently older times than those known from nuclear markers, a discrepancy that is reduced significantly by explicitly accounting for chimpanzee population structures in time estimation. Assuming the human-chimpanzee split to be between 7 and 5 Ma, chimpanzee time estimates are 2.1-1.5, 1.1-0.76 and 0.25-0.18 Ma for the chimpanzee/bonobo, western/(eastern + central) and eastern/central chimpanzee divergences, respectively.

  4. Los Alamos sequence analysis package for nucleic acids and proteins.

    PubMed Central

    Kanehisa, M I

    1982-01-01

    An interactive system for computer analysis of nucleic acid and protein sequences has been developed for the Los Alamos DNA Sequence Database. It provides a convenient way to search or verify various sequence features, e.g., restriction enzyme sites, protein coding frames, and properties of coded proteins. Further, the comprehensive analysis package on a large-scale database can be used for comparative studies on sequence and structural homologies in order to find unnoted information stored in nucleic acid sequences. PMID:6174934

  5. Genome-wide analysis reveals diverged patterns of codon bias, gene expression, and rates of sequence evolution in picea gene families.

    PubMed

    De La Torre, Amanda R; Lin, Yao-Cheng; Van de Peer, Yves; Ingvarsson, Pär K

    2015-03-05

    The recent sequencing of several gymnosperm genomes has greatly facilitated studying the evolution of their genes and gene families. In this study, we examine the evidence for expression-mediated selection in the first two fully sequenced representatives of the gymnosperm plant clade (Picea abies and Picea glauca). We use genome-wide estimates of gene expression (>50,000 expressed genes) to study the relationship between gene expression, codon bias, rates of sequence divergence, protein length, and gene duplication. We found that gene expression is correlated with rates of sequence divergence and codon bias, suggesting that natural selection is acting on Picea protein-coding genes for translational efficiency. Gene expression, rates of sequence divergence, and codon bias are correlated with the size of gene families, with large multicopy gene families having, on average, a lower expression level and breadth, lower codon bias, and higher rates of sequence divergence than single-copy gene families. Tissue-specific patterns of gene expression were more common in large gene families with large gene expression divergence than in single-copy families. Recent family expansions combined with large gene expression variation in paralogs and increased rates of sequence evolution suggest that some Picea gene families are rapidly evolving to cope with biotic and abiotic stress. Our study highlights the importance of gene expression and natural selection in shaping the evolution of protein-coding genes in Picea species, and sets the ground for further studies investigating the evolution of individual gene families in gymnosperms.

  6. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  7. The Complete Sequence of the Acacia ligulata Chloroplast Genome Reveals a Highly Divergent clpP1 Gene.

    PubMed

    Williams, Anna V; Boykin, Laura M; Howell, Katharine A; Nevill, Paul G; Small, Ian

    2015-01-01

    Legumes are a highly diverse angiosperm family that include many agriculturally important species. To date, 21 complete chloroplast genomes have been sequenced from legume crops confined to the Papilionoideae subfamily. Here we report the first chloroplast genome from the Mimosoideae, Acacia ligulata, and compare it to the previously sequenced legume genomes. The A. ligulata chloroplast genome is 174,233 bp in size, comprising inverted repeats of 38,225 bp and single-copy regions of 92,798 bp and 4,985 bp [corrected]. Acacia ligulata lacks the inversion present in many of the Papilionoideae, but is not otherwise significantly different in terms of gene and repeat content. The key feature is its highly divergent clpP1 gene, normally considered essential in chloroplast genomes. In A. ligulata, although transcribed and spliced, it probably encodes a catalytically inactive protein. This study provides a significant resource for further genetic research into Acacia and the Mimosoideae. The divergent clpP1 gene suggests that Acacia will provide an interesting source of information on the evolution and functional diversity of the chloroplast Clp protease complex.

  8. The Complete Sequence of the Acacia ligulata Chloroplast Genome Reveals a Highly Divergent clpP1 Gene

    PubMed Central

    Williams, Anna V.; Boykin, Laura M.; Howell, Katharine A.; Nevill, Paul G.; Small, Ian

    2015-01-01

    Legumes are a highly diverse angiosperm family that include many agriculturally important species. To date, 21 complete chloroplast genomes have been sequenced from legume crops confined to the Papilionoideae subfamily. Here we report the first chloroplast genome from the Mimosoideae, Acacia ligulata, and compare it to the previously sequenced legume genomes. The A. ligulata chloroplast genome is 158,724 bp in size, comprising inverted repeats of 25,925 bp and single-copy regions of 88,576 bp and 18,298 bp. Acacia ligulata lacks the inversion present in many of the Papilionoideae, but is not otherwise significantly different in terms of gene and repeat content. The key feature is its highly divergent clpP1 gene, normally considered essential in chloroplast genomes. In A. ligulata, although transcribed and spliced, it probably encodes a catalytically inactive protein. This study provides a significant resource for further genetic research into Acacia and the Mimosoideae. The divergent clpP1 gene suggests that Acacia will provide an interesting source of information on the evolution and functional diversity of the chloroplast Clp protease complex. PMID:25955637

  9. A Delicate Balance Between Repair and Replication Factors Regulates Recombination Between Divergent DNA Sequences in Saccharomyces cerevisiae

    PubMed Central

    Chakraborty, Ujani; George, Carolyn M.; Lyndaker, Amy M.; Alani, Eric

    2016-01-01

    Single-strand annealing (SSA) is an important homologous recombination mechanism that repairs DNA double strand breaks (DSBs) occurring between closely spaced repeat sequences. During SSA, the DSB is acted upon by exonucleases to reveal complementary sequences that anneal and are then repaired through tail clipping, DNA synthesis, and ligation steps. In baker’s yeast, the Msh DNA mismatch recognition complex and the Sgs1 helicase act to suppress SSA between divergent sequences by binding to mismatches present in heteroduplex DNA intermediates and triggering a DNA unwinding mechanism known as heteroduplex rejection. Using baker’s yeast as a model, we have identified new factors and regulatory steps in heteroduplex rejection during SSA. First we showed that Top3-Rmi1, a topoisomerase complex that interacts with Sgs1, is required for heteroduplex rejection. Second, we found that the replication processivity clamp proliferating cell nuclear antigen (PCNA) is dispensable for heteroduplex rejection, but is important for repairing mismatches formed during SSA. Third, we showed that modest overexpression of Msh6 results in a significant increase in heteroduplex rejection; this increase is due to a compromise in Msh2-Msh3 function required for the clipping of 3′ tails. Thus 3′ tail clipping during SSA is a critical regulatory step in the repair vs. rejection decision; rejection is favored before the 3′ tails are clipped. Unexpectedly, Msh6 overexpression, through interactions with PCNA, disrupted heteroduplex rejection between divergent sequences in another recombination substrate. These observations illustrate the delicate balance that exists between repair and replication factors to optimize genome stability. PMID:26680658

  10. Divergent modulation of swine ileal microbiota by formic acid and methionine hydroxy analogue-free acid.

    PubMed

    Apajalahti, J; Rademacher, M; Htoo, J K; Redshaw, M; Kettunen, A

    2009-06-01

    Management of intestinal microbiota of monogastric animals has increased in importance since the ban of growth promoting antibiotics in many countries. Organic acids have been used as alternatives to antibiotics by many feed manufacturers. Regardless of the wide usage, the effect, dose response and mode of action of acids on intestinal microbes is poorly understood. In this study, we investigated the effects of dietary supplementation of three commonly used products, namely formic acid (FA) (90%), dl-methionine (DLM) (99%) and liquid methionine hydroxy analogue-free acid (88%), on ileal microbiota of pigs. Laboratory simulation system, mimicking swine ileum, was used to study the products at various concentrations and combinations. Furthermore, selected combinations were tested in a piglet trial to confirm the findings made in in vitro studies. FA turned out to have a dual effect on ileal microbiota. At concentrations below 0.5%, it significantly stimulated bacteria, but at higher inclusion rates it was highly inhibitory. This finding, which was consistent in in vitro and in vivo studies, implies that reducing the dose of FA does not lead to a diluted inhibitory effect, but in fact, an opposite, stimulatory effect on intestinal microbiota. It is highly important that feed compounders acknowledge this finding. Unlike FA, the inhibitory effect of methionine hydroxy analogue on ileal bacteria was linearly dose dependent and significant at inclusion levels above 0.2%, in vitro. Partial replacement of methionine hydroxy analogue by FA, or FA by methionine hydroxy analogue, led to an unpredictable outcome due to the dual effects of FA; e.g., a minor inclusion of added FA changed the inhibitory effect of methionine hydroxy analogue into microbial stimulation by FA. Inhibition of ileal microbiota by methionine hydroxy analogue was detected only in in vitro studies, suggesting that intact methionine hydroxy analogue may not have reached the ileum, in live animals. Therefore

  11. Complete Chloroplast Genome Sequence of the Early Diverging Green Alga Palmophyllum crassum

    PubMed Central

    Furukawa, Ryo; Kunugi, Motoshi; Ihara, Kunio; Tanaka, Ayumi

    2017-01-01

    ABSTRACT Palmophyllum crassum is a little-known green alga, with a unique evolutionary position and distinctive photosynthetic features. Here, we present the complete chloroplast genome sequence of Palmophyllum crassum. PMID:28280029

  12. Sample sequencing of vascular plants demonstrates widespread conservation and divergence of microRNAs.

    PubMed

    Chávez Montes, Ricardo A; de Fátima Rosas-Cárdenas, Flor; De Paoli, Emanuele; Accerbi, Monica; Rymarquis, Linda A; Mahalingam, Gayathri; Marsch-Martínez, Nayelli; Meyers, Blake C; Green, Pamela J; de Folter, Stefan

    2014-04-23

    Small RNAs are pivotal regulators of gene expression that guide transcriptional and post-transcriptional silencing mechanisms in eukaryotes, including plants. Here we report a comprehensive atlas of sRNA and miRNA from 3 species of algae and 31 representative species across vascular plants, including non-model plants. We sequence and quantify sRNAs from 99 different tissues or treatments across species, resulting in a data set of over 132 million distinct sequences. Using miRBase mature sequences as a reference, we identify the miRNA sequences present in these libraries. We apply diverse profiling methods to examine critical sRNA and miRNA features, such as size distribution, tissue-specific regulation and sequence conservation between species, as well as to predict putative new miRNA sequences. We also develop database resources, computational analysis tools and a dedicated website, http://smallrna.udel.edu/. This study provides new insights on plant sRNAs and miRNAs, and a foundation for future studies.

  13. Complete Genome Sequences of Three Historically Important, Spatiotemporally Distinct, and Genetically Divergent Strains of Zika Virus: MR-766, P6-740, and PRVABC-59

    PubMed Central

    Yun, Sang-Im; Song, Byung-Hak; Frank, Jordan C.; Julander, Justin G.; Polejaeva, Irina A.; Davies, Christopher J.; White, Kenneth L.

    2016-01-01

    Here, we report the 10,807-nucleotide-long consensus RNA genome sequences of three spatiotemporally distinct and genetically divergent Zika virus strains, with the functionality of their genomic sequences substantiated by reverse genetics: MR-766 (African lineage, Uganda, 1947), P6-740 (Asian lineage, Malaysia, 1966), and PRVABC-59 (Asian lineage-derived American strain, Puerto Rico, 2015). PMID:27540058

  14. Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing.

    PubMed

    Yang, Hua; Wei, Chao-Ling; Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun

    2016-01-01

    Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inferrence and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putatitve artificial selection in tea plants.

  15. Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing

    PubMed Central

    Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun

    2016-01-01

    Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inferrence and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putatitve artificial selection in tea plants. PMID:26962860

  16. Diverging Narratives: Evaluating the Uses of the Ideal-Typical Sequence of Transport Network Development

    ERIC Educational Resources Information Center

    Weber, Joe

    2004-01-01

    The development of new transport systems has been an important and highly visible component of economic development and spatial reorganization in the past two centuries. The Ideal-Typical Sequence of network development has been a widely used model of transport development. This paper shows that this model has been used in several different ways,…

  17. Divergent spatial regulation of duplicated fatty acid-binding protein (fabp) genes in rainbow trout (Oncorhynchus mykiss).

    PubMed

    Bayır, Mehtap; Bayır, Abdulkadir; Wright, Jonathan M

    2015-06-01

    The increased use of plant oil as a dietary supplement with the resultant high dietary lipid loads challenges the lipid transport, metabolism and storage mechanisms in economically important aquaculture species, such as rainbow trout. Fatty acid-binding proteins (Fabp), ubiquitous in tissues highly active in fatty acid metabolism, participate in lipid uptake and transport, and overall lipid homeostasis. In the present study, searches of nucleotide sequence databases identified mRNA transcripts coded by 14 different fatty acid-binding protein (fabp) genes in rainbow trout (Oncorhynchus mykiss), which include the complete minimal suite of seven distinct fabp genes (fabp1, 2, 3, 6, 7, 10 and 11) discovered thus far in teleost fishes. Phylogenetic analyses suggest that many of these extant fabp genes in rainbow trout exist as duplicates, which putatively arose owing to the teleost-specific whole genome duplication (WGD); three pairs of duplicated fabp genes (fabp2a.1/fabp2a.2, fabp7b.1/fabp7b.2 and fabp10a.1/fabp10a.2) most likely were generated by the salmonid-specific WGD subsequent to the teleost-specific WGD; and fabp3 and fabp6 exist as single copy genes in the rainbow trout genome. Assay of the steady-state levels of fabp gene transcripts by RT-qPCR revealed: (1) steady-state transcript levels differ substantially between fabp genes and, in some instances, by as much as 30×10(4)-fold; (2) some fabp transcripts are widely distributed in many tissues, whereas others are restricted to one or a few tissues; and (3) divergence of regulatory mechanisms that control spatial transcription of duplicated fabp genes in rainbow trout appears related to length of time since their duplication. The suite of fabp genes described here provides the foundation to investigate the role(s) of fatty acid-binding proteins in the uptake, mobilization and storage of fatty acids in cultured fish fed diets differing in lipid content, especially the use of plant oil as a dietary supplement

  18. Sequence divergence and diversity suggests ongoing functional diversification of vertebrate NAD metabolism.

    PubMed

    Gossmann, Toni I; Ziegler, Mathias

    2014-11-01

    NAD is not only an important cofactor in redox reactions but has also received attention in recent years because of its physiological importance in metabolic regulation, DNA repair and signaling. In contrast to the redox reactions, these regulatory processes involve degradation of NAD and therefore necessitate a constant replenishment of its cellular pool. NAD biosynthetic enzymes are common to almost all species in all clades, but the number of NAD degrading enzymes varies substantially across taxa. In particular, vertebrates, including humans, have a manifold of NAD degrading enzymes which require a high turnover of NAD. As there is currently a lack of a systematic study of how natural selection has shaped enzymes involved in NAD metabolism we conducted a comprehensive evolutionary analysis based on intraspecific variation and interspecific divergence. We compare NAD biosynthetic and degrading enzymes in four eukaryotic model species and subsequently focus on human NAD metabolic enzymes and their orthologs in other vertebrates. We find that the majority of enzymes involved in NAD metabolism are subject to varying levels of purifying selection. While NAD biosynthetic enzymes appear to experience a rather high level of evolutionary constraint, there is evidence for positive selection among enzymes mediating NAD-dependent signaling. This is particularly evident for members of the PARP family, a diverse protein family involved in DNA damage repair and programmed cell death. Based on haplotype information and substitution rate analysis we pinpoint sites that are potential targets of positive selection. We also link our findings to a three dimensional structure, which suggests that positive selection occurs in domains responsible for DNA binding and polymerization rather than the NAD catalytic domain. Taken together, our results indicate that vertebrate NAD metabolism is still undergoing functional diversification.

  19. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  20. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  1. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  2. Multiple nucleic acid cleavage modes in divergent type III CRISPR systems

    PubMed Central

    Zhang, Jing; Graham, Shirley; Tello, Agnes; Liu, Huanting; White, Malcolm F.

    2016-01-01

    CRISPR-Cas is an RNA-guided adaptive immune system that protects bacteria and archaea from invading nucleic acids. Type III systems (Cmr, Csm) have been shown to cleave RNA targets in vitro and some are capable of transcription-dependent DNA targeting. The crenarchaeon Sulfolobus solfataricus has two divergent subtypes of the type III system (Sso-IIID and a Cmr7-containing variant of Sso-IIIB). Here, we report that both the Sso-IIID and Sso-IIIB complexes cleave cognate RNA targets with a ruler mechanism and 6 or 12 nt spacing that relates to the organization of the Cas7 backbone. This backbone-mediated cleavage activity thus appears universal for the type III systems. The Sso-IIIB complex is also known to possess a distinct ‘UA’ cleavage mode. The predominant activity observed in vitro depends on the relative molar concentration of protein and target RNA. The Sso-IIID complex can cleave plasmid DNA targets in vitro, generating linear DNA products with an activity that is dependent on both the cyclase and HD nuclease domains of the Cas10 subunit, suggesting a role for both nuclease active sites in the degradation of double-stranded DNA targets. PMID:26801642

  3. Abscisic acid mediates a divergence in the drought response of two conifers.

    PubMed

    Brodribb, Timothy J; McAdam, Scott A M

    2013-07-01

    During water stress, stomatal closure occurs as water tension and levels of abscisic acid (ABA) increase in the leaf, but the interaction between these two drivers of stomatal aperture is poorly understood. We investigate the dynamics of water potential, ABA, and stomatal conductance during the imposition of water stress on two drought-tolerant conifer species with contrasting stomatal behavior. Rapid rehydration of excised shoots was used as a means of differentiating the direct influences of ABA and water potential on stomatal closure. Pinus radiata (Pinaceae) was found to exhibit ABA-driven stomatal closure during water stress, resulting in strongly isohydric regulation of water loss. By contrast, stomatal closure in Callitris rhomboidea (Cupressaceae) was initiated by elevated foliar ABA, but sustained water stress saw a marked decline in ABA levels and a shift to water potential-driven stomatal closure. The transition from ABA to water potential as the primary driver of stomatal aperture allowed C. rhomboidea to rapidly recover gas exchange after water-stressed plants were rewatered, and was associated with a strongly anisohydric regulation of water loss. These two contrasting mechanisms of stomatal regulation function in combination with xylem vulnerability to produce highly divergent strategies of water management. Species-specific ABA dynamics are proposed as a central component of drought survival and ecology.

  4. Abscisic Acid Mediates a Divergence in the Drought Response of Two Conifers1[W][OA

    PubMed Central

    Brodribb, Timothy J.; McAdam, Scott A.M.

    2013-01-01

    During water stress, stomatal closure occurs as water tension and levels of abscisic acid (ABA) increase in the leaf, but the interaction between these two drivers of stomatal aperture is poorly understood. We investigate the dynamics of water potential, ABA, and stomatal conductance during the imposition of water stress on two drought-tolerant conifer species with contrasting stomatal behavior. Rapid rehydration of excised shoots was used as a means of differentiating the direct influences of ABA and water potential on stomatal closure. Pinus radiata (Pinaceae) was found to exhibit ABA-driven stomatal closure during water stress, resulting in strongly isohydric regulation of water loss. By contrast, stomatal closure in Callitris rhomboidea (Cupressaceae) was initiated by elevated foliar ABA, but sustained water stress saw a marked decline in ABA levels and a shift to water potential-driven stomatal closure. The transition from ABA to water potential as the primary driver of stomatal aperture allowed C. rhomboidea to rapidly recover gas exchange after water-stressed plants were rewatered, and was associated with a strongly anisohydric regulation of water loss. These two contrasting mechanisms of stomatal regulation function in combination with xylem vulnerability to produce highly divergent strategies of water management. Species-specific ABA dynamics are proposed as a central component of drought survival and ecology. PMID:23709665

  5. Sequence-Divergent Chordopoxvirus Homologs of the O3 Protein Maintain Functional Interactions with Components of the Vaccinia Virus Entry-Fusion Complex

    PubMed Central

    Satheshkumar, P. S.

    2012-01-01

    Composed of 35 amino acids, O3 is the smallest characterized protein encoded by vaccinia virus (VACV) and is an integral component of the entry-fusion complex (EFC). O3 is conserved with 100% identity in all orthopoxviruses except for monkeypox viruses, whose O3 homologs have 2 to 3 amino acid substitutions. Since O3 is part of the EFC, high conservation could suggest an immutable requirement for interaction with multiple proteins. Chordopoxviruses of other genera also encode small proteins with a characteristic predicted N-terminal α-helical hydrophobic domain followed by basic amino acids and proline in the same relative genome location as that of VACV O3. However, the statistical significance of their similarity to VACV O3 is low due to the large contribution of the transmembrane domain, their small size, and their sequence diversity. Nevertheless, trans-complementation experiments demonstrated the ability of a representative O3-like protein from each chordopoxvirus genus to rescue the infectivity of a VACV mutant that was unable to express endogenous O3. Moreover, recombinant viruses expressing O3 homologs in place of O3 replicated and formed plaques as well or nearly as well as wild-type VACV. The O3 homologs expressed by the recombinant VACVs were incorporated into the membranes of mature virions and, with one exception, remained stably associated with the detergent-extracted and affinity-purified EFC. The ability of the sequence-divergent O3 homologs to coordinate function with VACV entry proteins suggests the conservation of structural motifs. Analysis of chimeras formed by swapping domains of O3 with those of other proteins indicated that the N-terminal transmembrane segment was responsible for EFC interactions and for the complementation of infectivity. PMID:22114343

  6. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  7. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  8. Meiotic recombination at the Lmp2 hotspot tolerates minor sequence divergence between homologous chromosomes

    SciTech Connect

    Yoshino, Masayasu; Sagai, Tomoko; Shiroishi, Toshihiko

    1996-06-01

    Recombination is widely considered to linearly depend on the length of the homologous sequences. An 11% mismatch decreases the rate of phage-plasmid recombination 240-fold. Two single nucleotide mismatches, which reduce the longest uninterrupted stretch of similarity from 232 base pairs (bp) to 134 bp, reduce gene conversion in mouse L cells 20-fold. The efficiency of gene targeting through homologous recombination in mouse embryonic stem cells can be increased by using an isogenic, rather than a non-isogenic, DNA construct. In this study we asked whether a high degree of sequence identity between homologous mouse chromosomes enhances meiotic recombination at a hotspot. Sites of meiotic recombination in the mouse major histocompatibility complex (MHC) class II region are not randomly distributed but are almost all clustered within short segments known as recombinational hotspots. The wm7 MHC haplotype, derived from Japanese wild mice Mus musculus molossinus, enhances meiotic recombination at a hotspot near the Lmp2 gene. Heterozygotes between the wm7 haplotype and the b or k haplotypes have yielded a high frequency of recombination (2.1%) in 1.3 kilobase kb segment of this hotspot. 20 refs., 2 figs.

  9. Purification, sequencing and evaluation of a divergent phytase from Penicillium oxalicum KCTC6440.

    PubMed

    Kim, Bong-Hyun; Lee, Ji Yeon; Lee, Peter C W

    2015-01-01

    A fungal strain producing high levels of phytase was purified to homogeneity from Penicillium oxalicum KCTC6440 (PhyA). The molecular mass of the purified PhyA was 65 kDa and optimal activity occurred at 55°C. The enzyme was stable in a pH range of 4.5-6.5, with an optimum performance at pH 5.5. The Km value for the substrate sodium phytate was 0.48 mM with a Vmax of 672 U/mg. The enzyme was inhibited by Ca(2+), Cu(2+), and Zn(2+), and slightly enhanced by EDTA. The PhyA efficiently released phosphate from feedstuffs such as soybean, rich bran and corn meal. The PhyA gene was cloned in two steps of degenerate PCR and inverse PCR and found to comprise 1501 bp and encode 461 amino acid residues. The enzyme was found to have only 13 amino acids differing to the known PhyA from other Penicillium sp., but has distinct enzyme characteristics. Computational analysis showed that PhyA possessed more positively charged residues in the active sites compared to other PhyA molecules, which may explain the broader pH spectrum.

  10. Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63

    PubMed Central

    Zhang, Jianwei; Chen, Ling-Ling; Xing, Feng; Kudrna, David A.; Yao, Wen; Copetti, Dario; Mu, Ting; Li, Weiming; Song, Jia-Ming; Lee, Seunghee; Talag, Jayson; Shao, Lin; An, Yue; Zhang, Chun-Liu; Ouyang, Yidan; Sun, Shuai; Jiao, Wen-Biao; Lv, Fang; Du, Bogu; Luo, Meizhong; Maldonado, Carlos Ernesto; Goicoechea, Jose Luis; Xiong, Lizhong; Wu, Changyin; Xing, Yongzhong; Zhou, Dao-Xiu; Yu, Sibin; Zhao, Yu; Wang, Gongwei; Yu, Yeisoo; Luo, Yijie; Zhou, Zhi-Wei; Hurtado, Beatriz Elena Padilla; Danowitz, Ann; Wing, Rod A.; Zhang, Qifa

    2016-01-01

    Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059–2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement. PMID:27535938

  11. Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

    NASA Astrophysics Data System (ADS)

    Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

    2000-02-01

    Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.

  12. Population histories of right whales (Cetacea: Eubalaena) inferred from mitochondrial sequence diversities and divergences of their whale lice (Amphipoda: Cyamus).

    PubMed

    Kaliszewska, Zofia A; Seger, Jon; Rowntree, Victoria J; Barco, Susan G; Benegas, Rafael; Best, Peter B; Brown, Moira W; Brownell, Robert L; Carribero, Alejandro; Harcourt, Robert; Knowlton, Amy R; Marshall-Tilas, Kim; Patenaude, Nathalie J; Rivarola, Mariana; Schaeff, Catherine M; Sironi, Mariano; Smith, Wendy A; Yamada, Tadasu K

    2005-10-01

    Right whales carry large populations of three 'whale lice' (Cyamus ovalis, Cyamus gracilis, Cyamus erraticus) that have no other hosts. We used sequence variation in the mitochondrial COI gene to ask (i) whether cyamid population structures might reveal associations among right whale individuals and subpopulations, (ii) whether the divergences of the three nominally conspecific cyamid species on North Atlantic, North Pacific, and southern right whales (Eubalaena glacialis, Eubalaena japonica, Eubalaena australis) might indicate their times of separation, and (iii) whether the shapes of cyamid gene trees might contain information about changes in the population sizes of right whales. We found high levels of nucleotide diversity but almost no population structure within oceans, indicating large effective population sizes and high rates of transfer between whales and subpopulations. North Atlantic and Southern Ocean populations of all three species are reciprocally monophyletic, and North Pacific C. erraticus is well separated from North Atlantic and southern C. erraticus. Mitochondrial clock calibrations suggest that these divergences occurred around 6 million years ago (Ma), and that the Eubalaena mitochondrial clock is very slow. North Pacific C. ovalis forms a clade inside the southern C. ovalis gene tree, implying that at least one right whale has crossed the equator in the Pacific Ocean within the last 1-2 million years (Myr). Low-frequency polymorphisms are more common than expected under neutrality for populations of constant size, but there is no obvious signal of rapid, interspecifically congruent expansion of the kind that would be expected if North Atlantic or southern right whales had experienced a prolonged population bottleneck within the last 0.5 Myr.

  13. The interplay between DNA methylation and sequence divergence in recent human evolution

    PubMed Central

    Hernando-Herraez, Irene; Heyn, Holger; Fernandez-Callejo, Marcos; Vidal, Enrique; Fernandez-Bellon, Hugo; Prado-Martinez, Javier; Sharp, Andrew J.; Esteller, Manel; Marques-Bonet, Tomas

    2015-01-01

    Despite the increasing knowledge about DNA methylation, the understanding of human epigenome evolution is in its infancy. Using whole genome bisulfite sequencing we identified hundreds of differentially methylated regions (DMRs) in humans compared to non-human primates and estimated that ∼25% of these regions were detectable throughout several human tissues. Human DMRs were enriched for specific histone modifications and the majority were located distal to transcription start sites, highlighting the importance of regions outside the direct regulatory context. We also found a significant excess of endogenous retrovirus elements in human-specific hypomethylated. We reported for the first time a close interplay between inter-species genetic and epigenetic variation in regions of incomplete lineage sorting, transcription factor binding sites and human differentially hypermethylated regions. Specifically, we observed an excess of human-specific substitutions in transcription factor binding sites located within human DMRs, suggesting that alteration of regulatory motifs underlies some human-specific methylation patterns. We also found that the acquisition of DNA hypermethylation in the human lineage is frequently coupled with a rapid evolution at nucleotide level in the neighborhood of these CpG sites. Taken together, our results reveal new insights into the mechanistic basis of human-specific DNA methylation patterns and the interpretation of inter-species non-coding variation. PMID:26170231

  14. Sequence divergence of microsatellites for phylogeographic assessment of Moroccan Medicago species.

    PubMed

    Zitouna, N; Marghali, S; Gharbi, M; Haddioui, A; Trifi-Farah, N

    2014-03-12

    Six Medicago species were investigated to characterize and valorize plant genetic resources of pastoral interest in Morocco. Samples were obtained from the core collection of the South Australian Research and Development Institute (SARDI). The transferability of single sequence repeat markers of Medicago truncatula was successful with 97.6% efficiency across the five species. A total of 283 alleles and 243 genotypes were generated using seven SSR markers, confirming the high level of polymorphism that is characteristic of the Medicago genus, despite a heterozygosity deficit (HO = 0.378; HE = 0.705). In addition, a high level of gene flow was revealed among the species analyzed with significant intra-specific variation. The unweighted pair group method with arithmetic mean dendrogram generated by the dissimilarity matrix revealed that M. polymorpha and M. orbicularis are closely related, and that M. truncatula is likely the ancestral species. The Pearson correlation index revealed no significant correlations between the geographic distribution of the Moroccan species and genetic similarities, indicating local adaptation of these species to different ecological environments independent of their topographical proximities. The substantial genetic variation observed was likely due to the predominance of selfing species, the relative proximity of prospected sites, human impacts, and the nature of the SARDI core collections, which are selected for their high genetic diversity. The results of this first report on Moroccan Medicago species will be of great interest for establishing strategies aiming at reasonable management and selection programs for local and Mediterranean germplasm in the face of increasing environmental change.

  15. The gene coding for 3-deoxy-manno-octulosonic acid transferase and the rfaQ gene are transcribed from divergently arranged promoters in Escherichia coli.

    PubMed Central

    Clementz, T

    1992-01-01

    The gene kdtA in Escherichia coli codes for 3-deoxy-D-manno-octulosonic acid transferase, the enzyme responsible for attachment of the two 3-deoxy-D-manno-octulosonic acid residues that constitute the link between lipid A and the core oligosaccharide of the lipopolysaccharide. Cloning and subsequent sequencing of the region upstream of kdtA revealed an open reading frame identified as the first gene (rfaQ) in an rfa gene cluster. The kdtA and rfaQ transcripts were identified, and the 5' ends of the transcripts were mapped by primer extension. Two main, divergently arranged promoters were found. These promoters generated transcripts with 5' ends separated by 289 bases. That the two divergent transcripts from the identified promoters represent the kdtA and rfaQ transcripts was confirmed by fusing different parts of the intergenic region between the promoterless lacZ and phoA genes in promoter-screening plasmid pCB267. Images PMID:1447141

  16. Decrease of Population Divergence in Eurasian Perch (Perca fluviatilis) in Browning Waters: Role of Fatty Acids and Foraging Efficiency

    PubMed Central

    Scharnweber, Kristin; Strandberg, Ursula; Karlsson, Konrad; Eklöv, Peter

    2016-01-01

    Due to altered biogeochemical processes related to climate change, highly colored dissolved organic carbon (DOC) from terrestrial sources will lead to a water “brownification” in many freshwater systems of the Northern Hemisphere. This will create deteriorated visual conditions that have been found to affect habitat-specific morphological variations in Eurasian perch (Perca fluviatilis) in a previous study. So far, potential drivers and ultimate causes of these findings have not been identified. We conducted a field study to investigate the connection between morphological divergence and polyunsaturated fatty acid (PUFA) composition of perch from six lakes across a gradient of DOC concentration. We expected a decrease in the prevalence of PUFAs, which are important for perch growth and divergence with increasing DOC concentrations, due to the restructuring effects of DOC on aquatic food webs. In general, rate of morphological divergence in perch decreased with increasing DOC concentrations. Proportions of specific PUFAs (22:6n-3, 18:3n-3, 20:5n-3, and 20:4n-6) identified to primarily contribute to overall differences between perch caught in clear and brown-water lakes tended to be connected to overall decline of morphological divergence. However, no overall significant relationship was found, indicating no severe limitation of essential fatty acids for perch inhabiting brown water lakes. We further broaden our approach by conducting a laboratory experiment on foraging efficiency of perch. Therefore, we induced pelagic and littoral phenotypes by differences in habitat-structure and feeding mode and recorded attack rate in a feeding experiment. Generally, fish were less efficient in foraging on littoral prey (Ephemeroptera) when visual conditions were degraded by brown water color. We concluded that browning water may have a strong effect on the forager’s ability to find particular food resources, resulting in the reduced development of evolutionary traits

  17. Amino acid sequence of mouse submaxillary gland renin.

    PubMed Central

    Misono, K S; Chang, J J; Inagami, T

    1982-01-01

    The complete amino acid sequences of the heavy chain and light chain of mouse submaxillary gland renin have been determined. The heavy chain consists of 288 amino acid residues having a Mr of 31,036 calculated from the sequence. The light chain contains 48 amino acid residues with a Mr of 5,458. The sequence of the heavy chain was determined by automated Edman degradations of the cyanogen bromide peptides and tryptic peptides generated after citraconylation, as well as other peptides generated therefrom. The sequence of the light chain was derived from sequence analyses of the peptides generated by cyanogen bromide cleavage or by digestion with Staphylococcus aureus protease. The sequences in the active site regions in renin containing two catalytically essential aspartyl residues 32 and 215 were found identical with those in pepsin, chymosin, and penicillopepsin. Comparison of the amino acid sequence of renin with that of porcine pepsin indicated a 42% sequence identity of the heavy chain with the amino-terminal and middle regions and a 46% identity of the light chain with the carboxyl-terminal region of the porcine pepsin sequence. Residues identical in renin and pepsin are distributed throughout the length of the molecules, suggesting a similarity in their overall structures. PMID:6812055

  18. Restriction of V3 region sequence divergence in the HIV-1 envelope gene during antiretroviral treatment in a cohort of recent seroconverters

    PubMed Central

    2013-01-01

    Background Dynamic changes in Human Immunodeficiency Virus 1 (HIV-1) sequence diversity and divergence are associated with immune control during primary infection and progression to AIDS. Consensus sequencing or single genome amplification sequencing of the HIV-1 envelope (env) gene, in particular the variable (V) regions, is used as a marker for HIV-1 genome diversity, but population diversity is only minimally, or semi-quantitatively sampled using these methods. Results Here we use second generation deep sequencing to determine inter-and intra-patient sequence heterogeneity and to quantify minor variants in a cohort of individuals either receiving or not receiving antiretroviral treatment following seroconversion; the SPARTAC trial. We show, through a cross-sectional study of sequence diversity of the env V3 in 30 antiretroviral-naive patients during primary infection that considerable population structure diversity exists, with some individuals exhibiting highly constrained plasma virus diversity. Diversity was independent of clinical markers (viral load, time from seroconversion, CD4 cell count) of infection. Serial sampling over 60 weeks of non-treated individuals that define three initially different diversity profiles showed that complex patterns of continuing HIV-1 sequence diversification and divergence could be readily detected. Evidence for minor sequence turnover, emergence of new variants and re-emergence of archived variants could be inferred from this analysis. Analysis of viral divergence over the same time period in patients who received short (12 weeks, ART12) or long course antiretroviral therapy (48 weeks, ART48) and a non-treated control group revealed that ART48 successfully suppressed viral divergence while ART12 did not have a significant effect. Conclusions Deep sequencing is a sensitive and reliable method for investigating the diversity of the env V3 as an important component of HIV-1 genome diversity. Detailed insights into the complex

  19. Targeted sequence capture and resequencing implies a predominant role of regulatory regions in the divergence of a sympatric lake whitefish species pair (Coregonus clupeaformis).

    PubMed

    Hebert, Francois Olivier; Renaut, Sébastien; Bernatchez, Louis

    2013-10-01

    Latest technological developments in evolutionary biology bring new challenges in documenting the intricate genetic architecture of species in the process of divergence. Sympatric populations of lake whitefish represent one of the key systems to investigate this issue. Despite the value of random genotype-by-sequencing methods and decreasing cost of sequencing technologies, it remains challenging to investigate variation in coding regions, especially in the case of recently duplicated genomes as in salmonids, as this greatly complicates whole genome resequencing. We thus designed a sequence capture array targeting 2773 annotated genes to document the nature and the extent of genomic divergence between sympatric dwarf and normal whitefish. Among the 2728 genes successfully captured, a total of 2182 coding and 10,415 noncoding putative single-nucleotide polymorphisms (SNPs) were identified after applying a first set of basic filters. A genome scan with a quality-refined selection of 2203 SNPs identified 267 outlier SNPs in 210 candidate genes located in genomic regions potentially involved in whitefish divergence and reproductive isolation. We found highly heterogeneous FST estimates among SNP loci. There was an overall low level of coding polymorphism, with a predominance of noncoding mutations among outliers. The heterogeneous patterns of divergence among loci confirm the porous nature of genomes during speciation with gene flow. Considering that few protein-coding mutations were identified as highly divergent, our results, along with previous transcriptomic studies, imply that changes in regulatory regions most likely had a greater role in the process of whitefish population divergence than protein-coding mutations. This study is the first to demonstrate the efficiency of large-scale targeted resequencing for a nonmodel species with such a large and unsequenced genome.

  20. Characterization and Profiling of Liver microRNAs by RNA-sequencing in Cattle Divergently Selected for Residual Feed Intake

    PubMed Central

    Al-Husseini, Wijdan; Chen, Yizhou; Gondro, Cedric; Herd, Robert M.; Gibson, John P.; Arthur, Paul F.

    2016-01-01

    MicroRNAs (miRNAs) are short non-coding RNAs that post-transcriptionally regulate expression of mRNAs in many biological pathways. Liver plays an important role in the feed efficiency of animals and high and low efficient cattle demonstrated different gene expression profiles by microarray. Here we report comprehensive miRNAs profiles by next-gen deep sequencing in Angus cattle divergently selected for residual feed intake (RFI) and identify miRNAs related to feed efficiency in beef cattle. Two microRNA libraries were constructed from pooled RNA extracted from livers of low and high RFI cattle, and sequenced by Illumina genome analyser. In total, 23,628,103 high quality short sequence reads were obtained and more than half of these reads were matched to the bovine genome (UMD 3.1). We identified 305 known bovine miRNAs. Bta-miR-143, bta-miR-30, bta-miR-122, bta-miR-378, and bta-let-7 were the top five most abundant miRNAs families expressed in liver, representing more than 63% of expressed miRNAs. We also identified 52 homologous miRNAs and 10 novel putative bovine-specific miRNAs, based on precursor sequence and the secondary structure and utilizing the miRBase (v. 21). We compared the miRNAs profile between high and low RFI animals and ranked the most differentially expressed bovine known miRNAs. Bovine miR-143 was the most abundant miRNA in the bovine liver and comprised 20% of total expressed mapped miRNAs. The most highly expressed miRNA in liver of mice and humans, miR-122, was the third most abundant in our cattle liver samples. We also identified 10 putative novel bovine-specific miRNA candidates. Differentially expressed miRNAs between high and low RFI cattle were identified with 18 miRNAs being up-regulated and 7 other miRNAs down-regulated in low RFI cattle. Our study has identified comprehensive miRNAs expressed in bovine liver. Some of the expressed miRNAs are novel in cattle. The differentially expressed miRNAs between high and low RFI give some

  1. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua ); Eng, J.; Yalow, R.S. City Univ. of New York, NY )

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled park insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  2. Amino Acid Sequence of Human Cholinesterase

    DTIC Science & Technology

    1985-10-01

    liquid chromatography (HPLC). Activity testing of the aged, DFP-labeled cholinesterase showed that 99.8% of the active sites had been labeled, since...acids were quantitated by ninhydrin at the AAA Labs, or by derivatization with phenylisothiocyanate at the University of Michigan. The latter method

  3. Cystatin. Amino acid sequence and possible secondary structure.

    PubMed Central

    Schwabe, C; Anastasi, A; Crow, H; McDonald, J K; Barrett, A J

    1984-01-01

    The amino acid sequence of cystatin, the protein from chicken egg-white that is a tight-binding inhibitor of many cysteine proteinases, is reported. Cystatin is composed of 116 amino acid residues, and the Mr is calculated to be 13 143. No striking similarity to any other known sequence has been detected. The results of computer analysis of the sequence and c.d. spectrometry indicate that the secondary structure includes relatively little alpha-helix (about 20%) and that the remainder is mainly beta-structure. PMID:6712597

  4. Multifarious selection through environmental change: acidity and predator-mediated adaptive divergence in the moor frog (Rana arvalis)

    PubMed Central

    Egea-Serrano, Andrés; Hangartner, Sandra; Laurila, Anssi; Räsänen, Katja

    2014-01-01

    Environmental change can simultaneously cause abiotic stress and alter biological communities, yet adaptation of natural populations to co-changing environmental factors is poorly understood. We studied adaptation to acid and predator stress in six moor frog (Rana arvalis) populations along an acidification gradient, where abundance of invertebrate predators increases with increasing acidity of R. arvalis breeding ponds. First, we quantified divergence among the populations in anti-predator traits (behaviour and morphology) at different rearing conditions in the laboratory (factorial combinations of acid or neutral pH and the presence or the absence of a caged predator). Second, we evaluated relative fitness (survival) of the populations by exposing tadpoles from the different rearing conditions to predation by free-ranging dragonfly larvae. We found that morphological defences (relative tail depth) as well as survival of tadpoles under predation increased with increasing pond acidity (under most experimental conditions). Tail depth and larval size mediated survival differences among populations, but the contribution of trait divergence to survival was strongly dependent on prior rearing conditions. Our results indicate that R. arvalis populations are adapted to the elevated predator pressure in acidified ponds and emphasize the importance of multifarious selection via both direct (here: pH) and indirect (here: predators) environmental changes. PMID:24552840

  5. Mitochondrial genomes and divergence times of crocodile newts: inter-islands distribution of Echinotriton andersoni and the origin of a unique repetitive sequence found in Tylototriton mt genomes.

    PubMed

    Kurabayashi, Atsushi; Nishitani, Takuma; Katsuren, Seiki; Oumi, Shohei; Sumida, Masayuki

    2012-01-01

    Crocodile newts, which constitute the genera Echinotriton and Tylototriton, are known as living fossils, and these genera comprise many endangered species. To identify mitochondrial (mt) genes suitable for future population genetic analyses for endangered taxa, we determined the complete nucleotide sequences of the mt genomes of the Japanese crocodile newt Echinotriton andersoni and Himalayan crocodile newt Tylototriton verrucosus. Although the control region (CR) is known as the most variable mtDNA region in many animal taxa, the CRs of crocodile newts are highly conservative. Rather, the genes of NADH dehydrogenase subunits and ATPase subunit 6 were found to have high sequence divergences and to be usable for population genetics studies. To estimate the inter-population divergence ages of E. andersoni endemic to the Ryukyu Islands, we performed molecular dating analysis using whole and partial mt genomic data. The estimated divergence ages of the inter-island individuals are older than the paleogeographic segmentation ages of the islands, suggesting that the lineage splits of E. andersoni populations were not caused by vicariant events. Our phylogenetic analysis with partial mt sequence data also suggests the existence of at least two more undescribed species in the genus Tylototriton. We also found unusual repeat sequences containing the 3' region of cytochrome apoenzyme b gene, whole tRNA-Thr gene, and a noncoding region (the T-P noncoding region characteristic in caudate mtDNAs) from T. verrucosus mtDNA. Similar repeat sequences were found in two other Tylototriton species. The Tylototriton taxa with the repeats become a monophyletic group, indicating a single origin of the repeat sequences. The intra-and inter-specific comparisons of the repeat sequences suggest the occurrences of homologous recombination-based concerted evolution among the repeat sequences.

  6. Whole plastome sequencing reveals deep plastid divergence and cytonuclear discordance between closely related balsam poplars, Populus balsamifera and P. trichocarpa (Salicaceae).

    PubMed

    Huang, Daisie I; Hefer, Charles A; Kolosova, Natalia; Douglas, Carl J; Cronk, Quentin C B

    2014-11-01

    As molecular phylogenetic analyses incorporate ever-greater numbers of loci, cases of cytonuclear discordance - the phenomenon in which nuclear gene trees deviate significantly from organellar gene trees - are being reported more frequently. Plant examples of topological discordance, caused by recent hybridization between extant species, are well known. However, examples of branch-length discordance are less reported in plants relative to animals. We use a combination of de novo assembly and reference-based mapping using short-read shotgun sequences to construct a robust phylogeny of the plastome for multiple individuals of all the common Populus species in North America. We demonstrate a case of strikingly high plastome divergence, in contrast to little nuclear genome divergence, in two closely related balsam poplars, Populus balsamifera and Populus trichocarpa (Populus balsamifera ssp. trichocarpa). Previous studies with nuclear loci indicate that the two species (or subspecies) diverged since the late Pleistocene, whereas their plastomes indicate deep divergence, dating to at least the Pliocene (6-7 Myr ago). Our finding is in marked contrast to the estimated Pleistocene divergence of the nuclear genomes, previously calculated at 75 000 yr ago, suggesting plastid capture from a 'ghost lineage' of a now-extinct North American poplar.

  7. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences

    PubMed Central

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D.; Adir, Noam

    2016-01-01

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442

  8. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.

  9. Amino acid sequences of proteins from Leptospira serovar pomona.

    PubMed

    Alves, S F; Lefebvre, R B; Probert, W

    2000-01-01

    This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  10. Phylogenetic and evolutionary implications of complete chloroplast genome sequences of four early-diverging angiosperms: Buxus (Buxaceae), Chloranthus (Chloranthaceae), Dioscorea (Dioscoreaceae), and Illicium (Schisandraceae).

    PubMed

    Hansen, Debra R; Dastidar, Sayantani G; Cai, Zhengqiu; Penaflor, Cynthia; Kuehl, Jennifer V; Boore, Jeffrey L; Jansen, Robert K

    2007-11-01

    We have determined the complete chloroplast genome sequences of four early-diverging lineages of angiosperms, Buxus (Buxaceae), Chloranthus (Chloranthaceae), Dioscorea (Dioscoreaceae), and Illicium (Schisandraceae), to examine the organization and evolution of plastid genomes and to estimate phylogenetic relationships among angiosperms. For the most part, the organization of these plastid genomes is quite similar to the ancestral angiosperm plastid genome with a few notable exceptions. Dioscorea has lost one protein-coding gene, rps16; this gene loss has also happened independently in four other land plant lineages, liverworts, conifers, Populus, and legumes. There has also been a small expansion of the inverted repeat (IR) in Dioscorea that has duplicated trnH-GUG. This event has also occurred multiple times in angiosperms, including in monocots, and in the two basal angiosperms Nuphar and Drimys. The Illicium chloroplast genome is unusual by having a 10 kb contraction of the IR. The four taxa sequenced represent key groups in resolving phylogenetic relationships among angiosperms. Illicium is one of the basal angiosperms in the Austrobaileyales, Chloranthus (Chloranthales) remains unplaced in angiosperm classifications, and Buxus and Dioscorea are early-diverging eudicots and monocots, respectively. We have used sequences for 61 shared protein-coding genes from these four genomes and combined them with sequences from 35 other genomes to estimate phylogenetic relationships using parsimony, likelihood, and Bayesian methods. There is strong congruence among the trees generated by the three methods, and most nodes have high levels of support. The results indicate that Amborella alone is sister to the remaining angiosperms; the Nymphaeales represent the next-diverging clade followed by Illicium; Chloranthus is sister to the magnoliids and together this group is sister to a large clade that includes eudicots and monocots; and Dioscorea represents an early-diverging

  11. Extensive amino acid sequence homologies between animal lectins

    SciTech Connect

    Paroutaud, P.; Levi, G.; Teichberg, V.I.; Strosberg, A.D.

    1987-09-01

    The authors have established the amino acid sequence of the ..beta..-D-galactoside binding lectin from the electric eel and the sequences of several peptides from a similar lectin isolated from human placenta. These sequences were compared with the published sequences of peptides derived from the ..beta..-D-galactoside binding lectin from human lung and with sequences deduced from cDNAs assigned to the ..beta..-D-galactoside binding lectins from chicken embryo skin and human hepatomas. Significant homologies were observed. One of the highly conserved regions that contains a tryptophan residue and two glutamic acid resides is probably part of the ..beta..-D-galactoside binding site, which, on the basis of spectroscopic studies of the electric eel lectin, is expected to contain such residues. The similarity of the hydropathy profiles and the predicted secondary structure of the lectins from chicken skin and electric eel, in spite of differences in their amino acid sequences, strongly suggests that these proteins have maintained structural homologies during evolution and together with the other ..beta..-D-galactoside binding lectins were derived form a common ancestor gene.

  12. Amino acid sequence of porcine spleen cathepsin D.

    PubMed Central

    Shewale, J G; Tang, J

    1984-01-01

    The amino acid sequence of porcine spleen cathepsin D heavy chain has been determined and, hence, the complete structure of this enzyme is now known. The sequence of heavy chain was constructed by aligning the structures of peptides generated by cyanogen bromide, trypsin, and endo-proteinase Lys C cleavages. The structure of the light chain has been published previously. The cathepsin D molecule contains 339 amino acid residues in two polypeptide chains: a 97-residue light chain and a 242-residue heavy chain, with a combined Mr of 36,779 (without carbohydrate). There are two carbohydrate units linked to asparagine residues 70 and 192. The disulfide bond arrangement in cathepsin D is probably similar to that of pepsin, because the positions of six half-cystine residues are conserved. The active site aspartyl residues, corresponding to aspartic acid-32 and -215 of pepsin, are located at residues 33 and 224 in the cathepsin D molecule. The amino acid sequence around these aspartyl residues is strongly conserved. Cathepsin D shows a strong homology with other acid proteases. When the sequence of cathepsin D, renin, and pepsin are aligned, 32.7% of the residues are identical. The homology is observed throughout the length of the molecules, indicating that three-dimensional structures of all three molecules are similar. PMID:6587385

  13. The REH theory of protein and nucleic acid divergence - A retrospective update. [Random Evolutionary Hits

    NASA Technical Reports Server (NTRS)

    Holmquist, R.

    1978-01-01

    The random evolutionary hits (REH) theory of evolutionary divergence, originally proposed in 1972, is restated with attention to certain aspects of the theory that have caused confusion. The theory assumes that natural selection and stochastic processes interact and that natural selection restricts those codon sites which may fix mutations. The predicted total number of fixed nucleotide replacements agrees with data for cytochrome c, a-hemoglobin, beta-hemoglobin, and myoglobin. The restatement analyzes the magnitude of possible sources of errors and simplifies calculational methodology by supplying polynomial expressions to replace tables and graphs.

  14. Functional Divergence in the Genus Oenococcus as Predicted by Genome Sequencing of the Newly-Described Species, Oenococcus kitaharae

    PubMed Central

    Borneman, Anthony R.; McCarthy, Jane M.; Chambers, Paul J.; Bartowsky, Eveline J.

    2012-01-01

    Oenococcus kitaharae is only the second member of the genus Oenococcus to be identified and is the closest relative of the industrially important wine bacterium Oenococcus oeni. To provide insight into this new species, the genome of the type strain of O. kitaharae, DSM 17330, was sequenced. Comparison of the sequenced genomes of both species show that the genome of O. kitaharae DSM 17330 contains many genes with predicted functions in cellular defence (bacteriocins, antimicrobials, restriction-modification systems and a CRISPR locus) which are lacking in O. oeni. The two genomes also appear to differentially encode several metabolic pathways associated with amino acid biosynthesis and carbohydrate utilization and which have direct phenotypic consequences. This would indicate that the two species have evolved different survival techniques to suit their particular environmental niches. O. oeni has adapted to survive in the harsh, but predictable, environment of wine that provides very few competitive species. However O. kitaharae appears to have adapted to a growth environment in which biological competition provides a significant selective pressure by accumulating biological defence molecules, such as bacteriocins and restriction-modification systems, throughout its genome. PMID:22235313

  15. Active site amino acid sequence of human factor D.

    PubMed

    Davis, A E

    1980-08-01

    Factor D was isolated from human plasma by chromatography on CM-Sephadex C50, Sephadex G-75, and hydroxylapatite. Digestion of reduced, S-carboxymethylated factor D with cyanogen bromide resulted in three peptides which were isolated by chromatography on Sephadex G-75 (superfine) equilibrated in 20% formic acid. NH2-Terminal sequences were determined by automated Edman degradation with a Beckman 890C sequencer using a 0.1 M Quadrol program. The smallest peptide (CNBr III) consisted of the NH2-terminal 14 amino acids. The other two peptides had molecular weights of 17,000 (CNBr I) and 7000 (CNBr II). Overlap of the NH2-terminal sequence of factor D with the NH2-terminal sequence of CNBr I established the order of the peptides. The NH2-terminal 53 residues of factor D are somewhat more homologous with the group-specific protease of rat intestine than with other serine proteases. The NH2-terminal sequence of CNBr II revealed the active site serine of factor D. The typical serine protease active site sequence (Gly-Asp-Ser-Gly-Gly-Pro was found at residues 12-17. The region surrounding the active site serine does not appear to be more highly homologous with any one of the other serine proteases. The structural data obtained point out the similarities between factor D and the other proteases. However, complete definition of the degree of relationship between factor D and other proteases will require determination of the remainder of the primary structure.

  16. Extreme sequence divergence between mitochondrial genomes of two subspecies of White-breasted Wood-wren (Henicorhina leucosticta, Cabanis, 1847) from western and central Panamá.

    PubMed

    Aguilar, Celestino; De Léon, Luis Fernando; Loaiza, José R; McMillan, W Owen; Miller, Matthew J

    2016-01-01

    Prior studies of mitochondrial variation in White-breasted Wood-Wrens (Henicorhina leucosticta) have suggested that populations in South American and Mesoamerica might represent multiple species. Here we report the complete mitochondrial genomes from two individuals of H. leucosticta, representing the Panamanian subspecies pittieri and alexandri. The two sequences were 16,721 and 16,726 base pairs in size with both genomes comprised of the usual 22 tRNA genes, 2 rRNA genes, 13 protein-coding genes, and one displacement loop region in the standard avian order. Uncorrected pairwise divergence between mitogenome features was high, with the highest divergence occurring in protein-coding genes (average = 8.2%), followed by control region (6.7%). RNA features had lower pairwise divergences (average tRNA = 4.3%, average rRNA = 2.3%). The protein-coding ATPase 6 gene had a different stop codon between these two specimens. The high level of sequence variation between these subspecies suggests that Mesoamerican H. leucosticta might be comprised of multiple species. We urge a full phylogeographic survey of this widespread Neotropical forest bird.

  17. The amino acid sequence of iguana (Iguana iguana) pancreatic ribonuclease.

    PubMed

    Zhao, W; Beintema, J J; Hofsteenge, J

    1994-01-15

    The pyrimidine-specific ribonuclease superfamily constitutes a group of homologous proteins so far found only in higher vertebrates. Four separate families are found in mammals, which have resulted from gene duplications in mammalian ancestors. To learn more about the evolutionary history of this superfamily, the primary structure and other characteristics of the pancreatic enzyme from iguana (Iguana iguana), a herbivorous lizard species belonging to the reptiles, have been determined. The polypeptide chain consists of 119 amino acid residues. The positions of insertions and deletions in the sequence are identical to those in the enzyme from snapping turtle. However, the two enzymes differ at 54% of the amino acid positions. Iguana ribonuclease contains no carbohydrate, although the enzyme possesses three recognition sites for carbohydrate attachment, and has a high number of acidic residues in a localized part of the sequence.

  18. Genome-wide association studies for fatty acid metabolic traits in five divergent pig populations

    PubMed Central

    Zhang, Wanchang; Bin Yang; Zhang, Junjie; Cui, Leilei; Ma, Junwu; Chen, Congying; Ai, Huashui; Xiao, Shijun; Ren, Jun; Huang, Lusheng

    2016-01-01

    Fatty acid composition profiles are important indicators of meat quality and tasting flavor. Metabolic indices of fatty acids are more authentic to reflect meat nutrition and public acceptance. To investigate the genetic mechanism of fatty acid metabolic indices in pork, we conducted genome-wide association studies (GWAS) for 33 fatty acid metabolic traits in five pig populations. We identified a total of 865 single nucleotide polymorphisms (SNPs), corresponding to 11 genome-wide significant loci on nine chromosomes and 12 suggestive loci on nine chromosomes. Our findings not only confirmed seven previously reported QTL with stronger association strength, but also revealed four novel population-specific loci, showing that investigations on intermediate phenotypes like the metabolic traits of fatty acids can increase the statistical power of GWAS for end-point phenotypes. We proposed a list of candidate genes at the identified loci, including three novel genes (FADS2, SREBF1 and PLA2G7). Further, we constructed the functional networks involving these candidate genes and deduced the potential fatty acid metabolic pathway. These findings advance our understanding of the genetic basis of fatty acid composition in pigs. The results from European hybrid commercial pigs can be immediately transited into breeding practice for beneficial fatty acid composition. PMID:27097669

  19. Complete Plastid Genome Sequencing of Trochodendraceae Reveals a Significant Expansion of the Inverted Repeat and Suggests a Paleogene Divergence between the Two Extant Species

    PubMed Central

    Sun, Yan-xia; Moore, Michael J.; Meng, Ai-ping; Soltis, Pamela S.; Soltis, Douglas E.; Li, Jian-qiang; Wang, Heng-chang

    2013-01-01

    The early-diverging eudicot order Trochodendrales contains only two monospecific genera, Tetracentron and Trochodendron. Although an extensive fossil record indicates that the clade is perhaps 100 million years old and was widespread throughout the Northern Hemisphere during the Paleogene and Neogene, the two extant genera are both narrowly distributed in eastern Asia. Recent phylogenetic analyses strongly support a clade of Trochodendrales, Buxales, and Gunneridae (core eudicots), but complete plastome analyses do not resolve the relationships among these groups with strong support. However, plastid phylogenomic analyses have not included data for Tetracentron. To better resolve basal eudicot relationships and to clarify when the two extant genera of Trochodendrales diverged, we sequenced the complete plastid genome of Tetracentron sinense using Illumina technology. The Tetracentron and Trochodendron plastomes possess the typical gene content and arrangement that characterize most angiosperm plastid genomes, but both genomes have the same unusual ∼4 kb expansion of the inverted repeat region to include five genes (rpl22, rps3, rpl16, rpl14, and rps8) that are normally found in the large single-copy region. Maximum likelihood analyses of an 83-gene, 88 taxon angiosperm data set yield an identical tree topology as previous plastid-based trees, and moderately support the sister relationship between Buxaceae and Gunneridae. Molecular dating analyses suggest that Tetracentron and Trochodendron diverged between 44-30 million years ago, which is congruent with the fossil record of Trochodendrales and with previous estimates of the divergence time of these two taxa. We also characterize 154 simple sequence repeat loci from the Tetracentron sinense and Trochodendron aralioides plastomes that will be useful in future studies of population genetic structure for these relict species, both of which are of conservation concern. PMID:23577110

  20. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately equal to 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  1. Amino acid sequence and comparative antigenicity of chicken metallothionein.

    PubMed Central

    McCormick, C C; Fullmer, C S; Garvey, J S

    1988-01-01

    The complete amino acid sequence of metallothionein (MT) from chicken liver is reported. The primary structure was determined by automated sequence analysis of peptides produced by limited acid hydrolysis and by trypsin digestion. The comparative antigenicity of chicken MT was determined by radioimmunoassay using rabbit anti-rat MT polyclonal antibody. Chicken MT consists of 63 amino acids as compared to 61 found in MTs from mammals. One insertion (and two substitutions) occurs in the amino-terminal region, a region considered invariant among mammalian MTs. Eighteen of the 20 cysteines in chicken MT were aligned with cysteines from other mammalian sequences. Two cysteines near the carboxyl terminus are shifted by one residue due to the insertion of proline in that region. Overall, the chicken protein showed approximately equal to 68% sequence identity in a comparison with various mammalian MTs. The affinity of the polyclonal antibody for chicken MT was decreased by 2 orders of magnitude in comparison to that of a mammalian MT (rat MT isoforms). This reduced affinity is attributed to major substitutions in chicken MT in the regions of the principal determinants of mammalian MTs. Theoretical analysis of the primary structure predicted the secondary structure to consist of reverse turns and random coils with no stable beta or helix conformations. There is no evidence that chicken MT differs functionally from mammalian MTs. PMID:2448773

  2. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  3. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  4. Middle-Upper Pleistocene climate changes shaped the divergence and demography of Cycas guizhouensis (Cycadaceae): Evidence from DNA sequences and microsatellite markers.

    PubMed

    Feng, Xiuyan; Zheng, Ying; Gong, Xun

    2016-06-07

    Climatic oscillations in the Pleistocene have had profound effects on the demography and genetic diversity of many extant species. Cycas guizhouensis Lan &R.F. Zou is an endemic and endangered species in Southwest China that is primarily distributed along the valleys of the Nanpan River. In this study, we used four chloroplast DNAs (cpDNA), three nuclear genes (nDNA) and 13 microsatellite (SSR) loci to investigate the genetic structure, divergence time and demographic history of 11 populations of C. guizhouensis. High genetic diversity and high levels of genetic differentiation among the populations were observed. Two evolutionary units were revealed based on network and Structure analysis. The divergence time estimations suggested that haplotypes of C. guizhouensis were diverged during the Middle-Upper Pleistocene. Additionally, the demographic histories deduced from different DNA sequences were discordant, but overall indicated that C. guizhouensis had experienced a recent population expansion during the post-glacial period. Microsatellite data revealed that there was a contraction in effective population size in the past. These genetic features allow conservation measures to be taken to ensure the protection of this endangered species from extinction.

  5. Middle-Upper Pleistocene climate changes shaped the divergence and demography of Cycas guizhouensis (Cycadaceae): Evidence from DNA sequences and microsatellite markers

    PubMed Central

    Feng, Xiuyan; Zheng, Ying; Gong, Xun

    2016-01-01

    Climatic oscillations in the Pleistocene have had profound effects on the demography and genetic diversity of many extant species. Cycas guizhouensis Lan & R.F. Zou is an endemic and endangered species in Southwest China that is primarily distributed along the valleys of the Nanpan River. In this study, we used four chloroplast DNAs (cpDNA), three nuclear genes (nDNA) and 13 microsatellite (SSR) loci to investigate the genetic structure, divergence time and demographic history of 11 populations of C. guizhouensis. High genetic diversity and high levels of genetic differentiation among the populations were observed. Two evolutionary units were revealed based on network and Structure analysis. The divergence time estimations suggested that haplotypes of C. guizhouensis were diverged during the Middle-Upper Pleistocene. Additionally, the demographic histories deduced from different DNA sequences were discordant, but overall indicated that C. guizhouensis had experienced a recent population expansion during the post-glacial period. Microsatellite data revealed that there was a contraction in effective population size in the past. These genetic features allow conservation measures to be taken to ensure the protection of this endangered species from extinction. PMID:27270859

  6. Sequencing and characterization of the complete mitochondrial genomes of three Pneumocystis species provide new insights into divergence between human and rodent Pneumocystis

    PubMed Central

    Ma, Liang; Huang, Da-Wei; Cuomo, Christina A.; Sykes, Sean; Fantoni, Giovanna; Das, Biswajit; Sherman, Brad T.; Yang, Jun; Huber, Charles; Xia, Yun; Davey, Emma; Kutty, Geetha; Bishop, Lisa; Sassi, Monica; Lempicki, Richard A.; Kovacs, Joseph A.

    2013-01-01

    Pneumocystis jirovecii is an important opportunistic pathogen associated with AIDS and other immunodeficient conditions. Currently, very little is known about its nuclear and mitochondrial genomes. In this study, we sequenced the complete mitochondrial genome (mtDNA) of this organism and its closely related species Pneumocystis carinii and Pneumocystis murina by a combination of sequencing technologies. Our study shows that P. carinii and P. murina mtDNA share a nearly identical number and order of genes in a linear configuration, whereas P. jirovecii has a circular mtDNA containing nearly the same set of genes but in a different order. Detailed studies of the mtDNA terminal structures of P. murina and P. carinii suggest a unique replication mechanism for linear mtDNA. Phylogenetic analysis supports a close association of Pneumocystis species with Taphrina, Saitoella, and Schizosaccharomyces, and divergence within Pneumocystis species, with P. murina and P. carinii being more closely related to each other than either is to P. jirovecii. Comparative analysis of four complete P. jirovecii mtDNA sequences in this study and previously reported mtDNA sequences for diagnosing and genotyping suggests that the current diagnostic and typing methods can be improved using the complete mtDNA data. The availability of the complete P. jirovecii mtDNA also opens the possibility of identifying new therapeutic targets.—Ma, L., Huang, D. W., Cuomo, C. A., Sykes, S., Fantoni, G., Das, B., Sherman, B. T., Yang, J., Huber, C., Xia, Y., Davey, E., Kutty, G., Bishop, L., Sassi, M., Lempicki, R. A., Kovacs, J. A. Sequencing and characterization of the complete mitochondrial genomes of three Pneumocystis species provide new insights into divergence between human and rodent Pneumocystis. PMID:23392351

  7. Structurally divergent lysophosphatidic acid acyltransferases with high selectivity for saturated medium chain fatty acids from Cuphea seeds.

    PubMed

    Kim, Hae Jin; Silva, Jillian E; Iskandarov, Umidjon; Andersson, Mariette; Cahoon, Rebecca E; Mockaitis, Keithanne; Cahoon, Edgar B

    2015-12-01

    Lysophosphatidic acid acyltransferase (LPAT) catalyzes acylation of the sn-2 position on lysophosphatidic acid by an acyl CoA substrate to produce the phosphatidic acid precursor of polar glycerolipids and triacylglycerols (TAGs). In the case of TAGs, this reaction is typically catalyzed by an LPAT2 from microsomal LPAT class A that has high specificity for C18 fatty acids containing Δ9 unsaturation. Because of this specificity, the occurrence of saturated fatty acids in the TAG sn-2 position is infrequent in seed oils. To identify LPATs with variant substrate specificities, deep transcriptomic mining was performed on seeds of two Cuphea species producing TAGs that are highly enriched in saturated C8 and C10 fatty acids. From these analyses, cDNAs for seven previously unreported LPATs were identified, including cDNAs from Cuphea viscosissima (CvLPAT2) and Cuphea avigera var. pulcherrima (CpuLPAT2a) encoding microsomal, seed-specific class A LPAT2s and a cDNA from C. avigera var. pulcherrima (CpuLPATB) encoding a microsomal, seed-specific LPAT from the bacterial-type class B. The activities of these enzymes were characterized in Camelina sativa by seed-specific co-expression with cDNAs for various Cuphea FatB acyl-acyl carrier protein thioesterases (FatB) that produce a variety of saturated medium-chain fatty acids. CvLPAT2 and CpuLPAT2a expression resulted in accumulation of 10:0 fatty acids in the Camelina sativa TAG sn-2 position, indicating a 10:0 CoA specificity that has not been previously described for plant LPATs. CpuLPATB expression generated TAGs with 14:0 at the sn-2 position, but not 10:0. Identification of these LPATs provides tools for understanding the structural basis of LPAT substrate specificity and for generating altered oil functionalities.

  8. The complementary deoxyribonucleic acid sequence of guinea pig endometrial prorelaxin.

    PubMed

    Lee, Y A; Bryant-Greenwood, G D; Mandel, M; Greenwood, F C

    1992-03-01

    The nucleotide sequence of the relaxin gene transcript in the endometrium of the late pregnant guinea pig has been determined. The strategy used was a combination of polymerase chain reaction (PCR) with primers designed from the mRNA sequence of porcine preprorelaxin, rapid amplification of cDNA ends-PCR, and blunt end cloning in M13 mp18. With heterologous primers, a 226-basepair (bp) segment of the guinea pig relaxin gene sequence was obtained and was used to design a guinea pig-specific primer for use with the rapid amplification of cDNA ends-PCR method. The latter allowed completion of the sequence of 336 bp, with a 96-bp overlap. The sequence obtained shows greater homology at both the nucleotide and amino acid levels with porcine and human relaxins H1 and H2 than with rat relaxin, supporting the thesis that the guinea pig is not a rodent. The transcription of the guinea pig endometrial relaxin gene during pregnancy was confirmed by Northern analysis of guinea pig endometrial tissues with a species-specific cDNA probe. The endometrial relaxin gene is transcribed during pregnancy, but not in lactation, consistent with the observed immunostaining for relaxin.

  9. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  10. Profiling serum bile acid glucuronides in humans: gender divergences, genetic determinants and response to fenofibrate

    PubMed Central

    Trottier, Jocelyn; Perreault, Martin; Rudkowska, Iwona; Levy, Cynthia; Dallaire-Theroux, Amélie; Verreault, Mélanie; Caron, Patrick; Staels, Bart; Vohl, Marie-Claude; Straka, Robert J.; Barbier, Olivier

    2014-01-01

    Glucuronidation, catalyzed by UDP-glucuronosyltransferase (UGT) enzymes detoxifies cholestatic bile acids (BAs). We aimed at i) characterizing the circulating BA-glucuronide (-G) pool composition in humans, ii) evaluating how sex and UGT polymorphisms influence this composition, and iii) analyzing the effects of lipid-lowering drug fenofibrate on the circulating BA-G profile in 300 volunteers and 5 cholestatic patients. Eleven BA-Gs were determined in pre- and post-fenofibrate samples. Men exhibited higher BA-G concentrations, and various genotype/BA-G associations were discovered in relevant UGT genes. The chenodeoxycholic acid-3G concentration was associated with the UGT2B7 802C>T polymorphism. Glucuronidation assays confirmed the predominant role of UGT2B7 and UGT1A4 in CDCA-3G formation. Fenofibrate exposure increased the serum levels of 5 BA-G species, including CDCA-3G, and up-regulated expression of UGT1A4, but not UGT2B7, in hepatic cells. This study demonstrates that fenofibrate stimulates BA glucuronidation in humans, and thus reduces bile acid toxicity in the liver. PMID:23756370

  11. Phylogeny and divergence times inferred from rps16 sequence data analyses for Tricyrtis (Liliaceae), an endemic genus of north-east Asia

    PubMed Central

    Hong, Sophia Wan-Pyo; Jury, Stephen L.

    2011-01-01

    Background and aims Tricyrtis is a genus of monocots with attractive and sophisticated flower shapes and colours, endemic to north-east Asia. There are 18 known species. The highly restricted geographical distribution of the genus is of great interest in terms of both abiotic (continental drift) and biotic (long-distance dispersal) impacts on monocot plant speciation events and their timing, and of evolutionary patterns of diversification leading to the extant taxa. The aims of this study were to (i) predict the time of speciation (divergence) events at infraspecific levels of Tricyrtis, (ii) estimate the rate of evolution of the genus and (iii) provide information on an excellent plant model system in terms of studying loss of biodiversity or extinction of organisms in the dynamic earth environment. Methodology To investigate the divergence time and evolution rate of Tricyrtis, Bayesian Markov chain Monte Carlo (MCMC) analyses were performed by calculating the mean branch lengths of evolutionary paths based on base substitution variations between rps16 intron nucleotide sequences from the 18 known species. Principal results Based upon the relaxed molecular clock model test data, a Bayesian phylogenetic inference tree is presented, and the divergence times and rate of evolution of Tricyrtis were estimated. The analyses also suggest that evolution is occurring at the infraspecific level of the genus in a manner that is not strictly clock bound. Conclusions Continental drift may have been the main speciation process giving rise to the current distribution of the taxa of Tricyrtis. The single-locus gene sequence data presented here are a significant step towards an improved future understanding of the molecular evolution of Tricyrtis via multi-locus evaluation. PMID:22476495

  12. Historical Selection, Amino Acid Polymorphism and Lineage-Specific Divergence at the G6pd Locus in Drosophila Melanogaster and D. Simulans

    PubMed Central

    Eanes, W. F.; Kirchner, M.; Yoon, J.; Biermann, C. H.; Wang, I. N.; McCartney, M. A.; Verrelli, B. C.

    1996-01-01

    The nucleotide diversity across 1705 bp of the G6pd gene is studied in 50 Drosophila melanogaster and 12 D. simulans lines. Our earlier report contrasted intraspecific polymorphism and interspecific differences at silent and replacement sites in these species. This report expands the number of European and African lines and examines the pattern of polymorphism with respect to the common A/B allozymes. In D. melanogaster the silent nucleotide diversity varies 2.8-fold across localities. The B allele sequences are two- to fourfold more variable than the derived A allele, and differences between allozymes are twice as among B alleles. There is strong linkage disequilibrium across the G6pd region. In both species the level of silent polymorphism increases from the 5' to 3' ends, while there is no comparable pattern in level of silent site divergence or fixation. The neutral model is not rejected in either species. Using D. yakuba as an outgroup, the D. melanogaster lineage shows a twofold greater rate of silent fixation, but less than half the rate of amino acid replacement. Lineage-specific differences in mutation fixation are inconsistent with neutral expectations and suggest the interaction of species-specific population size differences with both weakly advantageous and deleterious selection. PMID:8913747

  13. Genetic divergence analysis of the Common Barn Owl Tyto alba (Scopoli, 1769) and the Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile using COI sequence

    PubMed Central

    Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita

    2015-01-01

    Abstract In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the World. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from north America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) north America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy. PMID:26668551

  14. Genetic divergence analysis of the Common Barn Owl Tyto alba (Scopoli, 1769) and the Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile using COI sequence.

    PubMed

    Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita

    2015-01-01

    In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the World. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from north America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) north America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy.

  15. Nucleotide divergence vs. gene expression differentiation: comparative transcriptome sequencing in natural isolates from the carrion crow and its hybrid zone with the hooded crow.

    PubMed

    Wolf, Jochen B W; Bayer, Till; Haubold, Bernhard; Schilhabel, Markus; Rosenstiel, Philip; Tautz, Diethard

    2010-03-01

    Recent advances in sequencing technology promise to provide new strategies for studying population differentiation and speciation phenomena in their earliest phases. We focus here on the black carrion crow (Corvus [corone] corone), which forms a zone of hybridization and overlap with the grey coated hooded crow (Corvus [corone] cornix). However, although these semispecies are taxonomically distinct, previous analyses based on several types of genetic markers did not reveal significant molecular differentiation between them. We here corroborate this result with sequence data obtained from a set of 25 nuclear intronic loci. Thus, the system represents a case of a very early phase of species divergence that requires new molecular approaches for its description. We have therefore generated RNAseq expression profiles using barcoded massively parallel pyrosequencing of brain mRNA from six individuals of the carrion crow and five individuals from a hybrid zone with the hooded crow. We obtained 856 675 reads from two runs, with average read length of 270 nt and coverage of 8.44. Reads were assembled de novo into 19 552 contigs, 70% of which could be assigned to annotated genes in chicken and zebra finch. This resulted in a total of 7637 orthologous genes and a core set of 1301 genes that could be compared across all individuals. We find a clear clustering of expression profiles for the pure carrion crow animals and disperse profiles for the animals from the hybrid zone. These results suggest that gene expression differences may indeed be a sensitive indicator of initial species divergence.

  16. Refugial isolation and divergence in the Narrowheaded Gartersnake species complex (Thamnophis rufipunctatus) as revealed by multilocus DNA sequence data

    USGS Publications Warehouse

    Wood, Dustin A.; Vandergast, A.G.; Espinal, A. Lemos; Fisher, R.N.; Holycross, A.T.

    2011-01-01

    Glacial–interglacial cycles of the Pleistocene are hypothesized as one of the foremost contributors to biological diversification. This is especially true for cold-adapted montane species, where range shifts have had a pronounced effect on population-level divergence. Gartersnakes of the Thamnophis rufipunctatus species complex are restricted to cold headwater streams in the highlands of the Sierra Madre Occidental and southwestern USA. We used coalescent and multilocus phylogenetic approaches to test whether genetic diversification of this montane-restricted species complex is consistent with two prevailing models of range fluctuation for species affected by Pleistocene climate changes. Our concatenated nuDNA and multilocus species analyses recovered evidence for the persistence of multiple lineages that are restricted geographically, despite a mtDNA signature consistent with either more recent connectivity (and introgression) or recent expansion (and incomplete lineage sorting). Divergence times estimated using a relaxed molecular clock and fossil calibrations fall within the Late Pleistocene, and zero gene flow scenarios among current geographically isolated lineages could not be rejected. These results suggest that increased climate shifts in the Late Pleistocene have driven diversification and current range retraction patterns and that the differences between markers reflect the stochasticity of gene lineages (i.e. ancestral polymorphism) rather than gene flow and introgression. These results have important implications for the conservation of T. rufipunctatus (sensu novo), which is restricted to two drainage systems in the southwestern US and has undergone a recent and dramatic decline.

  17. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca/sup 2 +/- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B/sub 4/. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the /sup 32/P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A/sub 4/ hydrolase or Ca/sup 2 +/-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approx. = 2700 nucleotides in leukocytes, lung, and placenta.

  18. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  19. Profiling serum bile acid glucuronides in humans: gender divergences, genetic determinants, and response to fenofibrate.

    PubMed

    Trottier, J; Perreault, M; Rudkowska, I; Levy, C; Dallaire-Theroux, A; Verreault, M; Caron, P; Staels, B; Vohl, M-C; Straka, R J; Barbier, O

    2013-10-01

    Glucuronidation, catalyzed by uridine 5'-diphospho-glucuronosyltransferase (UGT) enzymes, detoxifies cholestatic bile acids (BAs). We aimed to (i) characterize the circulating BA-glucuronide (BA-G) pool composition in humans, (ii) determine how sex and UGT polymorphisms influence this composition, and (iii) analyze the effects of the lipid-lowering drug fenofibrate on the circulating BA-G profile in 300 volunteers and 5 cholestatic patients. Eleven BA-Gs were determined in pre- and postfenofibrate samples. Men exhibited higher BA-G concentrations, and various genotype/BA-G associations were discovered in relevant UGT genes. The chenodeoxycholic acid-3G (CDCA-3G) concentration was associated with the UGT2B7 802C>T polymorphism. Glucuronidation assays confirmed the predominant role of UGT2B7 and UGT1A4 in CDCA-3G formation. Fenofibrate exposure increased the serum levels of five BA-G species, including CDCA-3G, and upregulated expression of UGT1A4, but not UGT2B7, in hepatic cells. This study demonstrated that fenofibrate stimulates BA glucuronidation in humans and thus reduces BA toxicity in the liver.

  20. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  1. The amino acid sequence of rabbit cardiac troponin I.

    PubMed Central

    Grand, R J; Wilkinson, J M

    1976-01-01

    The complete amino acid sequence of troponin I from rabbit cardiac muscle was determined by the isolation of four unique CNBr fragments, together with overlapping tryptic peptides containing radioactive methionine residues. Overlap data for residues 35-36, 93-94 and 140-145 are incomplete, the sequence at these positions being based on homology with the sequence of the fast-skeletal-muscle protein. Cardiac troponin I is a single polypeptide chain of 206 residues with mol.wt. 23550 and an extinction coefficient, E 1%,1cm/280, of 4.37. The protein has a net positive charge of 14 and is thus somewhat more basic than troponin I from fast-skeletal muscle. Comparison of the sequences of troponin I from cardiac and fast skeletal muscle show that the cardiac protein has 26 extra residues at the N-terminus which account for the larger size of the protein. In the remainder of sequence there is a considerable degree of homology, this being greater in the C-terminal two-thirds of the molecule. The region in the cardiac protein corresponding to the peptide with inhibitory activity from the fast-skeletal-muscle protein is very similar and it seems unlikely that this is the cause of the difference in inhibitory activity between the two proteins. The region responsible for binding troponin C, however, possesses a lower degree of homology. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50072 (20 pages), at the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1976) 153, 5. PMID:1008822

  2. A case of orthologous sequences of hemocyanin subunits for an evolutionary study of horseshoe crabs: amino acid sequence comparison of immunologically identical subunits of Carcinoscorpius rotundicauda and Tachypleus tridentatus.

    PubMed

    Sugita, H; Shishikura, F

    1995-10-01

    About 83% of the amino acid sequence of hemocyanin subunit HR6 from the Southeast Asian horseshoe crab, Carcinoscorpius rotundicauda, has been determined. There is a difference of about 43% between HR6 and complete sequences of chelicerate hemocyanin subunits from the American horseshoe crab, Limulus polyphemus, and a tarantula, Eurypelma californicum. However, the immunologically identical subunits HR6 and HT6 from Tachypleus tridentatus (Japanese horseshoe crab) show 2.7% sequence difference. Based on the amino acid sequences of HR6 and HT6, the divergence between C. rotundicauda and T. tridentatus occurred about 9.6 million years ago. In the case of horseshoe crab hemocyanin subunits, it seems that the orthologous homologues in many homologous subunits between species are immunologically detectable.

  3. Amino acid sequence of a mouse immunoglobulin mu chain.

    PubMed Central

    Kehry, M; Sibley, C; Fuhrman, J; Schilling, J; Hood, L E

    1979-01-01

    The complete amino acid sequence of the mouse mu chain from the BALB/c myeloma tumor MOPC 104E is reported. The C mu region contains four consecutive homology regions of approximately 110 residues and a COOH-terminal region of 19 residues. A comparison of this mu chain from mouse with a complete mu sequence from human (Ou) and a partial mu chain sequence from dog (Moo) reveals a striking gradient of increasing homology from the NH2-terminal to the COOH-terminal portion of these mu chains, with the former being the least and the latter the most highly conserved. Four of the five sites of carbohydrate attachment appear to be at identical residue positions when the constant regions of the mouse and human mu chains are compared. The mu chain of MOPC 104E has a carbohydrate moiety attached in the second hypervariable region. This is particularly interesting in view of the fact that MOPC 104E binds alpha-(1 leads to 3)-dextran, a simple carbohydrate. The structural and functional constraints imposed by these comparative sequence analyses are discussed. PMID:111247

  4. First next-generation sequencing full-genome characterization of a hepatitis C virus genotype 7 divergent subtype.

    PubMed

    Salmona, M; Caporossi, A; Simmonds, P; Thélu, M-A; Fusillier, K; Mercier-Delarue, S; De Castro, N; LeGoff, J; Chaix, M-L; François, O; Simon, F; Morand, P; Larrat, S; Maylin, S

    2016-11-01

    We report the near-full-length genome sequence of a hepatitis C virus (HCV) isolate from a man originating from Democratic Republic of Congo, the genotype of which could not be determined by the routinely used sequencing technique. The near-complete genome sequence of this variant BAK1 was obtained by the association of two next-generation sequencing technologies. Evolutionary analysis indicates that this isolate, BAK1, could be the first reported strain belonging to a new HCV-7b subtype. This new subtype has been incorrectly identified as genotype 2 by the Versant HCV Genotype 2.0 assay (LiPA). The requirement of three independent isolates has been filled, and a new subtype can be assigned. More examples of HCV-7 are required to better understand its origin, its pathogenicity and its relationship with genotype 2.

  5. Identification of a Divergent Environmental DNA Sequence Clade Using the Phylogeny of Gregarine Parasites (Apicomplexa) from Crustacean Hosts

    PubMed Central

    Rueckert, Sonja; Simdyanov, Timur G.; Aleoshin, Vladimir V.; Leander, Brian S.

    2011-01-01

    Background Environmental SSU rDNA surveys have significantly improved our understanding of microeukaryotic diversity. Many of the sequences acquired using this approach are closely related to lineages previously characterized at both morphological and molecular levels, making interpretation of these data relatively straightforward. Some sequences, by contrast, appear to be phylogenetic orphans and are sometimes inferred to represent “novel lineages” of unknown cellular identity. Consequently, interpretation of environmental DNA surveys of cellular diversity rely on an adequately comprehensive database of DNA sequences derived from identified species. Several major taxa of microeukaryotes, however, are still very poorly represented in these databases, and this is especially true for diverse groups of single-celled parasites, such as gregarine apicomplexans. Methodology/Principal Findings This study attempts to address this paucity of DNA sequence data by characterizing four different gregarine species, isolated from the intestines of crustaceans, at both morphological and molecular levels: Thiriotia pugettiae sp. n. from the graceful kelp crab (Pugettia gracilis), Cephaloidophora cf. communis from two different species of barnacles (Balanus glandula and B. balanus), Heliospora cf. longissima from two different species of freshwater amphipods (Eulimnogammarus verrucosus and E. vittatus), and Heliospora caprellae comb. n. from a skeleton shrimp (Caprella alaskana). SSU rDNA sequences were acquired from isolates of these gregarine species and added to a global apicomplexan alignment containing all major groups of gregarines characterized so far. Molecular phylogenetic analyses of these data demonstrated that all of the gregarines collected from crustacean hosts formed a very strongly supported clade with 48 previously unidentified environmental DNA sequences. Conclusions/Significance This expanded molecular phylogenetic context enabled us to establish a major clade

  6. Sequence and ionomic analysis of divergent strains of maize inbred line B73 with an altered growth phenotype.

    PubMed

    Mascher, Martin; Gerlach, Nina; Gahrtz, Manfred; Bucher, Marcel; Scholz, Uwe; Dresselhaus, Thomas

    2014-01-01

    Maize (Zea mays) is the most widely grown crop species in the world and a classical model organism for plant research. The completion of a high-quality reference genome sequence and the advent of high-throughput sequencing have greatly empowered re-sequencing studies in maize. In this study, plants of maize inbred line B73 descended from two different sets of seed material grown for several generations either in the field or in the greenhouse were found to show a different growth phenotype and ionome under phosphate starvation conditions and moreover a different responsiveness towards mycorrhizal fungi of the species Glomus intraradices (syn: Rhizophagus irregularis). Whole genome re-sequencing of individuals from both sets and comparison to the B73 reference sequence revealed three cryptic introgressions on chromosomes 1, 5 and 10 in the line grown in the greenhouse summing up to a total of 5,257 single-nucleotide polymorphisms (SNPs). Transcriptome sequencing of three individuals from each set lent further support to the location of the introgression intervals and confirmed them to be fixed in all sequenced individuals. Moreover, we identified >120 genes differentially expressed between the two B73 lines. We thus have found a nearly-isogenic line (NIL) of maize inbred line B73 that is characterized by an altered growth phenotype under phosphate starvation conditions and an improved responsiveness towards symbiosis with mycorrhizal fungi. Through next-generation sequencing of the genomes and transcriptomes we were able to delineate exact introgression intervals. Putative de novo mutations appeared approximately uniformly distributed along the ten maize chromosomes mainly representing G:C -> A:T transitions. The plant material described in this study will be a valuable tool both for functional studies of genes differentially expressed in both B73 lines and for research on growth behavior especially in response to symbiosis between maize and mycorrhizal fungi.

  7. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE`s Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  8. Purification of a marsupial insulin: amino-acid sequence of insulin from the eastern grey kangaroo Macropus giganteus.

    PubMed

    Treacy, G B; Shaw, D C; Griffiths, M E; Jeffrey, P D

    1989-03-24

    Insulin has been purified from kangaroo pancreas by acidic ethanol extraction, diethyl ether precipitation and gel filtration. The amino-acid sequence of this, the first marsupial insulin to be studied, is reported. It differs from human insulin by only four amino-acid substitutions, all in regions of the molecule previously known to be variable. However, it should be noted that one of these, asparagine for threonine at A8, has not been reported before. Computer comparisons of all 43 insulin sequences reported to date with kangaroo insulin show it to be most closely related to a group of mammalian insulins (dog, pig, cow, human) known to be of high biological potency. The measurement of blood glucose lowering in the rabbit by kangaroo insulin is consistent with this conclusion. Comparisons of amino-acid sequences of other proteins with their kangaroo counterparts show a greater difference, in line with the time of divergence of marsupials. The limited differences observed in insulin and cytochrome c suggest that their structures need to be closely conserved in order to maintain function.

  9. Molecular cloning and characterization of cDNAs coding for apo-polysialoglycoprotein of rainbow trout eggs. Multiple mRNA species transcribed from multiple genes contain diverged numbers of exact 39-base (13-amino acid) repeats.

    PubMed

    Sorimachi, H; Emori, Y; Kawasaki, H; Kitajima, K; Inoue, S; Suzuki, K; Inoue, Y

    1988-11-25

    Polysialoglycoprotein (PSGP) of unfertilized eggs of rainbow trout (Salmo gairdneri) consists of tandem repeats (about 25) of a glycotridecapeptide, Asp-Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser-Gly (* denotes the attachment site of a polysialoglycan chain) (Kitajima, K., Inoue, Y., and Inoue, S. (1986) J. Biol. Chem. 261, 5262-5269). By using oligodeoxynucleotide probes based on the above sequence, we isolated a genomic clone for apoPSGP which contains 39-base pair repeats (5'-GACGACGCCACCTCTGAAGCT-GCGACCGGCCCGTCTGGC-3') encoding the tridecapeptide. Using a fragment of this genomic DNA as a probe, we next screened a cDNA library constructed with mRNA from immature ovaries of rainbow trout. Nucleotide sequencing analyses of cDNA clones thus obtained revealed that apoPSGP is encoded by multiple mRNA species consisting of diverged numbers (6-32) of the 39-base repeat encoding the tridecapeptide unit and homologous 5'- and 3'-bordering regions. The encoded protein consists of three distinct regions: the N-region consisting of a putative signal peptide and a pro-peptide, the R-region containing diverged numbers of the tandem repeat of 13-amino acid residues, and the C-region with six amino acid residues. Southern blot analysis showed that multiple mRNAs are transcribed from multiple genes for apoPSGP containing diverged numbers of the 39-base pair repeat. Thus, the genes for apoPSGP constitute a multigene family. Expression of the mRNAs is stage and organ specific, i.e. they are expressed only in immature ovaries and not in mature ovaries or in any other organ.

  10. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    PubMed Central

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. Images PMID:6589619

  11. High sequence similarity within ras exons 1 and 2 in different mammalian species and phylogenetic divergence of the ras gene family.

    PubMed

    Watzinger, F; Mayr, B; Haring, E; Lion, T

    1998-03-01

    We have determined the canine and feline N-, K-, and H-ras gene sequences from position +23 to +270 covering exons I and II which contain the mutational hot spot codons 12, 13, and 61. The results were used to assess the degree of similarity between ras gene DNA regions containing the critical domains affected in neoplastic disorders in different mammalian species. The comparative analyses performed included human, canine, feline, murine, rattine, and, whenever possible, bovine, leporine (rabbit), porcelline (guinea pig), and mesocricetine (hamster) ras gene sequences within the region of interest. Comparison of feline and canine nucleotide sequences with the corresponding regions in human DNA revealed a sequence similarity greater than 85% to the human sequence. Contemporaneous analysis of previously published ras DNA sequences from other mammalian species showed a similar degree of homology to human DNA. Most nucleotide differences observed represented synonymous changes without effect on the amino acid sequence of the respective proteins. For assessment of the phylogenetic evolution of ras gene family, a maximum parsimony dendrogram based on multiple sequence alignment of the common region of exons I and II in the N-, K-, and H-ras genes was constructed. Interestingly, a higher substitution rate among the H-ras genes became apparent, indicating accelerated sequence evolution within this particular clade. The most parsimonious tree clearly shows that the duplications giving rise to the three ras genes must have occurred before the mammalian radiation.

  12. Genetic divergence of bradyrhizobium strains nodulating soybeans as revealed by multilocus sequence analysis of genes inside and outside the symbiosis island.

    PubMed

    Zhang, Xing Xing; Guo, Hui Juan; Wang, Rui; Sui, Xin Hua; Zhang, Yan Ming; Wang, En Tao; Tian, Chang Fu; Chen, Wen Xin

    2014-05-01

    The genus Bradyrhizobium has been considered to be a taxonomically difficult group. In this study, phylogenetics and evolutionary genetics analyses were used to investigate divergence levels among Bradyrhizobium strains nodulating soybeans in China. Eleven genospecies were identified by sequence analysis of three phylogenetic and taxonomic markers (SMc00019, thrA, and truA). This was also supported by analyses of eight genes outside the symbiosis island ("off-island" genes; SMc00019, thrA, truA, fabB, glyA, phyR, exoN, and hsfA). However, seven genes inside the symbiosis island ("island" genes; nifA, nifH, nodC, nodV, fixA, trpD, and rhcC2) showed contrasting lower levels of nucleotide diversity and recombination rates than did off-island genes. Island genes had significantly incongruent gene phylogenies compared to the species tree. Four phylogenetic clusters were observed in island genes, and the epidemic cluster IV (harbored by Bradyrhizobium japonicum, Bradyrhizobium diazoefficiens, Bradyrhizobium huanghuaihaiense, Bradyrhizobium liaoningense, Bradyrhizobium daqingense, Bradyrhizobium sp. I, Bradyrhizobium sp. III, and Bradyrhizobium sp. IV) was not found in Bradyrhizobium yuanmingense, Bradyrhizobium sp. II, or Bradyrhizobium elkanii. The gene flow level of island genes among genospecies is discussed in the context of the divergence level of off-island genes.

  13. Testing models of speciation from genome sequences: divergence and asymmetric admixture in Island South-East Asian Sus species during the Plio-Pleistocene climatic fluctuations

    PubMed Central

    Frantz, Laurent A F; Madsen, Ole; Megens, Hendrik-Jan; Groenen, Martien A M; Lohse, Konrad

    2014-01-01

    In many temperate regions, ice ages promoted range contractions into refugia resulting in divergence (and potentially speciation), while warmer periods led to range expansions and hybridization. However, the impact these climatic oscillations had in many parts of the tropics remains elusive. Here, we investigate this issue using genome sequences of three pig (Sus) species, two of which are found on islands of the Sunda-shelf shallow seas in Island South-East Asia (ISEA). A previous study revealed signatures of interspecific admixture between these Sus species (Genome biology,14, 2013, R107). However, the timing, directionality and extent of this admixture remain unknown. Here, we use a likelihood-based model comparison to more finely resolve this admixture history and test whether it was mediated by humans or occurred naturally. Our analyses suggest that interspecific admixture between Sunda-shelf species was most likely asymmetric and occurred long before the arrival of humans in the region. More precisely, we show that these species diverged during the late Pliocene but around 23% of their genomes have been affected by admixture during the later Pleistocene climatic transition. In addition, we show that our method provides a significant improvement over D-statistics which are uninformative about the direction of admixture. PMID:25294645

  14. Assessing genetic divergence in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism markers.

    PubMed

    Zhang, F; Ge, Y Y; Wang, W Y; Shen, X L; Yu, X Y

    2012-12-03

    Conventional hybridization and selection techniques have aided the development of new ornamental crop cultivars. However, little information is available on the genetic divergence of bromeliad hybrids. In the present study, we investigated the genetic variability in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism (SRAP) markers. The morphological analysis showed that the putative hybrids were intermediate between both parental species with respect to inflorescence characteristics. The 16 SRAP primer combinations yield 265 bands, among which 154 (57.72%) were polymorphic. The genetic similarity was an average of 0.59 and ranged from 0.21 to 0.87, indicating moderate genetic divergence among the hybrids. The unweighted pair group method with arithmetic average (UPGMA)-based cluster analysis distinguished the hybrids from their parents with a genetic distance coefficient of 0.54. The cophenetic correlation was 0.93, indicating a good fit between the dendrogram and the original distance matrix. The two-dimensional plot from the principal coordinate analysis showed that the hybrids were intermediately dispersed between both parents, corresponding to the results of the UPGMA cluster and the morphological analysis. These results suggest that SRAP markers could help to identify breeders, characterize F(1) hybrids of bromeliads at an early stage, and expedite genetic improvement of bromeliad cultivars.

  15. Structural gene and complete amino acid sequence of Vibrio alginolyticus collagenase.

    PubMed Central

    Takeuchi, H; Shibano, Y; Morihara, K; Fukushima, J; Inami, S; Keil, B; Gilles, A M; Kawamoto, S; Okuda, K

    1992-01-01

    The DNA encoding the collagenase of Vibrio alginolyticus was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited both collagenase antigen and collagenase activity. The open reading frame from the ATG initiation codon was 2442 bp in length for the collagenase structural gene. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature collagenase consists of 739 amino acids with an Mr of 81875. The amino acid sequences of 20 polypeptide fragments were completely identical with the deduced amino acid sequences of the collagenase gene. The amino acid composition predicted from the DNA sequence was similar to the chemically determined composition of purified collagenase reported previously. The analyses of both the DNA and amino acid sequences of the collagenase gene were rigorously performed, but we could not detect any significant sequence similarity to other collagenases. Images Fig. 2. PMID:1311172

  16. Sequence of cDNAs for mammalian H2A.Z, an evolutionarily diverged but highly conserved basal histone H2A isoprotein species.

    PubMed Central

    Hatch, C L; Bonner, W M

    1988-01-01

    The nucleotide sequences of cDNAs for the evolutionarily diverged but highly conserved basal H2A isoprotein, H2A.Z, have been determined for the rat, cow, and human. As a basal histone, H2A.Z is synthesized throughout the cell cycle at a constant rate, unlinked to DNA replication, and at a much lower rate in quiescent cells. Each of the cDNA isolates encodes the entire H2A.Z polypeptide. The human isolate is about 1.0 kilobases long. It contains a coding region of 387 nucleotides flanked by 106 nucleotides of 5'UTR and 376 nucleotides of 3'UTR, which contains a polyadenylation signal followed by a poly A tail. The bovine and rat cDNAs have 97 and 94% nucleotide positional identity to the human cDNA in the coding region and 98% in the proximal 376 nucleotides of the 3'UTR which includes the polyadenylation signal. A potential stem-forming sequence imbedded in a direct repeat is found centered at 261 nucleotides into the 3'UTR. Each of the cDNA clones could be transcribed and translated in vitro to yield H2A.Z protein. The mammalian H2A.Z cDNA coding sequences are approximately 80% similar to those in chicken and 75% to those in sea urchin. PMID:3344202

  17. cis-acting sequences required for expression of the divergently transcribed Drosophila melanogaster Sgs-7 and Sgs-8 glue protein genes.

    PubMed Central

    Hofmann, A; Garfinkel, M D; Meyerowitz, E M

    1991-01-01

    The Sgs-7 and Sgs-8 glue genes at 68C are divergently transcribed and are separated by 475 bp. Fusion genes with Adh or lacZ coding sequences were constructed, and the expression of these genes, with different amounts of upstream sequences present, was tested by a transient expression procedure and by germ line transformation. A cis-acting element for both genes is located asymmetrically in the intergenic region between -211 and -43 bp relative to Sgs-7. It is required for correct expression of both genes. This element can confer the stage- and tissue-specific expression pattern of glue genes on a heterologous promoter. An 86-bp portion of the element, from -133 to -48 bp relative to Sgs-7, is shown to be capable of enhancing the expression of a truncated and therefore weakly expressed Sgs-3 fusion gene. Recently described common sequence motifs of glue gene regulatory elements (T. Todo, M. Roark, K. Vijay Raghavan, C. A. Mayeda, and E.M. Meyerowitz, Mol. Cell. Biol. 10:5991-6002, 1990) are located within this 86-bp region. PMID:1903838

  18. Sequence of the canine herpesvirus thymidine kinase gene: taxon-preferred amino acid residues in the alphaherpesviral thymidine kinases.

    PubMed

    Rémond, M; Sheldrick, P; Lebreton, F; Foulon, T

    1995-12-01

    Multiple sequence alignments of evolutionarily related proteins are finding increasing use as indicators of critical amino acid residues necessary for structural stability or involved in functional domains responsible for catalytic activities. In the past, a number of alignments have provided such information for the herpesviral thymidine kinases, for which three-dimensional structures are not yet available. We have sequenced the thymidine kinase gene of a canine herpesvirus, and with a multiple alignment have identified amino acids preferentially conserved in either of two taxons, the genera Varicellovirus and Simplexvirus, of the subfamily Alphaherpesvirinae. Since some regions of the thymidine kinases show otherwise elevated levels of substitutional tolerance, these conserved amino acids are candidates for critical residues which have become fixed through selection during the evolutionary divergence of these enzymes. Several pairs with distinctive patterns of distribution among the various viruses occur in or near highly conserved sequence motifs previously proposed to form the catalytic site, and we speculate that they may represent interacting, co-ordinately variable residues.

  19. Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros species (Ebenaceae) endemic to New Caledonia: many species, little divergence

    PubMed Central

    Turner, Barbara; Paun, Ovidiu; Munzinger, Jérôme; Chase, Mark W.; Samuel, Rosabelle

    2016-01-01

    Background and Aims Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species. Methods Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. Key Results The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. Conclusions In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with

  20. cDNA sequence of a human skeletal muscle ADP/ATP translocator: lack of a leader peptide, divergence from a fibroblast translocator cDNA, and coevolution with mitochondrial DNA genes

    SciTech Connect

    Neckelmann, N.; Li, K.; Wade, R.P.; Shuster, R.; Wallace, D.C.

    1987-11-01

    The authors have characterized a 1400-nucleotide cDNA for the human skeletal muscle ADP/ATP translocator. The deduced amino acid sequence is 94% homologous to the beef heart ADP/ATP translocator protein and contains only a single additional amino-terminal methionine. This implies that the human translocator lacks an amino-terminal targeting peptide, a conclusion substantiated by measuring the molecular weight of the protein synthesized in vitro. A 1400-nucleotide transcript encoding the skeletal muscle translocator was detected on blots of total RNA from human heart, kidney, skeletal muscle, and HeLa cells by hybridization with oligonucleotide probes homologous to the coding region and 3' noncoding region of the cDNA. However, the level of this mRNA varied substantially among tissues. Comparison of our skeletal muscle translocator sequence with that of a recently published human fibroblast translocator cognate revealed that the two proteins are 88% identical and diverged about 275 million years ago. Hence, tissues vary both in the level of expression of individual translocator genes and in differential expression of cognate translocator genes. Comparison of the base substitution rates of the ADP/ATP translocator and the oxidative phosphorylation genes encoded by mitochondrial DNA revealed that the mitochondrial DNA genes fix 10 times more synonymous substitutions and 12 times more replacement substitutions; yet, these nuclear and cytoplasmic respiration genes experience comparable evolutionary constraints. This suggest that the mitochondrial DNA genes are highly prone to deleterious mutations.

  1. Structural Variant Detection by Large-scale Sequencing Reveals New Evolutionary Evidence on Breed Divergence between Chinese and European Pigs

    PubMed Central

    Zhao, Pengju; Li, Junhui; Kang, Huimin; Wang, Haifei; Fan, Ziyao; Yin, Zongjun; Wang, Jiafu; Zhang, Qin; Wang, Zhiquan; Liu, Jian-Feng

    2016-01-01

    In this study, we performed a genome-wide SV detection among the genomes of thirteen pigs from diverse Chinese and European originated breeds by next genetation sequencing, and constrcuted a single-nucleotide resolution map involving 56,930 putative SVs. We firstly identified a SV hotspot spanning 35 Mb region on the X chromosome specifically in the genomes of Chinese originated individuals. Further scrutinizing this region by large-scale sequencing data of extra 111 individuals, we obtained the confirmatory evidence on our initial finding. Moreover, thirty five SV-related genes within the hotspot region, being of importance for reproduction ability, rendered significant different evolution rates between Chinese and European originated breeds. The SV hotspot identified herein offers a novel evidence for assessing phylogenetic relationships, as well as likely explains the genetic difference of corresponding phenotypes and features, among Chinese and European pig breeds. Furthermore, we employed various SVs to infer genetic structure of individuls surveyed. We found SVs can clearly detect the difference of genetic background among individuals. This clues us that genome-wide SVs can capture majority of geneic variation and be applied into cladistic analyses. Characterizing whole genome SVs demonstrated that SVs are significantly enriched/depleted with various genomic features. PMID:26729041

  2. Hydrogen Exchange Mass Spectrometry of Related Proteins with Divergent Sequences: A Comparative Study of HIV-1 Nef Allelic Variants

    NASA Astrophysics Data System (ADS)

    Wales, Thomas E.; Poe, Jerrod A.; Emert-Sedlak, Lori; Morgan, Christopher R.; Smithgall, Thomas E.; Engen, John R.

    2016-06-01

    Hydrogen exchange mass spectrometry can be used to compare the conformation and dynamics of proteins that are similar in tertiary structure. If relative deuterium levels are measured, differences in sequence, deuterium forward- and back-exchange, peptide retention time, and protease digestion patterns all complicate the data analysis. We illustrate what can be learned from such data sets by analyzing five variants (Consensus G2E, SF2, NL4-3, ELI, and LTNP4) of the HIV-1 Nef protein, both alone and when bound to the human Hck SH3 domain. Regions with similar sequence could be compared between variants. Although much of the hydrogen exchange features were preserved across the five proteins, the kinetics of Nef binding to Hck SH3 were not the same. These observations may be related to biological function, particularly for ELI Nef where we also observed an impaired ability to downregulate CD4 surface presentation. The data illustrate some of the caveats that must be considered for comparison experiments and provide a framework for investigations of other protein relatives, families, and superfamilies with HX MS.

  3. Nucleic acid (cDNA) and amino acid sequences of the maize endosperm protein glutelin-2.

    PubMed Central

    Prat, S; Cortadas, J; Puigdomènech, P; Palau, J

    1985-01-01

    The cDNA coding for a glutelin-2 protein from maize endosperm has been cloned and the complete amino acid sequence of the protein derived for the first time. An immature maize endosperm cDNA bank was screened for the expression of a beta-lactamase:glutelin-2 (G2) fusion polypeptide by using antibodies against the purified 28 kd G2 protein. A clone corresponding to the 28 kd G2 protein was sequenced and the primary structure of this protein was derived. Five regions can be defined in the protein sequence: an 11 residue N-terminal part, a repeated region formed by eight units of the sequence Pro-Pro-Pro-Val-His-Leu, an alternating Pro-X stretch 21 residues long, a Cys rich domain and a C-terminal part rich in Gln. The protein sequence is preceded by 19 residues which have the characteristics of the signal peptide found in secreted proteins. Unlike zeins, the main maize storage proteins, 28 kd glutelin-2 has several homologous sequences in common with other cereal storage proteins. Images PMID:3839076

  4. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  5. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  6. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  7. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  8. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  9. Human liver apolipoprotein B-100 cDNA: complete nucleic acid and derived amino acid sequence.

    PubMed Central

    Law, S W; Grant, S M; Higuchi, K; Hospattankar, A; Lackner, K; Lee, N; Brewer, H B

    1986-01-01

    Human apolipoprotein B-100 (apoB-100), the ligand on low density lipoproteins that interacts with the low density lipoprotein receptor and initiates receptor-mediated endocytosis and low density lipoprotein catabolism, has been cloned, and the complete nucleic acid and derived amino acid sequences have been determined. ApoB-100 cDNAs were isolated from normal human liver cDNA libraries utilizing immunoscreening as well as filter hybridization with radiolabeled apoB-100 oligodeoxynucleotides. The apoB-100 mRNA is 14.1 kilobases long encoding a mature apoB-100 protein of 4536 amino acids with a calculated amino acid molecular weight of 512,723. ApoB-100 contains 20 potential glycosylation sites, and 12 of a total of 25 cysteine residues are located in the amino-terminal region of the apolipoprotein providing a potential globular structure of the amino terminus of the protein. ApoB-100 contains relatively few regions of amphipathic helices, but compared to other human apolipoproteins it is enriched in beta-structure. The delineation of the entire human apoB-100 sequence will now permit a detailed analysis of the conformation of the protein, the low density lipoprotein receptor binding domain(s), and the structural relationship between apoB-100 and apoB-48 and will provide the basis for the study of genetic defects in apoB-100 in patients with dyslipoproteinemias. PMID:3464946

  10. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    PubMed

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.

  11. Resemblance and Dissemblance of Arabidopsis Type II Peroxiredoxins: Similar Sequences for Divergent Gene Expression, Protein Localization, and Activity1

    PubMed Central

    Bréhélin, Claire; Meyer, Etienne H.; de Souris, Jean-Paul; Bonnard, Géraldine; Meyer, Yves

    2003-01-01

    The Arabidopsis type II peroxiredoxin (PRXII) family is composed of six different genes, five of which are expressed. On the basis of the nucleotide and protein sequences, we were able to define three subgroups among the PRXII family. The first subgroup is composed of AtPRXII-B, -C, and -D, which are highly similar and localized in the cytosol. AtPRXII-B is ubiquitously expressed. More striking is the specific expression of AtPRXII-C and AtPRXII-D localized in pollen. The second subgroup comprises the mitochondrial AtPRXII-F, the corresponding gene of which is expressed constitutively. We show that AtPRXII-E, belonging to the last subgroup, is expressed mostly in reproductive tissues and that its product is addressed to the plastid. By in vitro enzymatic experiments, we demonstrate that glutaredoxin is the electron donor of recombinant AtPRXII-B for peroxidase reaction, but the donors of AtPRXII-E and AtPRXII-F have still to be identified. PMID:12913160

  12. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  13. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  14. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  15. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  16. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  17. Determining divergence times with a protein clock: update and reevaluation

    NASA Technical Reports Server (NTRS)

    Feng, D. F.; Cho, G.; Doolittle, R. F.; Bada, J. L. (Principal Investigator)

    1997-01-01

    A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.

  18. Sequencing of Pax6 Loci from the Elephant Shark Reveals a Family of Pax6 Genes in Vertebrate Genomes, Forged by Ancient Duplications and Divergences

    PubMed Central

    Gautier, Philippe; Loosli, Felix; Tay, Boon-Hui; Tay, Alice; Murdoch, Emma; Coutinho, Pedro; van Heyningen, Veronica; Brenner, Sydney; Venkatesh, Byrappa; Kleinjan, Dirk A.

    2013-01-01

    Pax6 is a developmental control gene essential for eye development throughout the animal kingdom. In addition, Pax6 plays key roles in other parts of the CNS, olfactory system, and pancreas. In mammals a single Pax6 gene encoding multiple isoforms delivers these pleiotropic functions. Here we provide evidence that the genomes of many other vertebrate species contain multiple Pax6 loci. We sequenced Pax6-containing BACs from the cartilaginous elephant shark (Callorhinchus milii) and found two distinct Pax6 loci. Pax6.1 is highly similar to mammalian Pax6, while Pax6.2 encodes a paired-less Pax6. Using synteny relationships, we identify homologs of this novel paired-less Pax6.2 gene in lizard and in frog, as well as in zebrafish and in other teleosts. In zebrafish two full-length Pax6 duplicates were known previously, originating from the fish-specific genome duplication (FSGD) and expressed in divergent patterns due to paralog-specific loss of cis-elements. We show that teleosts other than zebrafish also maintain duplicate full-length Pax6 loci, but differences in gene and regulatory domain structure suggest that these Pax6 paralogs originate from a more ancient duplication event and are hence renamed as Pax6.3. Sequence comparisons between mammalian and elephant shark Pax6.1 loci highlight the presence of short- and long-range conserved noncoding elements (CNEs). Functional analysis demonstrates the ancient role of long-range enhancers for Pax6 transcription. We show that the paired-less Pax6.2 ortholog in zebrafish is expressed specifically in the developing retina. Transgenic analysis of elephant shark and zebrafish Pax6.2 CNEs with homology to the mouse NRE/Pα internal promoter revealed highly specific retinal expression. Finally, morpholino depletion of zebrafish Pax6.2 resulted in a “small eye” phenotype, supporting a role in retinal development. In summary, our study reveals that the pleiotropic functions of Pax6 in vertebrates are served by a divergent

  19. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  20. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  1. Human retroviruses and aids, 1992. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Korber, B.; Berzofsky, J.A.; Pavlakis, G.N.; Smith, R.F.

    1992-10-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) HIV and SIV Nucleotide Sequences; (H) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions below of the parts of the compendium, the user should read the individual introductions for each part.

  2. Divergent evolution and molecular adaptation in the Drosophila odorant-binding protein family: inferences from sequence variation at the OS-E and OS-F genes

    PubMed Central

    2008-01-01

    Background The Drosophila Odorant-Binding Protein (Obp) genes constitute a multigene family with moderate gene number variation across species. The OS-E and OS-F genes are the two phylogenetically closest members of this family in the D. melanogaster genome. In this species, these genes are arranged in the same genomic cluster and likely arose by tandem gene duplication, the major mechanism proposed for the origin of new members in this olfactory-system family. Results We have analyzed the genomic cluster encompassing OS-E and OS-F genes (Obp83 genomic region) to determine the role of the functional divergence and molecular adaptation on the Obp family size evolution. We compared nucleotide and amino acid variation across 18 Drosophila and 4 mosquito species applying a phylogenetic-based maximum likelihood approach complemented with information of the OBP three-dimensional structure and function. We show that, in spite the OS-E and OS-F genes are currently subject to similar and strong selective constraints, they likely underwent divergent evolution. Positive selection was likely involved in the functional diversification of new copies in the early stages after the gene duplication event; moreover, it might have shaped nucleotide variation of the OS-E gene concomitantly with the loss of functionally related members. Besides, molecular adaptation likely affecting the functional OBP conformational changes was supported by the analysis of the evolution of physicochemical properties of the OS-E protein and the location of the putative positive selected amino acids on the OBP three-dimensional structure. Conclusion Our results support that positive selection was likely involved in the functional differentiation of new copies of the OBP multigene family in the early stages after their birth by gene duplication; likewise, it might shape variation of some members of the family concomitantly with the loss of functionally related genes. Thus, the stochastic gene gain

  3. Switchable stereocontrolled divergent synthesis induced by aza-Michael addition of deactivated primary amines under acid catalysis.

    PubMed

    Amara, Z; Drège, E; Troufflard, C; Retailleau, P; Tran Huu-Dau, M-E; Joseph, D

    2014-11-24

    Switchable tandem intramolecular aza-Michael/Michael and double aza-Michael reactions allow the oriented synthesis of highly functionalised cyclic skeletons. Conjugate addition of deactivated anilines triggers chemo- and stereo-divergent ring-closure reaction pathways with a striking selectivity depending on reaction conditions.

  4. Phylogenetic relationships in the mushroom genus Coprinus and dark-spored allies based on sequence data from the nuclear gene coding for the large ribosomal subunit RNA: divergent domains, outgroups, and monophyly.

    PubMed

    Hopple, J S; Vilgalys, R

    1999-10-01

    Phylogenetic relationships were investigated in the mushroom genus Coprinus based on sequence data from the nuclear encoded large-subunit rDNA gene. Forty-seven species of Coprinus and 19 additional species from the families Coprinaceae, Strophariaceae, Bolbitiaceae, Agaricaceae, Podaxaceae, and Montagneaceae were studied. A total of 1360 sites was sequenced across seven divergent domains and intervening sequences. A total of 302 phylogenetically informative characters was found. Ninety-eight percent of the average divergence between taxa was located within the divergent domains, with domains D2 and D8 being most divergent and domains D7 and D10 the least divergent. An empirical test of phylogenetic signal among divergent domains also showed that domains D2 and D3 had the lowest levels of homoplasy. Two equally most parsimonious trees were resolved using Wagner parsimony. A character-state weighted analysis produced 12 equally most parsimonious trees similar to those generated by Wagner parsimony. Phylogenetic analyses employing topological constraints suggest that none of the major taxonomic systems proposed for subgeneric classification is able to completely reflect phylogenetic relationships in Coprinus. A strict consensus integration of the two Wagner trees demonstrates the problematic nature of choosing outgroups within dark-spored mushrooms. The genus Coprinus is found to be polyphyletic and is separated into three distinct clades. Most Coprinus taxa belong to the first two clades, which together form a larger monophyletic group with Lacrymaria and Psathyrella in basal positions. A third clade contains members of Coprinus section Comati as well as the genus Leucocoprinus, Podaxis pistillaris, Montagnea arenaria, and Agaricus pocillator. This third clade is separated from the other species of Coprinus by members of the families Strophariaceae and Bolbitiaceae and the genus Panaeolus.

  5. Genetic characterization of Betacoronavirus lineage C viruses in bats reveals marked sequence divergence in the spike protein of pipistrellus bat coronavirus HKU5 in Japanese pipistrelle: implications for the origin of the novel Middle East respiratory syndrome coronavirus.

    PubMed

    Lau, Susanna K P; Li, Kenneth S M; Tsang, Alan K L; Lam, Carol S F; Ahmed, Shakeel; Chen, Honglin; Chan, Kwok-Hung; Woo, Patrick C Y; Yuen, Kwok-Yung

    2013-08-01

    While the novel Middle East respiratory syndrome coronavirus (MERS-CoV) is closely related to Tylonycteris bat CoV HKU4 (Ty-BatCoV HKU4) and Pipistrellus bat CoV HKU5 (Pi-BatCoV HKU5) in bats from Hong Kong, and other potential lineage C betacoronaviruses in bats from Africa, Europe, and America, its animal origin remains obscure. To better understand the role of bats in its origin, we examined the molecular epidemiology and evolution of lineage C betacoronaviruses among bats. Ty-BatCoV HKU4 and Pi-BatCoV HKU5 were detected in 29% and 25% of alimentary samples from lesser bamboo bat (Tylonycteris pachypus) and Japanese pipistrelle (Pipistrellus abramus), respectively. Sequencing of their RNA polymerase (RdRp), spike (S), and nucleocapsid (N) genes revealed that MERS-CoV is more closely related to Pi-BatCoV HKU5 in RdRp (92.1% to 92.3% amino acid [aa] identity) but is more closely related to Ty-BatCoV HKU4 in S (66.8% to 67.4% aa identity) and N (71.9% to 72.3% aa identity). Although both viruses were under purifying selection, the S of Pi-BatCoV HKU5 displayed marked sequence polymorphisms and more positively selected sites than that of Ty-BatCoV HKU4, suggesting that Pi-BatCoV HKU5 may generate variants to occupy new ecological niches along with its host in diverse habitats. Molecular clock analysis showed that they diverged from a common ancestor with MERS-CoV at least several centuries ago. Although MERS-CoV may have diverged from potential lineage C betacoronaviruses in European bats more recently, these bat viruses were unlikely to be the direct ancestor of MERS-CoV. Intensive surveillance for lineage C betaCoVs in Pipistrellus and related bats with diverse habitats and other animals in the Middle East may fill the evolutionary gap.

  6. Genetic Characterization of Betacoronavirus Lineage C Viruses in Bats Reveals Marked Sequence Divergence in the Spike Protein of Pipistrellus Bat Coronavirus HKU5 in Japanese Pipistrelle: Implications for the Origin of the Novel Middle East Respiratory Syndrome Coronavirus

    PubMed Central

    Lau, Susanna K. P.; Li, Kenneth S. M.; Tsang, Alan K. L.; Lam, Carol S. F.; Ahmed, Shakeel; Chen, Honglin; Chan, Kwok-Hung

    2013-01-01

    While the novel Middle East respiratory syndrome coronavirus (MERS-CoV) is closely related to Tylonycteris bat CoV HKU4 (Ty-BatCoV HKU4) and Pipistrellus bat CoV HKU5 (Pi-BatCoV HKU5) in bats from Hong Kong, and other potential lineage C betacoronaviruses in bats from Africa, Europe, and America, its animal origin remains obscure. To better understand the role of bats in its origin, we examined the molecular epidemiology and evolution of lineage C betacoronaviruses among bats. Ty-BatCoV HKU4 and Pi-BatCoV HKU5 were detected in 29% and 25% of alimentary samples from lesser bamboo bat (Tylonycteris pachypus) and Japanese pipistrelle (Pipistrellus abramus), respectively. Sequencing of their RNA polymerase (RdRp), spike (S), and nucleocapsid (N) genes revealed that MERS-CoV is more closely related to Pi-BatCoV HKU5 in RdRp (92.1% to 92.3% amino acid [aa] identity) but is more closely related to Ty-BatCoV HKU4 in S (66.8% to 67.4% aa identity) and N (71.9% to 72.3% aa identity). Although both viruses were under purifying selection, the S of Pi-BatCoV HKU5 displayed marked sequence polymorphisms and more positively selected sites than that of Ty-BatCoV HKU4, suggesting that Pi-BatCoV HKU5 may generate variants to occupy new ecological niches along with its host in diverse habitats. Molecular clock analysis showed that they diverged from a common ancestor with MERS-CoV at least several centuries ago. Although MERS-CoV may have diverged from potential lineage C betacoronaviruses in European bats more recently, these bat viruses were unlikely to be the direct ancestor of MERS-CoV. Intensive surveillance for lineage C betaCoVs in Pipistrellus and related bats with diverse habitats and other animals in the Middle East may fill the evolutionary gap. PMID:23720729

  7. Completion of the amino acid sequence of the alpha 1 chain from type I calf skin collagen. Amino acid sequence of alpha 1(I)B8.

    PubMed Central

    Glanville, R W; Breitkreutz, D; Meitinger, M; Fietzek, P P

    1983-01-01

    The complete amino acid sequence of the 279-residue CNBr peptide CB8 from the alpha 1 chain of type I calf skin collagen is presented. It was determined by sequencing overlapping fragments of CB8 produced by Staphylococcus aureus V8 proteinase, trypsin, Endoproteinase Arg-C and hydroxylamine. Tryptic cleavages were also made specific for lysine by blocking arginine residues with cyclohexane-1,2-dione. This completes the amino acid sequence analysis of the 1054-residues-long alpha (I) chain of calf skin collagen. PMID:6354180

  8. An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data.

    PubMed Central

    Adzhubei, I A; Adzhubei, A A; Neidle, S

    1998-01-01

    We have constructed a non-homologous database, termed the Integrated Sequence-Structure Database (ISSD) which comprises the coding sequences of genes, amino acid sequences of the corresponding proteins, their secondary structure and straight phi,psi angles assignments, and polypeptide backbone coordinates. Each protein entry in the database holds the alignment of nucleotide sequence, amino acid sequence and the PDB three-dimensional structure data. The nucleotide and amino acid sequences for each entry are selected on the basis of exact matches of the source organism and cell environment. The current version 1.0 of ISSD is available on the WWW at http://www.protein.bio.msu.su/issd/ and includes 107 non-homologous mammalian proteins, of which 80 are human proteins. The database has been used by us for the analysis of synonymous codon usage patterns in mRNA sequences showing their correlation with the three-dimensional structure features in the encoded proteins. Possible ISSD applications include optimisation of protein expression, improvement of the protein structure prediction accuracy, and analysis of evolutionary aspects of the nucleotide sequence-protein structure relationship. PMID:9399866

  9. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor.

  10. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  11. Amino acid sequence of myoglobin from the chiton Liolophura japonica and a phylogenetic tree for molluscan globins.

    PubMed

    Suzuki, T; Furukohri, T; Okamoto, S

    1993-02-01

    Myoglobin was isolated from the radular muscle of the chiton Liolophura japonica, a primitive archigastropodic mollusc. Liolophura contains three monomeric myoglobins (I, II, and III), and the complete amino acid sequence of myoglobin I has been determined. It is composed of 145 amino acid residues, and the molecular mass was calculated to be 16,070 D. The E7 distal histidine, which is replaced by valine or glutamine in several molluscan globins, is conserved in Liolophura myoglobin. The autoxidation rate at physiological conditions indicated that Liolophura oxymyoglobin is fairly stable when compared with other molluscan myoglobins. The amino acid sequence of Liolophura myoglobin shows low homology (11-21%) with molluscan dimeric myoglobins and hemoglobins, but shows higher homology (26-29%) with monomeric myoglobins from the gastropodic molluscs Aplysia, Dolabella, and Bursatella. A phylogenetic tree was constructed from 19 molluscan globin sequences. The tree separated them into two distinct clusters, a cluster for muscle myoglobins and a cluster for erythrocyte or gill hemoglobins. The myoglobin cluster is divided further into two subclusters, corresponding to monomeric and dimeric myoglobins, respectively. Liolophura myoglobin was placed on the branch of monomeric myoglobin lineage, showing that it diverged earlier from other monomeric myoglobins. The hemoglobin cluster is also divided into two subclusters. One cluster contains homodimeric, heterodimeric, tetrameric, and didomain chains of erythrocyte hemoglobins of the blood clams Anadara, Scapharca, and Barbatia. Of special interest is the other subcluster. It consists of three hemoglobin chains derived from the bacterial symbiontharboring clams Calyptogena and Lucina, in which hemoglobins are supposed to play an important role in maintaining the symbiosis with sulfide bacteria.

  12. Trichomonas vaginalis acidic phospholipase A2: isolation and partial amino acid sequence.

    PubMed

    Escobedo-Guajardo, Brenda L; González-Salazar, Francisco; Palacios-Corona, Rebeca; Torres de la Cruz, Víctor M; Morales-Vallarta, Mario; Mata-Cárdenas, Benito D; Garza-González, Jesús N; Rivera-Silva, Gerardo; Vargas-Villarreal, Javier

    2013-12-01

    Sexually transmitted diseases are a major cause of acute disease worldwide, and trichomoniasis is the most common and curable disease, generating more than 170 million cases annually worldwide. Trichomonas vaginalis is the causal agent of trichomoniasis and has the ability to destroy in vitro cell monolayers of the vaginal mucosa, where the phospholipases A2 (PLA2) have been reported as potential virulence factors. These enzymes have been partially characterized from the subcellular fraction S30 of pathogenic T. vaginalis strains. The main objective of this study was to purify a phospholipase A2 from T. vaginalis, make a partial characterization, obtain a partial amino acid sequence, and determine its enzymatic participation as hemolytic factor causing lysis of erythrocytes. Trichomonas S30, RF30 and UFF30 sub-fractions from GT-15 strain have the capacity to hydrolyze [2-(14)C-PA]-PC at pH 6.0. Proteins from the UFF30 sub-fraction were separated by affinity chromatography into two eluted fractions with detectable PLA A2 activity. The EDTA-eluted fraction was analyzed by HPLC using on-line HPLC-tandem mass spectrometry and two protein peaks were observed at 8.2 and 13 kDa. Peptide sequences were identified from the proteins present in the eluted EDTA UFF30 fraction; bioinformatic analysis using Protein Link Global Server charged with T. vaginalis protein database suggests that eluted peptides correspond a putative ubiquitin protein in the 8.2 kDa fraction and a phospholipase preserved in the 13 kDa fraction. The EDTA-eluted fraction hydrolyzed [2-(14)C-PA]-PC lyses erythrocytes from Sprague-Dawley in a time and dose-dependent manner. The acidic hemolytic activity decreased by 84% with the addition of 100 μM of Rosenthal's inhibitor.

  13. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  14. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  15. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities.

  16. The amino acid sequences of the Fd fragments of two human γ heavy chains

    PubMed Central

    Press, E. M.; Hogg, N. M.

    1970-01-01

    The amino acid sequences of the Fd fragments of two human pathological immunoglobulins of the immunoglobulin G1 class are reported. Comparison of the two sequences shows that the heavy-chain variable regions are similar in length to those of the light chains. The existence of heavy chain variable region subgroups is also deduced, from a comparison of these two sequences with those of another γ 1 chain, Eu, a μ chain, Ou, and the partial sequence of a fourth γ 1 chain, Ste. Carbohydrate has been found to be linked to an aspartic acid residue in the variable region of one of the γ 1 chains, Cor. PMID:5449120

  17. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. Images PMID:9279371

  18. The amino acid sequence of goat beta-lactoglobulin.

    PubMed

    Préaux, G; Braunitzer, G; Schrank, B; Stangl, A

    1979-11-01

    The isolation of beta-lactoglobulin from milk of the goat is described. The purified protein was checked for purity and has been characterized by its gross composition and end groups. The native or the modified protein was then degraded by tryptic and cyanogen bromide cleavage. The cleavage products were isolated and sequenced in the sequenator using a Quadrol and propyne program. These data provide the complete sequence of beta-lactoglobulin of the goat. The results are discussed and compared particularly with bovine beta-lactoglobulin components AB. Some biological aspects are described.

  19. Molecular characterization of divergent grapevine Pinot gris virus isolates and their detection in Slovak and Czech grapevines.

    PubMed

    Glasa, Miroslav; Predajňa, Lukáš; Komínek, Petr; Nagyová, Alžbeta; Candresse, Thierry; Olmos, Antonio

    2014-08-01

    Analysis of complete genome sequences of three Slovak isolates of grapevine Pinot gris virus (GPGV) showed their low heterogeneity (reaching 1.7 %) and a close relationship to the Italian NC_015782 isolate (4.2-4.5 % divergence). Comparison of Slovak and Italian isolates revealed an unusual accumulation of 21 indel mutations in ORF1, resulting in a localized high divergence in the encoded amino acid sequences. An elevated divergence in the 5' extremity of the GPGV genomes is suggestive of a recombination between Slovak isolates and grapevine berry inner necrosis virus. RT-PCR allowed the frequent detection of closely related GPGV isolates in grapevines from Slovakia and the Czech Republic.

  20. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    PubMed

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tamdem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  1. Synthesis of gamma,delta-unsaturated glycolic acids via sequenced brook and Ireland--claisen rearrangements.

    PubMed

    Schmitt, Daniel C; Johnson, Jeffrey S

    2010-03-05

    Organozinc, -magnesium, and -lithium nucleophiles initiate a Brook/Ireland-Claisen rearrangement sequence of allylic silyl glyoxylates resulting in the formation of gamma,delta-unsaturated alpha-silyloxy acids.

  2. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  3. Human low-Mr kininogen contains three copies of a cystatin sequence that are divergent in structure and in inhibitory activity for cysteine proteinases.

    PubMed Central

    Salvesen, G; Parkes, C; Abrahamson, M; Grubb, A; Barrett, A J

    1986-01-01

    We point out that human low-Mr kininogen contains three cystatin-like sequences, rather than two, as had previously been thought. The protein was purified by affinity chromatography on carboxymethyl-papain-Sepharose, and subjected to limited proteolysis by trypsin and chymotrypsin. Fragments were isolated, and three corresponding to the individual cystatin-like domains were identified. By comparison with the known amino acid sequence of the protein they were numbered 1 to 3 from the N-terminus. Domain 1 was not found to have any inhibitory activity for cysteine proteinases, which is consistent with the absence of residues that are highly conserved in inhibitors of the cystatin superfamily, and have previously been suggested to be essential for activity. Domain 2 was a good inhibitor of chicken calpain, and also papain and cathepsin L. Domain 3 showed negligible inhibition of calpain, but inhibited papain and cathepsin L strongly. The probable arrangement of disulphide bonds in the heavy chain of low-Mr kininogen is deduced from the homology with the cystatins and other evidence contained in the present paper. Images Fig. 2. PMID:3521586

  4. Contrasting responses of root morphology and root-exuded organic acids to low phosphorus availability in three important food crops with divergent root traits

    PubMed Central

    Wang, Yan-Liang; Almvik, Marit; Clarke, Nicholas; Eich-Greatorex, Susanne; Øgaard, Anne Falk; Krogstad, Tore; Lambers, Hans; Clarke, Jihong Liu

    2015-01-01

    Phosphorus (P) is an important element for crop productivity and is widely applied in fertilizers. Most P fertilizers applied to land are sorbed onto soil particles, so research on improving plant uptake of less easily available P is important. In the current study, we investigated the responses in root morphology and root-exuded organic acids (OAs) to low available P (1 μM P) and sufficient P (50 μM P) in barley, canola and micropropagated seedlings of potato—three important food crops with divergent root traits, using a hydroponic plant growth system. We hypothesized that the dicots canola and tuber-producing potato and the monocot barley would respond differently under various P availabilities. WinRHIZO and liquid chromatography triple quadrupole mass spectrometry results suggested that under low P availability, canola developed longer roots and exhibited the fastest root exudation rate for citric acid. Barley showed a reduction in root length and root surface area and an increase in root-exuded malic acid under low-P conditions. Potato exuded relatively small amounts of OAs under low P, while there was a marked increase in root tips. Based on the results, we conclude that different crops show divergent morphological and physiological responses to low P availability, having evolved specific traits of root morphology and root exudation that enhance their P-uptake capacity under low-P conditions. These results could underpin future efforts to improve P uptake of the three crops that are of importance for future sustainable crop production. PMID:26286222

  5. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    PubMed

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  6. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) comparing with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly).

  7. Single-chain structure of human ceruloplasmin: the complete amino acid sequence of the whole molecule.

    PubMed Central

    Takahashi, N; Ortel, T L; Putnam, F W

    1984-01-01

    We have determined the amino acid sequence of the amino-terminal 67,000-dalton (67-kDa) fragment of human ceruloplasmin and have established overlapping sequences between the 67-kDa and 50-kDa fragments and between the 50-kDa and 19-kDa fragments. The 67-kDa fragment contains 480 amino acid residues and three glucosamine oligosaccharides. These results together with our previous sequence data for the 50-kDa and 19-kDa fragments complete the amino acid sequence of human ceruloplasmin. The polypeptide chain has a total of 1,046 amino acid residues (Mr 120,085) and has attachment sites for four glucosamine oligosaccharides; together these account for the total molecular mass of human ceruloplasmin (132 kDa). The sequence analysis of the peptides overlapping the fragments showed that one additional amino acid, arginine, is present between the 67-kDa and 50-kDa fragments, and another, lysine, is between the 50-kDa and 19-kDa fragments. Only two apparent sites of amino acid interchange have been identified in the polypeptide chain. Both involve a single-point interchange of glycine and lysine that would result in a difference in charge. The results of the complete sequence analysis verified that human ceruloplasmin is composed of a single polypeptide chain and that the subunit-like fragments are produced by proteolytic cleavage during purification (and possibly also in vivo). PMID:6582496

  8. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    PubMed Central

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  9. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    PubMed

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  10. Theoretical foundations for quantitative paleogenetics. III - The molecular divergence of nucleic acids and proteins for the case of genetic events of unequal probability

    NASA Technical Reports Server (NTRS)

    Holmquist, R.; Pearl, D.

    1980-01-01

    Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed based replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.

  11. Total synthesis of dihydrolysergic acid and dihydrolysergol: development of a divergent synthetic strategy applicable to rapid assembly of D-ring analogs

    PubMed Central

    Lee, Kiyoun; Poudel, Yam B.; Glinkerman, Christopher M.; Boger, Dale L.

    2015-01-01

    The total syntheses of dihydrolysergic acid and dihydrolysergol are detailed based on a Pd(0)-catalyzed intramolecular Larock indole cyclization for the preparation of the embedded tricyclic indole (ABC ring system) and a subsequent powerful inverse electron demand Diels–Alder reaction of 5-carbomethoxy-1,2,3-triazine with a ketone-derived enamine for the introduction of a functionalized pyridine, serving as the precursor for a remarkably diastereoselective reduction to the N-methylpiperidine D-ring. By design, the use of the same ketone-derived enamine and a set of related complementary heterocyclic azadiene [4 + 2] cycloaddition reactions permitted the late stage divergent preparation of a series of alternative heterocyclic derivatives not readily accessible by more conventional approaches. PMID:26273113

  12. SETG: Nucleic Acid Extraction and Sequencing for In Situ Life Detection on Mars

    NASA Astrophysics Data System (ADS)

    Mojarro, A.; Hachey, J.; Tani, J.; Smith, A.; Bhattaru, S. A.; Pontefract, A.; Doebler, R.; Brown, M.; Ruvkun, G.; Zuber, M. T.; Carr, C. E.

    2016-10-01

    We are developing an integrated nucleic acid extraction and sequencing instrument: the Search for Extra-Terrestrial Genomes (SETG) for in situ life detection on Mars. Our goals are to identify related or unrelated nucleic acid-based life on Mars.

  13. Draft Genome Sequence of Cyanobacterium sp. Strain IPPAS B-1200 with a Unique Fatty Acid Composition

    PubMed Central

    Starikov, Alexander Y.; Usserbaeva, Aizhan A.; Sinetova, Maria A.; Sarsekeyeva, Fariza K.; Zayadan, Bolatkhan K.; Ustinova, Vera V.; Kupriyanova, Elena V.; Los, Dmitry A.

    2016-01-01

    Here, we report the draft genome of Cyanobacterium sp. IPPAS strain B-1200, isolated from Lake Balkhash, Kazakhstan, and characterized by the unique fatty acid composition of its membrane lipids, which are enriched with myristic and myristoleic acids. The approximate genome size is 3.4 Mb, and the predicted number of coding sequences is 3,119. PMID:27856596

  14. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    PubMed

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  15. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    PubMed

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  16. Using intron sequence comparisons in the triose-phosphate isomerase gene to study the divergence of the fall armyworm host strains

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Noctuid moth, Spodoptera frugiperda (the fall armyworm), is endemic to the Western Hemisphere and appears to be undergoing sympatric speciation to produce two subpopulations that differ in their choice of host plants. The diverging “rice strain” and “corn strain” are morphologically indistinguis...

  17. Purification, characterization and partial amino acid sequence of glycogen synthase from Saccharomyces cerevisiae.

    PubMed Central

    Carabaza, A; Arino, J; Fox, J W; Villar-Palasi, C; Guinovart, J J

    1990-01-01

    Glycogen synthase from Saccharomyces cerevisiae was purified to homogeneity. The enzyme showed a subunit molecular mass of 80 kDa. The holoenzyme appears to be a tetramer. Antibodies developed against purified yeast glycogen synthase inactivated the enzyme in yeast extracts and allowed the detection of the protein in Western blots. Amino acid analysis showed that the enzyme is very rich in glutamate and/or glutamine residues. The N-terminal sequence (11 amino acid residues) was determined. In addition, selected tryptic-digest peptides were purified by reverse-phase h.p.l.c. and submitted to gas-phase sequencing. Up to eight sequences (79 amino acid residues) could be aligned with the human muscle enzyme sequence. Levels of identity range between 37 and 100%, indicating that, although human and yeast glycogen synthases probably share some conserved regions, significant differences in their primary structure should be expected. Images Fig. 1. Fig. 2. Fig. 3. PMID:2114092

  18. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    PubMed

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  19. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations.

    PubMed

    Abascal, Federico; Zardoya, Rafael; Telford, Maximilian J

    2010-07-01

    We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.

  20. Amino acid sequence of homologous rat atrial peptides: natriuretic activity of native and synthetic forms.

    PubMed Central

    Seidah, N G; Lazure, C; Chrétien, M; Thibault, G; Garcia, R; Cantin, M; Genest, J; Nutt, R F; Brady, S F; Lyle, T A

    1984-01-01

    A substance called atrial natriuretic factor (ANF), localized in secretory granules of atrial cardiocytes, was isolated as four homologous natriuretic peptides from homogenates of rat atria. The complete sequence of the longest form showed that it is composed of 33 amino acids. The three other shorter forms (2-33, 3-33, and 8-33) represent amino-terminally truncated versions of the 33 amino acid parent molecule as shown by analysis of sequence, amino acid composition, or both. The proposed primary structure agrees entirely with the amino acid composition and reveals no significant sequence homology with any known protein or segment of protein. The short form ANF-(8-33) was synthesized by a multi-fragment condensation approach and the synthetic product was shown to exhibit specific activity comparable to that of the natural ANF-(3-33). PMID:6232612

  1. Nucleotide and deduced amino acid sequences of a new subtilisin from an alkaliphilic Bacillus isolate.

    PubMed

    Saeki, Katsuhisa; Magallones, Marietta V; Takimura, Yasushi; Hatada, Yuji; Kobayashi, Tohru; Kawai, Shuji; Ito, Susumu

    2003-10-01

    The gene for a new subtilisin from the alkaliphilic Bacillus sp. KSM-LD1 was cloned and sequenced. The open reading frame of the gene encoded a 97 amino-acid prepro-peptide plus a 307 amino-acid mature enzyme that contained a possible catalytic triad of residues, Asp32, His66, and Ser224. The deduced amino acid sequence of the mature enzyme (LD1) showed approximately 65% identity to those of subtilisins SprC and SprD from alkaliphilic Bacillus sp. LG12. The amino acid sequence identities of LD1 to those of previously reported true subtilisins and high-alkaline proteases were below 60%. LD1 was characteristically stable during incubation with surfactants and chemical oxidants. Interestingly, an oxidizable Met residue is located next to the catalytic Ser224 of the enzyme as in the cases of the oxidation-susceptible subtilisins reported to date.

  2. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    PubMed

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  3. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A)tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  4. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    PubMed

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  5. Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

    PubMed Central

    Sinclair, Robert M.; Ravantti, Janne J.

    2017-01-01

    ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids

  6. Classification of mouse VK groups based on the partial amino acid sequence to the first invariant tryptophan: impact of 14 new sequences from IgG myeloma proteins.

    PubMed

    Potter, M; Newell, J B; Rudikoff, S; Haber, E

    1982-12-01

    Fourteen new VK sequences derived from BALB/c IgG myeloma proteins were determined to the first invariant tryptophan (Trp 35). These partial sequences were compared with 65 other published VK sequences using a computer program. The 79 sequences were organized according to the length of the sequence from the amino terminus to the first invariant tryptophan (Trp 35), into seven groups (33, 34, 35, 36, 39, 40 and 41aa). A distance matrix of all 79 sequences was then computed, i.e. the number of amino acid substitutions necessary to convert one sequence to another was determined. From these data a dendrogram was constructed. Most of the VK sequences fell into clusters or closely related groups. The definition of a sequence group is arbitrary but facilitates the classification of VK proteins. We used 12 substitutions as the basis for defining a sequence group based on the known number of substitutions that are found in the VK21 proteins. By this criterion there were 18 groups in the Trp 35 dendrogram. Twelve of the 14 new sequences fell into one of these sequence groups; two formed new sequence groups. Collective amino acid sequencing is still encountering new VK structures indicating more sequences will be required to attain an accurate estimate of the total number of VK groups. Updated dendrograms can be quickly generated to include newly generated sequences.

  7. Sequence space and the ongoing expansion of the protein universe.

    PubMed

    Povolotskaya, Inna S; Kondrashov, Fyodor A

    2010-06-17

    The need to maintain the structural and functional integrity of an evolving protein severely restricts the repertoire of acceptable amino-acid substitutions. However, it is not known whether these restrictions impose a global limit on how far homologous protein sequences can diverge from each other. Here we explore the limits of protein evolution using sequence divergence data. We formulate a computational approach to study the rate of divergence of distant protein sequences and measure this rate for ancient proteins, those that were present in the last universal common ancestor. We show that ancient proteins are still diverging from each other, indicating an ongoing expansion of the protein sequence universe. The slow rate of this divergence is imposed by the sparseness of functional protein sequences in sequence space and the ruggedness of the protein fitness landscape: approximately 98 per cent of sites cannot accept an amino-acid substitution at any given moment but a vast majority of all sites may eventually be permitted to evolve when other, compensatory, changes occur. Thus, approximately 3.5 x 10(9) yr has not been enough to reach the limit of divergent evolution of proteins, and for most proteins the limit of sequence similarity imposed by common function may not exceed that of random sequences.

  8. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  9. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  10. Amino acid sequence around the active-site serine residue in the acyltransferase domain of goat mammary fatty acid synthetase.

    PubMed Central

    Mikkelsen, J; Højrup, P; Rasmussen, M M; Roepstorff, P; Knudsen, J

    1985-01-01

    Goat mammary fatty acid synthetase was labelled in the acyltransferase domain by formation of O-ester intermediates by incubation with [1-14C]acetyl-CoA and [2-14C]malonyl-CoA. Tryptic-digest and CNBr-cleavage peptides were isolated and purified by high-performance reverse-phase and ion-exchange liquid chromatography. The sequences of the malonyl- and acetyl-labelled peptides were shown to be identical. The results confirm the hypothesis that both acetyl and malonyl groups are transferred to the mammalian fatty acid synthetase complex by the same transferase. The sequence is compared with those of other fatty acid synthetase transferases. PMID:3922356

  11. Ligation with nucleic acid sequence-based amplification.

    PubMed

    Ong, Carmichael; Tai, Warren; Sarma, Aartik; Opal, Steven M; Artenstein, Andrew W; Tripathi, Anubhav

    2012-01-01

    This work presents a novel method for detecting nucleic acid targets using a ligation step along with an isothermal, exponential amplification step. We use an engineered ssDNA with two variable regions on the ends, allowing us to design the probe for optimal reaction kinetics and primer binding. This two-part probe is ligated by T4 DNA Ligase only when both parts bind adjacently to the target. The assay demonstrates that the expected 72-nt RNA product appears only when the synthetic target, T4 ligase, and both probe fragments are present during the ligation step. An extraneous 38-nt RNA product also appears due to linear amplification of unligated probe (P3), but its presence does not cause a false-positive result. In addition, 40 mmol/L KCl in the final amplification mix was found to be optimal. It was also found that increasing P5 in excess of P3 helped with ligation and reduced the extraneous 38-nt RNA product. The assay was also tested with a single nucleotide polymorphism target, changing one base at the ligation site. The assay was able to yield a negative signal despite only a single-base change. Finally, using P3 and P5 with longer binding sites results in increased overall sensitivity of the reaction, showing that increasing ligation efficiency can improve the assay overall. We believe that this method can be used effectively for a number of diagnostic assays.

  12. Thin-film technology for direct visual detection of nucleic acid sequences: applications in clinical research.

    PubMed

    Jenison, Robert D; Bucala, Richard; Maul, Diana; Ward, David C

    2006-01-01

    Certain optical conditions permit the unaided eye to detect thickness changes on surfaces on the order of 20 A, which are of similar dimensions to monomolecular interactions between proteins or hybridization of complementary nucleic acid sequences. Such detection exploits specific interference of reflected white light, wherein thickness changes are perceived as surface color changes. This technology, termed thin-film detection, allows for the visualization of subattomole amounts of nucleic acid targets, even in complex clinical samples. Thin-film technology has been applied to a broad range of clinically relevant indications, including the detection of pathogenic bacterial and viral nucleic acid sequences and the discrimination of sequence variations in human genes causally related to susceptibility or severity of disease.

  13. Four plant defensins from an indigenous South African Brassicaceae species display divergent activities against two test pathogens despite high sequence similarity in the encoding genes

    PubMed Central

    2011-01-01

    Background Plant defensins are an important component of the innate defence system of plants where they form protective antimicrobial barriers between tissue types of plant organs as well as around seeds. These peptides also have other activities that are important for agricultural applications as well as the medical sector. Amongst the numerous plant peptides isolated from a variety of plant species, a significant number of promising defensins have been isolated from Brassicaceae species. Here we report on the isolation and characterization of four defensins from Heliophila coronopifolia, a native South African Brassicaceae species. Results Four defensin genes (Hc-AFP1-4) were isolated with a homology based PCR strategy. Analysis of the deduced amino acid sequences showed that the peptides were 72% similar and grouped closest to defensins isolated from other Brassicaceae species. The Hc-AFP1 and 3 peptides shared high homology (94%) and formed a unique grouping in the Brassicaceae defensins, whereas Hc-AFP2 and 4 formed a second homology grouping with defensins from Arabidopsis and Raphanus. Homology modelling showed that the few amino acids that differed between the four peptides had an effect on the surface properties of the defensins, specifically in the alpha-helix and the loop connecting the second and third beta-strands. These areas are implicated in determining differential activities of defensins. Comparing the activities after recombinant production of the peptides, Hc-AFP2 and 4 had IC50 values of 5-20 μg ml-1 against two test pathogens, whereas Hc-AFP1 and 3 were less active. The activity against Botrytis cinerea was associated with membrane permeabilization, hyper-branching, biomass reduction and even lytic activity. In contrast, only Hc-AFP2 and 4 caused membrane permeabilization and severe hyper-branching against the wilting pathogen Fusarium solani, while Hc-AFP1 and 3 had a mild morphogenetic effect on the fungus, without any indication of

  14. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  15. RNA internal standard synthesis by nucleic acid sequence-based amplification for competitive quantitative amplification reactions.

    PubMed

    Lo, Wan-Yu; Baeumner, Antje J

    2007-02-15

    Nucleic acid sequence-based amplification (NASBA) reactions have been demonstrated to successfully synthesize new sequences based on deletion and insertion reactions. Two RNA internal standards were synthesized for use in competitive amplification reactions in which quantitative analysis can be achieved by coamplifying the internal standard with the wild type sample. The sequences were created in two consecutive NASBA reactions using the E. coli clpB mRNA sequence as model analyte. The primer sequences of the wild type sequence were maintained, and a 20-nt-long segment inside the amplicon region was exchanged for a new segment of similar GC content and melting temperature. The new RNA sequence was thus amplifiable using the wild type primers and detectable via a new inserted sequence. In the first reaction, the forwarding primer and an additional 20-nt-long sequence was deleted and replaced by a new 20-nt-long sequence. In the second reaction, a forwarding primer containing as 5' overhang sequence the wild type primer sequence was used. The presence of pure internal standard was verified using electrochemiluminescence and RNA lateral-flow biosensor analysis. Additional sequence deletion in order to shorten the internal standard amplicons and thus generate higher detection signals was found not to be required. Finally, a competitive NASBA reaction between one internal standard and the wild type sequence was carried out proving its functionality. This new rapid construction method via NASBA provides advantages over the traditional techniques since it requires no traditional cloning procedures, no thermocyclers, and can be completed in less than 4 h.

  16. The amino acid sequences of two alpha chains of hemoglobins from Komodo dragon Varanus komodoensis and phylogenetic relationships of amniotes.

    PubMed

    Fushitani, K; Higashiyama, K; Moriyama, E N; Imai, K; Hosokawa, K

    1996-09-01

    To elucidate phylogenetic relationships among amniotes and the evolution of alpha globins, hemoglobins were analyzed from the Komodo dragon (Komodo monitor lizard) Varanus komodoensis, the world's largest extant lizard, inhabiting Komodo Islands, Indonesia. Four unique globin chains (alpha A, alpha D, beta B, and beta C) were isolated in an equal molar ratio by high performance liquid chromatography from the hemolysate. The amino acid sequences of two alpha chains were determined. The alpha D chain has a glutamine at E7 as does an alpha chain of a snake, Liophis miliaris, but the alpha A chain has a histidine at E7 like the majority of hemoglobins. Phylogenetic analyses of 19 globins including two alpha chains of Komodo dragon and ones from representative amniotes showed the following results: (1) The a chains of squamates (snakes and lizards), which have a glutamine at E7, are clustered with the embryonic alpha globin family, which typically includes the alpha D chain from birds; (2) birds form a sister group with other reptiles but not with mammals; (3) the genes for embryonic and adult types of alpha globins were possibly produced by duplication of the ancestral alpha gene before ancestral amniotes diverged, indicating that each of the present amniotes might carry descendants of the two types of alpha globin genes; (4) squamates first split off from the ancestor of other reptiles and birds.

  17. Direct Comparison of Mice Null for Liver or Intestinal Fatty Acid-binding Proteins Reveals Highly Divergent Phenotypic Responses to High Fat Feeding*

    PubMed Central

    Gajda, Angela M.; Zhou, Yin Xiu; Agellon, Luis B.; Fried, Susan K.; Kodukula, Sarala; Fortson, Walter; Patel, Khamoshi; Storch, Judith

    2013-01-01

    The enterocyte expresses two fatty acid-binding proteins (FABP), intestinal FABP (IFABP; FABP2) and liver FABP (LFABP; FABP1). LFABP is also expressed in liver. Despite ligand transport and binding differences, it has remained uncertain whether these intestinally coexpressed proteins, which both bind long chain fatty acids (FA), are functionally distinct. Here, we directly compared IFABP−/− and LFABP−/− mice fed high fat diets containing long chain saturated or unsaturated fatty acids, reasoning that providing an abundance of dietary lipid would reveal unique functional properties. The results showed that mucosal lipid metabolism was indeed differentially modified, with significant decreases in FA incorporation into triacylglycerol (TG) relative to phospholipid (PL) in IFABP−/− mice, whereas LFABP−/− mice had reduced monoacylglycerol incorporation in TG relative to PL, as well as reduced FA oxidation. Interestingly, striking differences were found in whole body energy homeostasis; LFABP−/− mice fed high fat diets became obese relative to WT, whereas IFABP−/− mice displayed an opposite, lean phenotype. Fuel utilization followed adiposity, with LFABP−/− mice preferentially utilizing lipids, and IFABP−/− mice preferentially metabolizing carbohydrate for energy production. Changes in body weight and fat may arise, in part, from altered food intake; mucosal levels of the endocannabinoids 2-arachidonoylglycerol and arachidonoylethanolamine were elevated in LFABP−/−, perhaps contributing to increased energy intake. This direct comparison provides evidence that LFABP and IFABP have distinct roles in intestinal lipid metabolism; differential intracellular functions in intestine and in liver, for LFABP−/− mice, result in divergent downstream effects at the systemic level. PMID:23990461

  18. Amino acid sequences of two nonspecific lipid-transfer proteins from germinated castor bean.

    PubMed

    Takishima, K; Watanabe, S; Yamada, M; Suga, T; Mamiya, G

    1988-11-01

    The amino acid sequence of two nonspecific lipid-transfer proteins (nsLTP) B and C from germinated castor bean seeds have been determined. Both the proteins consist of 92 residues, as for nsLTP previously reported, and their calculated Mr values are 9847 and 9593 for nsLTP-B and nsLTP-C, respectively. The sequences of nsLTP-B and nsLTP-C, compared to the known sequence of nsLTP-A from the same source, are 68% and 35% similar, respectively. No variation was found at the positions of the cysteine residues, indicating that they might be involved in disulfide bridges.

  19. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  20. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species of which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocite enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences.

  1. Complete amino acid sequence of the N-terminal extension of calf skin type III procollagen.

    PubMed Central

    Brandt, A; Glanville, R W; Hörlein, D; Bruckner, P; Timpl, R; Fietzek, P P; Kühn, K

    1984-01-01

    The N-terminal extension peptide of type III procollagen, isolated from foetal-calf skin, contains 130 amino acid residues. To determine its amino acid sequence, the peptide was reduced and carboxymethylated or aminoethylated and fragmented with trypsin, Staphylococcus aureus V8 proteinase and bacterial collagenase. Pyroglutamate aminopeptidase was used to deblock the N-terminal collagenase fragment to enable amino acid sequencing. The type III collagen extension peptide is homologous to that of the alpha 1 chain of type I procollagen with respect to a three-domain structure. The N-terminal 79 amino acids, which contain ten of the 12 cysteine residues, form a compact globular domain. The next 39 amino acids are in a collagenase triplet sequence (Gly- Xaa - Yaa )n with a high hydroxyproline content. Finally, another short non-collagenous domain of 12 amino acids ends at the cleavage site for procollagen aminopeptidase, which cleaves a proline-glutamine bond. In contrast with type I procollagen, the type III procollagen extension peptides contain interchain disulphide bridges located at the C-terminus of the triple-helical domain. PMID:6331392

  2. Divergence of cuticular hydrocarbons in two sympatric grasshopper species and the evolution of fatty acid synthases and elongases across insects

    PubMed Central

    Finck, Jonas; Berdan, Emma L.; Mayer, Frieder; Ronacher, Bernhard; Geiselhardt, Sven

    2016-01-01

    Cuticular hydrocarbons (CHCs) play a major role in the evolution of reproductive isolation between insect species. The CHC profiles of two closely related sympatric grasshopper species, Chorthippus biguttulus and C. mollis, differ mainly in the position of the first methyl group in major methyl-branched CHCs. The position of methyl branches is determined either by a fatty acid synthase (FAS) or by elongases. Both protein families showed an expansion in insects. Interestingly, the FAS family showed several lineage-specific expansions, especially in insect orders with highly diverse methyl-branched CHC profiles. We found five putative FASs and 12 putative elongases in the reference transcriptomes for both species. A dN/dS test showed no evidence for positive selection acting on FASs and elongases in these grasshoppers. However, one candidate FAS showed species-specific transcriptional differences and may contribute to the shift of the methyl-branch position between the species. In addition, transcript levels of four elongases were expressed differentially between the sexes. Our study indicates that complex methyl-branched CHC profiles are linked to an expansion of FASs genes, but that species differences can also mediated at the transcriptional level. PMID:27677406

  3. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  4. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  5. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  6. Complete amino acid sequence of branched-chain amino acid aminotransferase (transaminase B) of Salmonella typhimurium, identification of the coenzyme-binding site and sequence comparison analysis

    SciTech Connect

    Feild, M.J.

    1988-01-01

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase of Salmonella typhimurium was determined by automated Edman degradation of peptide fragments generated by chemical and enzymatic digestion of S-carboxymethylated and S-pyridylethylated transaminase B. Peptide fragments of transaminase B were generated by treatment of the enzyme with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. Protocols were developed for separation of the peptide fragments by reverse-phase high performance liquid chromatography (HPLC), ion-exchange HPLC, and SDS-urea gel electrophoresis. The enzyme subunit contains 308 amino acid residues and has a molecular weight of 33,920 daltons. The coenzyme-binding site was determined by treatment of the enzyme, containing bound pyridoxal 5-phosphate, with tritiated sodium borohydride prior to trypsin digestion. Monitoring radioactivity incorporation and peptide map comparisons with an apoenzyme tryptic digest, allowed identification of the pyridoxylated-peptide which was isolated by reverse-phase HPLC and sequenced. The coenzyme-binding site is a lysyl residue at position 159. Some peptides were further characterized by fast atom bombardment mass spectrometry.

  7. Unifying bacteria from decaying wood with various ubiquitous Gibbsiella species as G. acetica sp. nov. based on nucleotide sequence similarities and their acetic acid secretion.

    PubMed

    Geider, Klaus; Gernold, Marina; Jock, Susanne; Wensing, Annette; Völksch, Beate; Gross, Jürgen; Spiteller, Dieter

    2015-12-01

    Bacteria were isolated from necrotic apple and pear tree tissue and from dead wood in Germany and Austria as well as from pear tree exudate in China. They were selected for growth at 37 °C, screened for levan production and then characterized as Gram-negative, facultatively anaerobic rods. Nucleotide sequences from 16S rRNA genes, the housekeeping genes dnaJ, gyrB, recA and rpoB alignments, BLAST searches and phenotypic data confirmed by MALDI-TOF analysis showed that these bacteria belong to the genus Gibbsiella and resembled strains isolated from diseased oaks in Britain and Spain. Gibbsiella-specific PCR primers were designed from the proline isomerase and the levansucrase genes. Acid secretion was investigated by screening for halo formation on calcium carbonate agar and the compound identified by NMR as acetic acid. Its production by Gibbsiella spp. strains was also determined in culture supernatants by GC/MS analysis after derivatization with pentafluorobenzyl bromide. Some strains were differentiated by the PFGE patterns of SpeI digests and by sequence analyses of the lsc and the ppiD genes, and the Chinese Gibbsiella strain was most divergent. The newly investigated bacteria as well as Gibbsiella querinecans, Gibbsiella dentisursi and Gibbsiella papilionis, isolated in Britain, Spain, Korea and Japan, are taxonomically related Enterobacteriaceae, tolerate and secrete acetic acid. We therefore propose to unify them in the species Gibbsiella acetica sp. nov.

  8. Analysis of phylogenetic and functional diverge in plant nine-cis epoxycarotenoid dioxygenase gene family.

    PubMed

    Priya, R; Siva, Ramamoorthy

    2015-07-01

    During different environmental stress conditions, plant growth is regulated by the hormone abscisic acid (an apocarotenoid). In the biosynthesis of abscisic acid, the oxidative cleavage of cis-epoxycarotenoid catalyzed by 9-cis-epoxycarotenoid dioxygenase (NCED) is the crucial step. The NCED genes were isolated in numerous plant species and those genes were phylogenetically investigated to understand the evolution of NCED genes in various plant lineages comprising lycophyte, gymnosperm, dicot and monocot. A total of 93 genes were obtained from 48 plant species to statistically estimate their sequence conservation and functional divergence. Selaginella moellendorffii appeared to be evolutionarily distinct from those of the angiosperms, insisting the substantial influence of natural selection pressure on NCED genes. Further, using exon-intron structure analysis, the gene structures of NCED were found to be conserved across some species. In addition, the substitution rate ratio of non-synonymous (Ka) versus synonymous (Ks) mutations using the Bayesian inference approach, depicted the critical amino acid residues for functional divergence. A significant functional divergence was found between some subgroups through the co-efficient of type-I functional divergence. Our results suggest that the evolution of NCED genes occurred by duplication, diversification and exon intron loss events. The site-specific profile and functional diverge analysis revealed NCED genes might facilitate the tissue-specific functional divergence in NCED sub-families, that could combat different environmental stress conditions aiding plant survival.

  9. The amino acid sequence of cytochromes c-551 from three species of Pseudomonas

    PubMed Central

    Ambler, R. P.; Wynn, Margaret

    1973-01-01

    The amino acid sequences of the cytochromes c-551 from three species of Pseudomonas have been determined. Each resembles the protein from Pseudomonas strain P6009 (now known to be Pseudomonas aeruginosa, not Pseudomonas fluorescens) in containing 82 amino acids in a single peptide chain, with a haem group covalently attached to cysteine residues 12 and 15. In all four sequences 43 residues are identical. Although by bacteriological criteria the organisms are closely related, the differences between pairs of sequences range from 22% to 39%. These values should be compared with the differences in the sequence of mitochondrial cytochrome c between mammals and amphibians (about 18%) or between mammals and insects (about 33%). Detailed evidence for the amino acid sequences of the proteins has been deposited as Supplementary Publication SUP 50015 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973), 131, 5. PMID:4352718

  10. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    PubMed Central

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    ABSTRACT The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  11. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein.

  12. Draft Genome Sequence of Bacillus coagulans NL01, a Wonderful l-Lactic Acid Producer

    PubMed Central

    Zheng, Zhaojuan; Jiang, Ting; Lin, Xi; Zhou, Jie

    2015-01-01

    Here, we report the draft genome sequence of Bacillus coagulans NL01, which could produce high optically pure l-lactic acid using xylose as a sole carbon source. The draft genome is 3,505,081 bp, with 144 contigs. About 3,903 protein-coding genes and 92 rRNAs are predicted from this assembly. PMID:26089419

  13. Comparison of orthologous and paralogous DNA flanking the wheat high molecular weight glutenin genes: sequence conservation and divergence, transposon distribution, and matrix-attachment regions.

    PubMed

    Anderson, O D; Larka, L; Christoffers, M J; McCue, K F; Gustafson, J P

    2002-04-01

    Extended flanking DNA sequences were characterized for five members of the wheat high molecular weight (HMW) glutenin gene family to understand more of the structure, control, and evolution of these genes. Analysis revealed more sequence conservation among orthologous regions than between paralogous regions, with differences mainly owing to transposition events involving putative retrotransposons and several miniature inverted transposable elements (MITEs). Both gyspy-like long terminal repeat (LTR) and non-LTR retrotransposon sequences are represented in the flanking DNAs. One of the MITEs is a novel class, but another MITE is related to the maize Stowaway family and is widely represented in Triticeae express sequence tags (ESTs). Flanking DNA of the longest sequence, a 20 425-bp fragment including and surrounding the HMW-glutenin Bx7 gene, showed additional cereal gene-like sequences both immediately 5' and 3' to the HMW-glutenin coding region. The transcriptional activities of sequences related to these flanking putative genes and the retrotransposon-related regions were indicated by matches to wheat and other Triticeae ESTs. Predictive analysis of matrix-attachment regions (MARs) of the HMW glutenin and several alpha-, gamma-, and omega-gliadin flanking DNAs indicate potential MARs immediately flanking each of the genes. Matrix binding activity in the predicted regions was confirmed for two of the HMW-glutenin genes.

  14. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    PubMed

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large scale preparation made possible determination of the full length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissue. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  15. Amino acid sequence of myoglobin from white-tailed deer (Odocoileus virginianus).

    PubMed

    Joseph, Poulson; Suman, Surendranath P; Li, Shuting; Fontaine, Michele; Steinke, Laurey

    2012-10-01

    Our objective was to determine the primary structure of white-tailed deer myoglobin (Mb). White-tailed deer Mb was isolated from cardiac muscles employing ammonium sulfate precipitation and gel-filtration chromatography. The amino acid sequence was determined by Edman degradation. Sequence analyses of intact Mb as well as tryptic- and cyanogen bromide-peptides yielded the complete primary structure of white-tailed deer Mb, which shared 100% similarity with red deer Mb. White-tailed deer Mb consists of 153 amino acid residues and shares more than 96% sequence similarity with myoglobins from meat-producing ruminants, such as cattle, buffalo, sheep, and goat. Similar to sheep and goat myoglobins, white-tailed deer Mb contains 12 histidine residues. Proximal (position 93) and distal (position 64) histidine residues responsible for maintaining the stability of heme are conserved in white-tailed deer Mb.

  16. Nucleotide sequence and the encoded amino acids of human apolipoprotein A-I mRNA.

    PubMed Central

    Law, S W; Brewer, H B

    1984-01-01

    The cDNA clones encoding the precursor form of human liver apolipoprotein A-I (apoA-I), preproapoA-I, have been isolated from a cDNA library. A 17-base synthetic oligonucleotide based on residues 108-113 of apoA-I and a 26-base primer-extended, dideoxynucleotide-terminated cDNA were used as hybridization probes to select for recombinant plasmids bearing the apoA-I sequence. The complete nucleic acid sequence of human liver preproapoA-I has been determined by analysis of the cloned cDNA. The sequence is composed of 801 nucleotides encoding 267 amino acid residues. PreproapoA-I contains an 18-amino-acid prepeptide and a 6-amino-acid propeptide connected to the amino terminus of the 243-amino acid mature apoA-I. Southern blotting analysis of chromosomal DNA obtained from peripheral blood indicated the apoA-I gene is contained in a 2.1-kilobase-pair Pst I fragment and there is no gross difference in structural organization between the normal apoA-I gene and the Tangier disease apoA-I gene. Images PMID:6198645

  17. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids.

    PubMed

    Das, Jayanta Kumar; Das, Provas; Ray, Korak Kumar; Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.

  18. Pike and salmon as sister taxa: detailed intraclade resolution and divergence time estimation of Esociformes + Salmoniformes based on whole mitochondrial genome sequences.

    PubMed

    Campbell, Matthew A; López, J Andrés; Sado, Tetsuya; Miya, Masaki

    2013-11-01

    The increasing number of taxa and loci in molecular phylogenetic studies of basal euteleosts has brought stability in a controversial area. A key emerging aspect to these studies is a sister Esociformes (pike) and Salmoniformes (salmon) relationship. We evaluate mitochondrial genome support for a sister Esociformes and Salmoniformes hypothesis by surveying many potential outgroups for these taxa, employing multiple phylogenetic approaches, and utilizing a thorough sampling scheme. Secondly, we conduct a simultaneous divergence time estimation and phylogenetic inference in a Bayesian framework with fossil calibrations focusing on relationships within Esociformes+Salmoniformes. Our dataset supports a sister relationship between Esociformes and Salmoniformes; however the nearest relatives of Esociformes+Salmoniformes are inconsistent among analyses. Within the order Esociformes, we advocate for a single family, Esocidae. Subfamily relationships within Salmonidae are poorly supported as Salmoninae sister to Thymallinae+Coregoninae.

  19. Long-read sequencing improves assembly of Trichinella genomes 10-fold, revealing substantial synteny between lineages diverged over seven million years

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome evolution influences a parasite’s’s pathogenicity, host-pathogen interactions, environmental constraints, and invasion biology, while genome assemblies form the basis of comparative sequence analyses. Given that closely related organisms typically maintain appreciable synteny, the genome asse...

  20. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided.

  1. Secondary structure prediction for complete rDNA sequences (18S, 5.8S, and 28S rDNA) of Demodex folliculorum, and comparison of divergent domains structures across Acari.

    PubMed

    Zhao, Ya-E; Wang, Zheng-Hang; Xu, Yang; Wu, Li-Ping; Hu, Li

    2013-10-01

    According to base pairing, the rRNA folds into corresponding secondary structures, which contain additional phylogenetic information. On the basis of sequencing for complete rDNA sequences (18S, ITS1, 5.8S, ITS2 and 28S rDNA) of Demodex, we predicted the secondary structure of the complete rDNA sequence (18S, 5.8S, and 28S rDNA) of Demodex folliculorum, which was in concordance with that of the main arthropod lineages in past studies. And together with the sequence data from GenBank, we also predicted the secondary structures of divergent domains in SSU rRNA of 51 species and in LSU rRNA of 43 species from four superfamilies in Acari (Cheyletoidea, Tetranychoidea, Analgoidea and Ixodoidea). The multiple alignment among the four superfamilies in Acari showed that, insertions from Tetranychoidea SSU rRNA formed two newly proposed helixes, and helix c3-2b of LSU rRNA was absent in Demodex (Cheyletoidea) taxa. Generally speaking, LSU rRNA presented more remarkable differences than SSU rRNA did, mainly in D2, D3, D5, D7a, D7b, D8 and D10.

  2. Complete Genome Sequence of the Alfalfa latent virus

    PubMed Central

    Shao, Jonathan; Postnikova, Olga A.

    2015-01-01

    The first complete genome sequence of the Alfalfa latent carlavirus (ALV) was obtained by primer walking and Illumina RNA sequencing. The virus differs substantially from the Czech ALV isolate and the Pea streak virus isolate from Wisconsin. The absence of a clear nucleic acid-binding protein indicates ALV divergence from other carlaviruses. PMID:25883281

  3. Amino acid sequence of band-3 protein from rainbow trout erythrocytes derived from cDNA.

    PubMed Central

    Hübner, S; Michel, F; Rudloff, V; Appelhans, H

    1992-01-01

    In this report we present the first complete band-3 cDNA sequence of a poikilothermic lower vertebrate. The primary structure of the anion-exchange protein band 3 (AE1) from rainbow trout erythrocytes was determined by nucleotide sequencing of cDNA clones. The overlapping clones have a total length of 3827 bp with a 5'-terminal untranslated region of 150 bp, a 2754 bp open reading frame and a 3'-untranslated region of 924 bp. Band-3 protein from trout erythrocytes consists of 918 amino acid residues with a calculated molecular mass of 101 827 Da. Comparison of its amino acid sequence revealed a 60-65% identity within the transmembrane spanning sequence of band-3 proteins published so far. An additional insertion of 24 amino acid residues within the membrane-associated domain of trout band-3 protein was identified, which until now was thought to be a general feature only of mammalian band-3-related proteins. PMID:1637296

  4. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken; SNL,

    2016-07-12

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  5. Role of the two-component leader sequence and mature amino acid sequences in extracellular export of endoglucanase EGL from Pseudomonas solanacearum.

    PubMed Central

    Huang, J Z; Schell, M A

    1992-01-01

    The egl gene of Pseudomonas solanacearum encodes a 43-kDa extracellular endoglucanase (mEGL) involved in wilt disease caused by this phytopathogen. Egl is initially translated with a 45-residue, two-part leader sequence. The first 19 residues are apparently removed by signal peptidase II during export of Egl across the inner membrane (IM); the remaining residues of the leader sequence (modified with palmitate) are removed during export across the outer membrane (OM). Localization of Egl-PhoA fusion proteins showed that the first 26 residues of the Egl leader sequence are required and sufficient to direct lipid modification, processing, and export of Egl or PhoA across the IM but not the OM. Fusions of the complete 45-residue leader sequence or of the leader and increasing portions of mEgl sequences to PhoA did not cause its export across the OM. In-frame deletion of portions of mEGL-coding sequences blocked export of the truncated polypeptides across the OM without affecting export across the IM. These results indicate that the first part of the leader sequence functions independently to direct export of Egl across the IM while the second part and sequences and structures in mEGL are involved in export across the OM. Computer analysis of the mEgl amino acid sequence obtained from its nucleotide sequence identified a region of mEGL similar in amino acid sequence to regions in other prokaryotic endoglucanases. Images PMID:1735723

  6. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    PubMed

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  7. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    PubMed Central

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687

  8. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    PubMed

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  9. Multiple site-selective insertions of non-canonical amino acids into sequence-repetitive polypeptides

    PubMed Central

    Wu, I-Lin; Patterson, Melissa A.; Carpenter Desai, Holly E.; Mehl, Ryan A.; Giorgi, Gianluca

    2013-01-01

    A simple and efficient method is described for introduction of non-canonical amino acids at multiple, structurally defined sites within recombinant polypeptide sequences. E. coli MRA30, a bacterial host strain with attenuated activity for release factor 1 (RF1), is assessed for its ability to support the incorporation of a diverse range of non-canonical amino acids in response to multiple encoded amber (TAG) codons within genetic templates derived from superfolder GFP and an elastin-mimetic protein polymer. Suppression efficiency and isolated protein yield were observed to depend on the identity of the orthogonal aminoacyl-tRNA synthetase/tRNACUA pair and the non-canonical amino acid substrate. This approach afforded elastin-mimetic protein polymers containing non-canonical amino acid derivatives at up to twenty-two positions within the repeat sequence with high levels of substitution. The identity and position of the variant residues was confirmed by mass spectrometric analysis of the full-length polypeptides and proteolytic cleavage fragments resulting from thermolysin digestion. The accumulated data suggest that this multi-site suppression approach permits the preparation of protein-based materials in which novel chemical functionality can be introduced at precisely defined positions within the polypeptide sequence. PMID:23625817

  10. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected in a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  11. SUBGROUPS OF AMINO ACID SEQUENCES IN THE VARIABLE REGIONS OF IMMUNOGLOBULIN HEAVY CHAINS*

    PubMed Central

    Cunningham, Bruce A.; Pflumm, Mollie N.; User, Urs Rutisha; Edelman, Gerald M.

    1969-01-01

    The amino acid sequence of the first 133 residues of the heavy (γ) chain from a human γG immunoglobulin (He) has been determined. This γ-chain is identical in Gm type to that of protein Eu, the complete sequence of which has been reported. Comparison of the two sequences substantiates the previous suggestion that there are subgroups of variable regions of heavy chains. The variable region of Eu has been assigned to subgroup I and that of He to subgroup II; on the other hand, the constant regions of the two proteins appear to be identical. Comparison of the sequence of the heavy chain of He with the heavy chain sequences determined in other laboratories suggests that the variable region of subgroup II is at least 118 residues long. The nature and distribution of amino acid variations in this heavy chain subgroup resemble those observed in light chain subgroups. These studies provide evidence that the translocation hypothesis applies to heavy as well as to light chains, viz., genes for variable regions (V) are somatically translocated to genes for constant regions (C) to form complete VC structural genes. Images PMID:5264153

  12. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand.

  13. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  14. Amino-Acid Sequence of NADP-Specific Glutamate Dehydrogenase of Neurospora crassa

    PubMed Central

    Wootton, John C.; Chambers, Geoffrey K.; Holder, Anthony A.; Baron, Andrew J.; Taylor, John G.; Fincham, John R. S.; Blumenthal, Kenneth M.; Moon, Kenneth; Smith, Emil L.

    1974-01-01

    A tentative primary structure of the NADP-specific glutamate dehydrogenase [L-glutamate: NADP oxidoreductase (deaminating), EC 1.4.1.4] from Neurospora crassa has been determined. The proposed sequence contains 452 amino-acid residues in each of the identical subunits of the hexameric enzyme. Comparison of the sequence with that of the bovine liver enzyme reveals considerable homology in the amino-terminal portion of the chain, including the vicinity of the reactive lysine, with only shorter stretches of homology within the carboxyl-terminal regions. The significance of this distribution of homologous regions is discussed. PMID:4155068

  15. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    PubMed Central

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583

  16. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion.

    PubMed

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-07-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed).

  17. Extended amino acid sequences around the active-site lysine residue of class-I fructose 1,6-bisphosphate aldolases from rabbit muscle, sturgeon muscle, trout muscle and ox liver.

    PubMed Central

    Benfield, P A; Forcina, B G; Gibbons, I; Perham, R N

    1979-01-01

    1. Amino acid sequences covering the region between residues 173 and 248 [adopting the numbering system proposed by Lai, Nakai & Chang (1974) Science 183, 1204-1206] were derived for trout (Salmo trutta) muscle aldolase and for ox liver aldolase. A comparable sequence was derived for residues 180-248 of sturgeon (Acipenser transmontanus) muscle aldolase. The close homology with the rabbit muscle enzyme was used to align the peptides of the other aldolases from which the sequences were derived. The results also allowed a partial sequence for the N-terminal 39 residues for the ox liver enzyme to be deduced. 2. In the light of the strong homology evinced for these enzymes, a re-investigation of the amino acid sequence of rabbit muscle aldolase between residues 181 and 185 was undertaken. This indicated the presence of a hitherto unsuspected -Ile-Val-sequence between residues 181 and 182 and the need to invert the sequence -Glu-Val- to -Val-Glx- at positions 184 and 185. 3. Comparison of the available amino acid sequences of these enzymes suggested an early evolutionary divergence of the genes for muscle and liver aldolases. It was also consistent with other evidence that the central region of the primary structure of these enzymes (which includes the active-site lysine-227) forms part of a conserved folding domain in the protein subunit. 4. Detailed evidence for the amino acid sequences proposed has been deposited as Suy Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1978) 169, 5. PMID:534504

  18. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  19. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  20. Sequence-specific thermodynamic properties of nucleic acids influence both transcriptional pausing and backtracking in yeast

    PubMed Central

    2017-01-01

    RNA Polymerase II pauses and backtracks during transcription, with many consequences for gene expression and cellular physiology. Here, we show that the energy required to melt double-stranded nucleic acids in the transcription bubble predicts pausing in Saccharomyces cerevisiae far more accurately than nucleosome roadblocks do. In addition, the same energy difference also determines when the RNA polymerase backtracks instead of continuing to move forward. This data-driven model corroborates—in a genome wide and quantitative manner—previous evidence that sequence-dependent thermodynamic features of nucleic acids influence both transcriptional pausing and backtracking. PMID:28301878

  1. Respiratory syncytial virus fusion glycoprotein: nucleotide sequence of mRNA, identification of cleavage activation site and amino acid sequence of N-terminus of F1 subunit.

    PubMed Central

    Elango, N; Satake, M; Coligan, J E; Norrby, E; Camargo, E; Venkatesan, S

    1985-01-01

    The amino acid sequence of respiratory syncytial virus fusion protein (Fo) was deduced from the sequence of a partial cDNA clone of mRNA and from the 5' mRNA sequence obtained by primer extension and dideoxysequencing. The encoded protein of 574 amino acids is extremely hydrophobic and has a molecular weight of 63371 daltons. The site of proteolytic cleavage within this protein was accurately mapped by determining a partial amino acid sequence of the N-terminus of the larger subunit (F1) purified by radioimmunoprecipitation using monoclonal antibodies. Alignment of the N-terminus of the F1 subunit within the deduced amino acid sequence of Fo permitted us to identify a sequence of lys-lys-arg-lys-arg-arg at the C-terminus of the smaller N-terminal F2 subunit that appears to represent the cleavage/activation domain. Five potential sites of glycosylation, four within the F2 subunit, were also identified. Three extremely hydrophobic domains are present in the protein; a) the N-terminal signal sequence, b) the N-terminus of the F1 subunit that is analogous to the N-terminus of the paramyxovirus F1 subunit and the HA2 subunit of influenza virus hemagglutinin, and c) the putative membrane anchorage domain near the C-terminus of F1. Images PMID:2987829

  2. Analysis of protein function and its prediction from amino acid sequence.

    PubMed

    Clark, Wyatt T; Radivojac, Predrag

    2011-07-01

    Understanding protein function is one of the keys to understanding life at the molecular level. It is also important in the context of human disease because many conditions arise as a consequence of alterations of protein function. The recent availability of relatively inexpensive sequencing technology has resulted in thousands of complete or partially sequenced genomes with millions of functionally uncharacterized proteins. Such a large volume of data, combined with the lack of high-throughput experimental assays to functionally annotate proteins, attributes to the growing importance of automated function prediction. Here, we study proteins annotated by Gene Ontology (GO) terms and estimate the accuracy of functional transfer from protein sequence only. We find that the transfer of GO terms by pairwise sequence alignments is only moderately accurate, showing a surprisingly small influence of sequence identity (SID) in a broad range (30-100%). We developed and evaluated a new predictor of protein function, functional annotator (FANN), from amino acid sequence. The predictor exploits a multioutput neural network framework which is well suited to simultaneously modeling dependencies between functional terms. Experiments provide evidence that FANN-GO (predictor of GO terms; available from http://www.informatics.indiana.edu/predrag) outperforms standard methods such as transfer by global or local SID as well as GOtcha, a method that incorporates the structure of GO.

  3. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  4. Amino acid sequence of myoglobin from emu (Dromaius novaehollandiae) skeletal muscle.

    PubMed

    Suman, S P; Joseph, P; Li, S; Beach, C M; Fontaine, M; Steinke, L

    2010-11-01

    The objective of the present study was to characterize the primary structure of emu myoglobin (Mb). Emu Mb was isolated from Iliofibularis muscle employing gel-filtration chromatography. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry was employed to determine the exact molecular mass of emu Mb in comparison with horse Mb, and Edman degradation was utilized to characterize the amino acid sequence. The molecular mass of emu Mb was 17,380 Da and was close to those reported for ratite and poultry myoglobins. Similar to myoglobins from meat-producing livestock and birds, emu Mb has 153 amino acids. Emu Mb contains 9 histidines. Proximal and distal histidines, responsible for coordinating oxygen-binding property of Mb, are conserved in emu. Emu Mb shared more than 90% homology with ratite and chicken myoglobins, whereas it demonstrated only less than 70% sequence similarity with ruminant myoglobins.

  5. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    NASA Astrophysics Data System (ADS)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  6. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  7. Amino acid sequence of atrial natriuretic peptides in human coronary sinus plasma.

    PubMed

    Yandle, T; Crozier, I; Nicholls, G; Espiner, E; Carne, A; Brennan, S

    1987-07-31

    Two atrial natriuretic peptides were purified from pooled human coronary sinus plasma by Sep-Pak extraction, immunoaffinity chromatography and reverse phase HPLC. The amino acid sequences of the two peptides were homologous with 99-126 human atrial natriuretic peptide (hANP) and 106-126 hANP, the latter being most probably linked to 99-105 ANP by the disulphide bond. The molar ratio of the peptides in plasma, as assessed by radioimmunoassay was 10:3.

  8. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein

    PubMed Central

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer. PMID:24147211

  9. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    PubMed

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  10. New Highly Divergent rRNA Sequence among Biodiverse Genotypes of Enterocytozoon bieneusi Strains Isolated from Humans in Gabon and Cameroon▿

    PubMed Central

    Breton, Jacques; Bart-Delabesse, Emmanuelle; Biligui, Sylvestre; Carbone, Alessandra; Seiller, Xavier; Okome-Nkoumou, Madeleine; Nzamba, Chantal; Kombila, Maryvonne; Accoceberry, Isabelle; Thellier, Marc

    2007-01-01

    Intestinal microsporidiosis due to Enterocytozoon bieneusi is a leading cause of chronic diarrhea in severely immunocompromised human immunodeficiency virus (HIV)-positive patients. It may be a public health problem in Africa due to the magnitude of the HIV pandemic and to poor sanitary conditions. We designed two prevalence studies of E. bieneusi in Central Africa, the first with HIV-positive patients from an urban setting in Gabon and the second with a nonselected rural population in Cameroon. Stool samples were analyzed by an immunofluorescence antibody test and PCR. Twenty-five out of 822 HIV-positive patients from Gabon and 22 out of 758 villagers from Cameroon were found to be positive for E. bieneusi. The prevalence rates of the two studies were surprisingly similar (3.0% and 2.9%). Genotypic analysis of the internal transcribed spacer region of the rRNA gene showed a high degree of diversity in samples from both countries. In Gabon, 15 isolates showed seven different genotypes: the previously reported genotypes A, D, and K along with four new genotypes, referred to as CAF1, CAF2, CAF3, and CAF4. In Cameroon, five genotypes were found in 20 isolates: the known genotypes A, B, D, and K and the new genotype CAF4. Genotypes A and CAF4 predominated in Cameroon, whereas K, CAF4, and CAF1 were more frequent in Gabon, suggesting that different genotypes present differing risks of infection associated with immune status and living conditions. Phylogenetic analysis of the new genotype CAF4, identified in both HIV-negative and HIV-positive subjects, indicates that it represents a highly divergent strain. PMID:17537939

  11. Amino acid sequence similarity between rabies virus glycoprotein and snake venom curaremimetic neurotoxins.

    PubMed

    Lentz, T L; Wilson, P T; Hawrot, E; Speicher, D W

    1984-11-16

    Evidence was presented earlier that a host-cell receptor for the highly neurotropic rabies virus might be the acetylcholine receptor. The amino acid sequence of the glycoprotein of rabies virus was compared by computer analysis with that of snake venom curaremimetic neurotoxins, potent ligands of the acetylcholine receptor. A statistically significant sequence relation was found between a segment of the rabies glycoprotein and the entire sequence of long neurotoxins. The greatest identity occurs with residues considered most important in neurotoxicity, including those interacting with the acetylcholine binding site of the acetylcholine receptor. Because of the similarity between the glycoprotein and the receptor-binding region of the neurotoxins, this region of the viral glycoprotein may function as a recognition site for the acetylcholine receptor. Direct binding of the rabies virus glycoprotein to the acetylcholine receptor could contribute to the neurotropism of this virus.

  12. Partial amino acid sequence of human pancreatic stone protein, a novel pancreatic secretory protein.

    PubMed Central

    Montalto, G; Bonicel, J; Multigner, L; Rovery, M; Sarles, H; De Caro, A

    1986-01-01

    Pancreatic stone protein (PSP) is the major organic component of human pancreatic stones. With the use of monoclonal antibody immunoadsorbents, five immunoreactive forms (PSP-S) with close Mr values (14,000-19,000) were isolated from normal pancreatic juice. By CM-Trisacryl M chromatography the lowest-Mr form (PSP-S1) was separated from the others and some of its molecular characteristics were investigated. The Mr of the PSP-S1 polypeptide chain calculated from the amino acid composition was about 16,100. The N-terminal sequences (40 residues) of PSP and PSP-S1 are identical, which suggests that the peptide backbone is the same for both of these polypeptides. The PSP-S1 sequence was determined up to residue 65 and was found to be different from all other known protein sequences. Images Fig. 1. PMID:3541906

  13. Ecological significance of microdiversity: identical 16S rRNA gene sequences can be found in bacteria with highly divergent genomes and ecophysiologies.

    PubMed

    Jaspers, Elke; Overmann, Jörg

    2004-08-01

    A combination of cultivation-based methods with a molecular biological approach was used to investigate whether planktonic bacteria with identical 16S rRNA gene sequences can represent distinct eco- and genotypes. A set of 11 strains of Brevundimonas alba were isolated from a bacterial freshwater community by conventional plating or by using a liquid most-probable-number (MPN) dilution series. These strains had identical 16S rRNA gene sequences and represented the dominant phylotype in the plateable fraction, as well as in the highest positive dilutions of the MPN series. However, internally transcribed spacer and enterobacterial repetitive intergenic consensus PCR fingerprinting analyses, as well as DNA-DNA hybridization analyses, revealed great genetic diversity among the 11 strains. Each strain utilized a specific combination of 59 carbon substrates, and the niche overlap indices were low, suggesting that each strain occupied a different ecological niche. In dialysis cultures incubated in situ, each strain had a different growth rate and cell yield. We thus demonstrated that the B. alba strains represent distinct populations with genetically determined adaptations and probably occupy different ecological niches. Our results have implications for assessment of the diversity and biogeography of bacteria and increase the perception of natural diversity beyond the level of 16S rRNA gene sequences.

  14. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment.

  15. Rates of genomic divergence in humans, chimpanzees and their lice.

    PubMed

    Johnson, Kevin P; Allen, Julie M; Olds, Brett P; Mugisha, Lawrence; Reed, David L; Paige, Ken N; Pittendrigh, Barry R

    2014-02-22

    The rate of DNA mutation and divergence is highly variable across the tree of life. However, the reasons underlying this variation are not well understood. Comparing the rates of genetic changes between hosts and parasite lineages that diverged at the same time is one way to begin to understand differences in genetic mutation and substitution rates. Such studies have indicated that the rate of genetic divergence in parasites is often faster than that of their hosts when comparing single genes. However, the variation in this relative rate of molecular evolution across different genes in the genome is unknown. We compared the rate of DNA sequence divergence between humans, chimpanzees and their ectoparasitic lice for 1534 protein-coding genes across their genomes. The rate of DNA substitution in these orthologous genes was on average 14 times faster for lice than for humans and chimpanzees. In addition, these rates were positively correlated across genes. Because this correlation only occurred for substitutions that changed the amino acid, this pattern is probably produced by similar functional constraints across the same genes in humans, chimpanzees and their ectoparasites.

  16. [MOLECULAR EVOLUTION OF ION CHANNELS: AMINO ACID SEQUENCES AND 3D STRUCTURES].

    PubMed

    Korkosh, V S; Zhorov, B S; Tikhonov, D B

    2016-01-01

    An integral part of modern evolutionary biology is comparative analysis of structure and function of macromolecules such as proteins. The first and critical step to understand evolution of homologous proteins is their amino acid sequence alignment. However, standard algorithms fop not provide unambiguous sequence alignments for proteins of poor homology. More reliable results can be obtained by comparing experimental 3D structures obtained at atomic resolution, for instance, with the aid of X-ray structural analysis. If such structures are lacking, homology modeling is used, which may take into account indirect experimental data on functional roles of individual amino-acid residues. An important problem is that the sequence alignment, which reflects genetic modifications, does not necessarily correspond to the functional homology. The latter depends on three-dimensional structures which are critical for natural selection. Since alignment techniques relying only on the analysis of primary structures carry no information on the functional properties of proteins, including 3D structures into consideration is very important. Here we consider several examples involving ion channels and demonstrate that alignment of their three-dimensional structures can significantly improve sequence alignments obtained by traditional methods.

  17. The amino acid sequence of the aspartate aminotransferase from baker's yeast (Saccharomyces cerevisiae).

    PubMed Central

    Cronin, V B; Maras, B; Barra, D; Doonan, S

    1991-01-01

    1. The single (cytosolic) aspartate aminotransferase was purified in high yield from baker's yeast (Saccharomyces cerevisiae). 2. Amino-acid-sequence analysis was carried out by digestion of the protein with trypsin and with CNBr; some of the peptides produced were further subdigested with Staphylococcus aureus V8 proteinase or with pepsin. Peptides were sequenced by the dansyl-Edman method and/or by automated gas-phase methods. The amino acid sequence obtained was complete except for a probable gap of two residues as indicated by comparison with the structures of counterpart proteins in other species. 3. The N-terminus of the enzyme is blocked. Fast-atom-bombardment m.s. was used to identify the blocking group as an acetyl one. 4. Alignment of the sequence of the enzyme with those of vertebrate cytosolic and mitochondrial aspartate aminotransferases and with the enzyme from Escherichia coli showed that about 25% of residues are conserved between these distantly related forms. 5. Experimental details and confirmatory data for the results presented here are given in a Supplementary Publication (SUP 50164, 25 pages) that has been deposited at the British Library Document Supply Centre, Boston Spa. Wetherby, West Yorkshire LS23 7 BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1991) 273, 5. PMID:1859361

  18. Analysis of amino acid sequence variations and immunoglobulin E-binding epitopes of German cockroach tropomyosin.

    PubMed

    Jeong, Kyoung Yong; Lee, Jongweon; Lee, In-Yong; Ree, Han-Il; Hong, Chein-Soo; Yong, Tai-Soon

    2004-09-01

    The allergenicities of tropomyosins from different organisms have been reported to vary. The cDNA encoding German cockroach tropomyosin (Bla g 7) was isolated, expressed, and characterized previously. In the present study, the amino acid sequence variations in German cockroach tropomyosin were analyzed in order to investigate its influence on allergenicity. We also undertook the identification of immunodominant peptides containing immunoglobulin E (IgE) epitopes which may facilitate the development of diagnostic and immunotherapeutic strategies based on the recombinant proteins. Two-dimensional gel electrophoresis and immunoblot analysis with mouse anti-recombinant German cockroach tropomyosin serum was performed to investigate the isoforms at the protein level. Reverse transcriptase PCR (RT-PCR) was applied to examine the sequence diversity. Eleven different variants of the deduced amino acid sequences were identified by RT-PCR. German cockroach tropomyosin has only minor sequence variations that did not seem to affect its allergenicity significantly. These results support the molecular basis underlying the cross-reactivities of arthropod tropomyosins. Recombinant fragments were also generated by PCR, and IgE-binding epitopes were assessed by enzyme-linked immunosorbent assay. Sera from seven patients revealed heterogeneous IgE-binding responses. This study demonstrates multiple IgE-binding epitope regions in a single molecule, suggesting that full-length tropomyosin should be used for the development of diagnostic and therapeutic reagents.

  19. Synthesis, Improved Antisense Activity and Structural Rationale for the Divergent RNA Affinities of 3′-Fluoro Hexitol Nucleic Acid (FHNA and Ara-FHNA) Modified Oligonucleotides

    PubMed Central

    Egli, Martin; Pallan, Pradeep S.; Allerson, Charles R.; Prakash, Thazha P.; Berdeja, Andres; Yu, Jinghua; Lee, Sam; Watt, Andrew; Gaus, Hans; Bhat, Balkrishen; Swayze, Eric E.; Seth, Punit P.

    2011-01-01

    The synthesis, biophysical, structural and biological properties of both isomers of 3′-fluoro hexitol nucleic acid (FHNA and Ara-FHNA) modified oligonucleotides are reported. Synthesis of the FHNA and Ara-FHNA thymine phosphoramidites was efficiently accomplished starting from known sugar precursors. Optimal RNA affinities were observed with 3′-fluorine atom and nucleobase in a trans-diaxial orientation. The Ara-FHNA analog with an equatorial fluorine was found to be destabilizing. However, the magnitude of destabilization was sequence-dependent. Thus, the loss of stability is sharply reduced when Ara-FHNA residues were inserted at pyrimidine-purine (Py-Pu) steps compared to placement within a stretch of pyrimidines (Py-Py). Crystal structures of A-type DNA duplexes modified with either monomer, provide a rationalization for the opposing stability effects and point to a steric origin of the destabilization caused by the Ara-FHNA analog. The sequence dependent effect can be explained by the formation of an inter-nucleotide C-F…H-C pseudo hydrogen bond between F3′ of Ara-FHNA and C8-H of the nucleobase from the 3′-adjacent adenosine that is absent at Py-Py steps. In animal experiments, FHNA-modified antisense oligonucleotides formulated in saline showed potent downregulation of gene expression in liver tissue without producing hepatotoxicity. Our data establish FHNA as a useful modification for antisense therapeutics and also confirm the stabilizing influence of F(Py)…H-C(Pu) pseudo hydrogen bonds in nucleic acid structures. PMID:21919455

  20. Synthesis, Improved Antisense Activity and Structural Rationale for the Divergent RNA Affinities of 3;#8242;-Fluoro Hexitol Nucleic Acid (FHNA and Ara-FHNA) Modified Oligonucleotides

    SciTech Connect

    Egli, Martin; Pallan, Pradeep S.; Allerson, Charles R.; Prakash, Thazha P.; Berdeja, Andres; Yu, Jinghua; Lee, Sam; Watt, Andrew; Gaus, Hans; Bhat, Balkrishen; Swayze, Eric E.; Seth, Punit P.

    2012-03-16

    The synthesis, biophysical, structural, and biological properties of both isomers of 3'-fluoro hexitol nucleic acid (FHNA and Ara-FHNA) modified oligonucleotides are reported. Synthesis of the FHNA and Ara-FHNA thymine phosphoramidites was efficiently accomplished starting from known sugar precursors. Optimal RNA affinities were observed with a 3'-fluorine atom and nucleobase in a trans-diaxial orientation. The Ara-FHNA analog with an equatorial fluorine was found to be destabilizing. However, the magnitude of destabilization was sequence-dependent. Thus, the loss of stability is sharply reduced when Ara-FHNA residues were inserted at pyrimidine-purine (Py-Pu) steps compared to placement within a stretch of pyrimidines (Py-Py). Crystal structures of A-type DNA duplexes modified with either monomer provide a rationalization for the opposing stability effects and point to a steric origin of the destabilization caused by the Ara-FHNA analog. The sequence dependent effect can be explained by the formation of an internucleotide C-F {hor_ellipsis} H-C pseudo hydrogen bond between F3' of Ara-FHNA and C8-H of the nucleobase from the 3'-adjacent adenosine that is absent at Py-Py steps. In animal experiments, FHNA-modified antisense oligonucleotides formulated in saline showed a potent downregulation of gene expression in liver tissue without producing hepatotoxicity. Our data establish FHNA as a useful modification for antisense therapeutics and also confirm the stabilizing influence of F(Py) {hor_ellipsis} H-C(Pu) pseudo hydrogen bonds in nucleic acid structures.

  1. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication.

  2. AdoMet radical proteins—from structure to evolution—alignment of divergent protein sequences reveals strong secondary structure element conservation

    PubMed Central

    Nicolet, Yvain; Drennan, Catherine L.

    2004-01-01

    Eighteen subclasses of S-adenosyl-l-methionine (AdoMet) radical proteins have been aligned in the first bioinformatics study of the AdoMet radical superfamily to utilize crystallographic information. The recently resolved X-ray structure of biotin synthase (BioB) was used to guide the multiple sequence alignment, and the recently resolved X-ray structure of coproporphyrinogen III oxidase (HemN) was used as the control. Despite the low 9% sequence identity between BioB and HemN, the multiple sequence alignment correctly predicted all but one of the core helices in HemN, and correctly predicted the residues in the enzyme active site. This alignment further suggests that the AdoMet radical proteins may have evolved from half-barrel structures (αβ)4 to three-quarter-barrel structures (αβ)6 to full-barrel structures (αβ)8. It predicts that anaerobic ribonucleotide reductase (RNR) activase, an ancient enzyme that, it has been suggested, serves as a link between the RNA and DNA worlds, will have a half-barrel structure, whereas the three-quarter barrel, exemplified by HemN, will be the most common architecture for AdoMet radical enzymes, and fewer members of the superfamily will join BioB in using a complete (αβ)8 TIM-barrel fold to perform radical chemistry. These differences in barrel architecture also explain how AdoMet radical enzymes can act on substrates that range in size from 10 atoms to 608 residue proteins. PMID:15289575

  3. Processing and amino acid sequence analysis of the mouse mammary tumor virus env gene product.

    PubMed Central

    Arthur, L O; Copeland, T D; Oroszlan, S; Schochetman, G

    1982-01-01

    The envelope proteins of mouse mammary tumor virus (MMTV) are synthesized from a subgenomic 24S mRNA as a 75,000-dalton glycosylated precursor polyprotein which is eventually processed to the mature glycoproteins gp52 and gp36. In vivo synthesis of this env precursor in the presence of the core glycosylation inhibitor tunicamycin yielded a precursor of approximately 61,000 daltons (P61env). However, a 67,000-dalton protein (P67env) was obtained from cell-free translation with the MMTV 24S mRNA as the template. To determine whether the portion of the protein cleaved from P67env to give P61env was removed from the NH2-terminal end of P67env and as such would represent a leader sequence, the NH2-terminal amino acid sequence of the terminal peptide gp52 was determined. Glutamic acid, and not methionine, was found to be the amino-terminal residue of gp52, indicating that the cleaved portion was derived from the NH2-terminal end of P67env. The NH2-terminal amino acid sequences of gp52's from endogenous and exogenous C3H MMTVs were determined though 46 residues and found to be identical. However, amino acid composition and type-specific gp52 radioimmunoassays from MMTVs grown in heterologous cells indicated primary structure differences between gp52's of the two viruses. The nucleic acid sequence of cloned MMTV DNA fragments (J. Majors and H. E. Varmus, personal communication) in conjunction with the NH2-terminal sequence of gp52 allowed localization of the env gene in the MMTV genome. Nucleotides coding for the NH2 terminus of gp52 begin approximately 0.8 kilobase to the 3' side of the single EcoRI cleavage site. Localization of the env gene at that point agrees with the proposed gene order -gag-pol-env- and also allows sufficient coding potential for the glycoprotein precursor without extending into the long terminal repeat. Images PMID:6281457

  4. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  5. BeadCons: detection of nucleic acid sequences by flow cytometry.

    PubMed

    Horejsh, Douglas; Martini, Federico; Capobianchi, Maria Rosaria

    2005-11-01

    Molecular beacons are single-stranded nucleic acid structures with a terminal fluorophore and a distal, terminal quencher. These molecules are typically used in real-time PCR assays, but have also been conjugated with solid matrices. This unit describes protocols related to molecular beacon-conjugated beads (BeadCons), whose specific hybridization with complementary target sequences can be resolved by cytometry. Assay sensitivity is achieved through the concentration of fluorescence signal on discrete particles. By using molecular beacons with different fluorophores and microspheres of different sizes, it is possible to construct a fluid array system with each bead corresponding to a specific target nucleic acid. Methods are presented for the design, construction, and use of BeadCons for the specific, multiplexed detection of unlabeled nucleic acids in solution. The use of bead-based detection methods will likely lead to the design of new multiplex molecular diagnostic tools.

  6. Measuring nanometer distances in nucleic acids using a sequence-independent nitroxide probe

    PubMed Central

    Qin, Peter Z; Haworth, Ian S; Cai, Qi; Kusnetzow, Ana K; Grant, Gian Paola G; Price, Eric A; Sowa, Glenna Z; Popova, Anna; Herreros, Bruno; He, Honghang

    2008-01-01

    This protocol describes the procedures for measuring nanometer distances in nucleic acids using a nitroxide probe that can be attached to any nucleotide within a given sequence. Two nitroxides are attached to phosphorothioates that are chemically substituted at specific sites of DNA or RNA. Inter-nitroxide distances are measured using a four-pulse double electron–electron resonance technique, and the measured distances are correlated to the parent structures using a Web-accessible computer program. Four to five days are needed for sample labeling, purification and distance measurement. The procedures described herein provide a method for probing global structures and studying conformational changes of nucleic acids and protein/nucleic acid complexes. PMID:17947978

  7. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1.

    PubMed

    Rhee, Mun Su; Moritz, Brélan E; Xie, Gary; Glavina Del Rio, T; Dalin, E; Tice, H; Bruce, D; Goodwin, L; Chertkov, O; Brettin, T; Han, C; Detter, C; Pitluck, S; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O; Shanmugam, K T

    2011-12-31

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  8. Conserved and Divergent Rhythms of Crassulacean Acid Metabolism-Related and Core Clock Gene Expression in the Cactus Opuntia ficus-indica1[C][W

    PubMed Central

    Mallona, Izaskun; Egea-Cortines, Marcos; Weiss, Julia

    2011-01-01

    The cactus Opuntia ficus-indica is a constitutive Crassulacean acid metabolism (CAM) species. Current knowledge of CAM metabolism suggests that the enzyme phosphoenolpyruvate carboxylase kinase (PPCK) is circadian regulated at the transcriptional level, whereas phosphoenolpyruvate carboxylase (PEPC), malate dehydrogenase (MDH), NADP-malic enzyme (NADP-ME), and pyruvate phosphate dikinase (PPDK) are posttranslationally controlled. As little transcriptomic data are available from obligate CAM plants, we created an expressed sequence tag database derived from different organs and developmental stages. Sequences were assembled, compared with sequences in the National Center for Biotechnology Information nonredundant database for identification of putative orthologs, and mapped using Kyoto Encyclopedia of Genes and Genomes Orthology and Gene Ontology. We identified genes involved in circadian regulation and CAM metabolism for transcriptomic analysis in plants grown in long days. We identified stable reference genes for quantitative polymerase chain reaction and found that OfiSAND, like its counterpart in Arabidopsis (Arabidopsis thaliana), and OfiTUB are generally appropriate standards for use in the quantification of gene expression in O. ficus-indica. Three kinds of expression profiles were found: transcripts of OfiPPCK oscillated with a 24-h periodicity; transcripts of the light-active OfiNADP-ME and OfiPPDK genes adapted to 12-h cycles, while transcript accumulation patterns of OfiPEPC and OfiMDH were arrhythmic. Expression of the circadian clock gene OfiTOC1, similar to Arabidopsis, oscillated with a 24-h periodicity, peaking at night. Expression of OfiCCA1 and OfiPRR9, unlike in Arabidopsis, adapted best to a 12-h rhythm, suggesting that circadian clock gene interactions differ from those of Arabidopsis. Our results indicate that the evolution of CAM metabolism could be the result of modified circadian regulation at both the transcriptional and posttranscriptional

  9. Conserved and divergent rhythms of crassulacean acid metabolism-related and core clock gene expression in the cactus Opuntia ficus-indica.

    PubMed

    Mallona, Izaskun; Egea-Cortines, Marcos; Weiss, Julia

    2011-08-01

    The cactus Opuntia ficus-indica is a constitutive Crassulacean acid metabolism (CAM) species. Current knowledge of CAM metabolism suggests that the enzyme phosphoenolpyruvate carboxylase kinase (PPCK) is circadian regulated at the transcriptional level, whereas phosphoenolpyruvate carboxylase (PEPC), malate dehydrogenase (MDH), NADP-malic enzyme (NADP-ME), and pyruvate phosphate dikinase (PPDK) are posttranslationally controlled. As little transcriptomic data are available from obligate CAM plants, we created an expressed sequence tag database derived from different organs and developmental stages. Sequences were assembled, compared with sequences in the National Center for Biotechnology Information nonredundant database for identification of putative orthologs, and mapped using Kyoto Encyclopedia of Genes and Genomes Orthology and Gene Ontology. We identified genes involved in circadian regulation and CAM metabolism for transcriptomic analysis in plants grown in long days. We identified stable reference genes for quantitative polymerase chain reaction and found that OfiSAND, like its counterpart in Arabidopsis (Arabidopsis thaliana), and OfiTUB are generally appropriate standards for use in the quantification of gene expression in O. ficus-indica. Three kinds of expression profiles were found: transcripts of OfiPPCK oscillated with a 24-h periodicity; transcripts of the light-active OfiNADP-ME and OfiPPDK genes adapted to 12-h cycles, while transcript accumulation patterns of OfiPEPC and OfiMDH were arrhythmic. Expression of the circadian clock gene OfiTOC1, similar to Arabidopsis, oscillated with a 24-h periodicity, peaking at night. Expression of OfiCCA1 and OfiPRR9, unlike in Arabidopsis, adapted best to a 12-h rhythm, suggesting that circadian clock gene interactions differ from those of Arabidopsis. Our results indicate that the evolution of CAM metabolism could be the result of modified circadian regulation at both the transcriptional and posttranscriptional

  10. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acids substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln 121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's phesant lysozyme. The phylogenetic tree constructured by the comparison of amino acid sequences of phasianoid birds lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group.

  11. Vorticity and divergence in the solar photosphere

    NASA Technical Reports Server (NTRS)

    Wang, YI; Noyes, Robert W.; Tarbell, Theodore D.; Title, Alan M.

    1995-01-01

    We have studied an outstanding sequence of continuum images of the solar granulation from Pic du Midi Observatory. We have calculated the horizontal vector flow field using a correlation tracking algorithm, and from this determined three scalar field: the vertical component of the curl; the horizontal divergence; and the horizontal flow speed. The divergence field has substantially longer coherence time and more power than does the curl field. Statistically, curl is better correlated with regions of negative divergence - that is, the vertical vorticity is higher in downflow regions, suggesting excess vorticity in intergranular lanes. The average value of the divergence is largest (i.e., outflow is largest) where the horizontal speed is large; we associate these regions with exploding granules. A numerical simulation of general convection also shows similar statistical differences between curl and divergence. Some individual small bright points in the granulation pattern show large local vorticities.

  12. A 25-Amino Acid Sequence of the Arabidopsis TGD2 Protein Is Sufficient for Specific Binding of Phosphatidic Acid*

    PubMed Central

    Lu, Binbin; Benning, Christoph

    2009-01-01

    Genetic analysis suggests that the TGD2 protein of Arabidopsis is required for the biosynthesis of endoplasmic reticulum derived thylakoid lipids. TGD2 is proposed to be the substrate-binding protein of a presumed lipid transporter consisting of the TGD1 (permease) and TGD3 (ATPase) proteins. The TGD1, -2, and -3 proteins are localized in the inner chloroplast envelope membrane. TGD2 appears to be anchored with an N-terminal membrane-spanning domain into the inner envelope membrane, whereas the C-terminal domain faces the intermembrane space. It was previously shown that the C-terminal domain of TGD2 binds phosphatidic acid (PtdOH). To investigate the PtdOH binding site of TGD2 in detail, the C-terminal domain of the TGD2 sequence lacking the transit peptide and transmembrane sequences was fused to the C terminus of the Discosoma sp. red fluorescent protein (DR). This greatly improved the solubility of the resulting DR-TGD2C fusion protein following production in Escherichia coli. The DR-TGD2C protein bound PtdOH with high specificity, as demonstrated by membrane lipid-protein overlay and liposome association assays. Internal deletion and truncation mutagenesis identified a previously undescribed minimal 25-amino acid fragment in the C-terminal domain of TGD2 that is sufficient for PtdOH binding. Binding characteristics of this 25-mer were distinctly different from those of TGD2C, suggesting that additional sequences of TGD2 providing the proper context for this 25-mer are needed for wild type-like PtdOH binding. PMID:19416982

  13. Nucleotide sequence of the luxC gene encoding fatty acid reductase of the lux operon from Photobacterium leiognathi.

    PubMed

    Lin, J W; Chao, Y F; Weng, S F

    1993-02-26

    The nucleotide sequence of the luxC gene (EMBL Accession No. 65156) encoding fatty acid reductase (FAR) of the lux operon from Photobacterium leiognathi PL741 was determined and the encoded amino acid sequence deduced. The fatty acid reductase is a component of the fatty acid reductase complex. The complex is responsible for converting fatty acid to aldehyde which serves as the substrate in the luciferase-catalyzed bioluminescent reaction. The protein comprises 478 amino acid residues and has a calculated M(r) of 53,858. Alignment and comparison of the fatty acid reductase of P. leiognathi with that of Vibrio harveyi B392 and Vibrio fischeri ATCC 7744 shows that there is 70% and 59% amino acid residues identity, respectively.

  14. Nucleotide sequence of the Klebsiella pneumoniae nifD gene and predicted amino acid sequence of the alpha-subunit of nitrogenase MoFe protein.

    PubMed Central

    Ioannidis, I; Buck, M

    1987-01-01

    The nucleotide sequence of the Klebsiella pneumoniae nifD gene is presented and together with the accompanying paper [Holland, Zilberstein, Zamir & Sussman (1987) Biochem. J. 247, 277-285] completes the sequence of the nifHDK genes encoding the nitrogenase polypeptides. The K. pneumoniae nifD gene encodes the 483-amino acid-residue nitrogenase alpha-subunit polypeptide of Mr 54156. The alpha-subunit has five strongly conserved cysteine residues at positions 63, 89, 155, 184 and 275, some occurring in a region showing both primary sequence and potential structural homology to the K. pneumoniae nitrogenase beta-subunit. A comparison with six other alpha-subunit amino acid sequences has been made, which indicates a number of potentially important domains within alpha-subunits. PMID:3322262

  15. Complete amino acid sequence of the A chain of human complement-classical-pathway enzyme C1r.

    PubMed Central

    Arlaud, G J; Willis, A C; Gagnon, J

    1987-01-01

    The amino acid sequence of human C1r A chain was determined, from sequence analysis performed on fragments obtained from C1r autolytic cleavage, cleavage of methionyl bonds, tryptic cleavages at arginine and lysine residues, and cleavages by staphylococcal proteinase. The polypeptide chain has an N-terminal serine residue and contains 446 amino acid residues (Mr 51,200). The sequence data allow chemical characterization of fragments alpha (positions 1-211), beta (positions 212-279) and gamma (positions 280-446) yielded from C1r autolytic cleavage, and identification of the two major cleavage sites generating these fragments. Position 150 of C1r A chain is occupied by a modified amino acid residue that, upon acid hydrolysis, yields erythro-beta-hydroxyaspartic acid, and that is located in a sequence homologous to the beta-hydroxyaspartic acid-containing regions of Factor IX, Factor X, protein C and protein Z. Sequence comparison reveals internal homology between two segments (positions 10-78 and 186-257). Two carbohydrate moieties are attached to the polypeptide chain, both via asparagine residues at positions 108 and 204. Combined with the previously determined sequence of C1r B chain [Arlaud & Gagnon (1983) Biochemistry 22, 1758-1764], these data give the complete sequence of human C1r. PMID:3036070

  16. Nucleotide sequences of the Pseudomonas savastanoi indoleacetic acid genes show homology with Agrobacterium tumefaciens T-DNA

    PubMed Central

    Yamada, Tetsuji; Palm, Curtis J.; Brooks, Bob; Kosuge, Tsune

    1985-01-01

    We report the nucleotide sequences of iaaM and iaaH, the genetic determinants for, respectively, tryptophan 2-monooxygenase and indoleacetamide hydrolase, the enzymes that catalyze the conversion of L-tryptophan to indoleacetic acid in the tumor-forming bacterium Pseudomonas syringae pv. savastanoi. The sequence analysis indicates that the iaaM locus contains an open reading frame encoding 557 amino acids that would comprise a protein with a molecular weight of 61,783; the iaaH locus contains an open reading frame of 455 amino acids that would comprise a protein with a molecular weight of 48,515. Significant amino acid sequence homology was found between the predicted sequence of the tryptophan monooxygenase of P. savastanoi and the deduced product of the T-DNA tms-1 gene of the octopine-type plasmid pTiA6NC from Agrobacterium tumefaciens. Strong homology was found in the 25 amino acid sequence in the putative FAD-binding region of tryptophan monooxygenase. Homology was also found in the amino acid sequences representing the central regions of the putative products of iaaH and tms-2 T-DNA. The results suggest a strong similarity in the pathways for indoleacetic acid synthesis encoded by genes in P. savastanoi and in A. tumefaciens T-DNA. Images PMID:16593610

  17. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    PubMed Central

    Chen, Ke; Kurgan, Lukasz A; Ruan, Jishou

    2007-01-01

    Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are characterized by accuracies below 70

  18. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  19. Gene sequence and predicted amino acid sequence of the motA protein, a membrane-associated protein required for flagellar rotation in Escherichia coli.

    PubMed Central

    Dean, G E; Macnab, R M; Stader, J; Matsumura, P; Burks, C

    1984-01-01

    The motA and motB gene products of Escherichia coli are integral membrane proteins necessary for flagellar rotation. We determined the DNA sequence of the region containing the motA gene and its promoter. Within this sequence, there is an open reading frame of 885 nucleotides, which with high probability (98% confidence level) meets criteria for a coding sequence. The 295-residue amino acid translation product had a molecular weight of 31,974, in good agreement with the value determined experimentally by gel electrophoresis. The amino acid sequence, which was quite hydrophobic, was subjected to a theoretical analysis designed to predict membrane-spanning alpha-helical segments of integral membrane proteins; four such hydrophobic helices were predicted by this treatment. Additional amphipathic helices may also be present. A remarkable feature of the sequence is the existence of two segments of high uncompensated charge density, one positive and the other negative. Possible organization of the protein in the membrane is discussed. Asymmetry in the amino acid composition of translated DNA sequences was used to distinguish between two possible initiation codons. The use of this method as a criterion for authentication of coding regions is described briefly in an Appendix. PMID:6090403

  20. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  1. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  2. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  3. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.

  4. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  5. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  6. Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

    PubMed Central

    Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

    1997-01-01

    The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753

  7. Microbial community dynamics in bioaugmented sequencing batch reactors for bromoamine acid removal.

    PubMed

    Qu, Yuanyuan; Zhou, Jiti; Wang, Jing; Fu, Xiang; Xing, Linlin

    2005-05-01

    Sphingomonas xenophaga QYY with the ability to degrade bromoamine acid (BAA) was previously isolated from sludge samples. The enhancement of BAA removal by strain QYY in sequencing batch reactors (SBRs) was investigated in this study. The results showed that augmented SBRs exhibited stronger abilities to degrade BAA than the non-augmented control one. In order to estimate the relationship between community dynamics and function of augmented SBRs, a combined method based on fingerprints (ribosomal intergenic spacer analysis, RISA) and 16S rRNA gene sequencing was used. The results indicated that the microbial community dynamics were substantially changed, and the introduced strain QYY was persistent in the augmented systems. This study suggests that it is feasible and potentially useful to enhance BAA removal using BAA-degrading bacteria, such as S. xenophaga QYY.

  8. [Measurement of the amino acid sequence for the fusion protein FP3 with LC-MS/MS].

    PubMed

    Li, Xiang; Gao, Xiang-Dong; Tao, Lei; Pei, De-Ning; Guo, Ying; Rao, Chun-Ming; Wang, Jun-Zhi

    2012-02-01

    The amino acid sequence of the fusion protein FP3 was measured by two types of LC-MS/MS and its primary structure was confirmed. After reduction and alkylation, the protein was digested with trypsin and glycosyl groups in glycopeptide were removed by PNGase F. The mixed peptides were separated by LC, then Q-TOF and Ion trap tandem mass spectrometry were used to measure b, y fragment ions of each peptide to analyze the amino acid sequence of fusion protein FP3. Seventy-six percent of full amino acid sequence of the fusion protein FP3 was measured by LC-ESI-Q-TOF with the remaining 24% completed by LC-ESI-Trap. As LC-MS and tandem mass spectrometry are rapid, sensitive, accurate to measure the protein amino acid sequence, they are important approach to structure analysis and identification of recombinant protein.

  9. NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents

    PubMed Central

    Liu, Sophia S.; Hockenberry, Adam J.; Lancichinetti, Andrea; Jewett, Michael C.

    2016-01-01

    The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems. PMID:27835644

  10. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  11. Genome-wide ancestry and divergence patterns from low-coverage sequencing data reveal a complex history of admixture in wild baboons

    PubMed Central

    Wall, Jeffrey D; Schlebusch, Stephen A; Alberts, Susan C; Cox, Laura A; Snyder-Mackler, Noah; Nevonen, Kimberly; Carbone, Lucia; Tung, Jenny

    2017-01-01

    Naturally occurring admixture has now been documented in every major primate lineage, suggesting its key role in primate evolutionary history. Active primate hybrid zones can provide valuable insight into this process. Here, we investigate the history of admixture in one of the best-studied natural primate hybrid zones, between yellow baboons (Papio cynocephalus) and anubis baboons (Papio anubis) in the Amboseli ecosystem of Kenya. We generated a new genome assembly for yellow baboon and low coverage genome-wide resequencing data from yellow baboons, anubis baboons, and known hybrids (n=44). Using a novel composite likelihood method for estimating local ancestry from low coverage data, we found high levels of genetic diversity and genetic differentiation between the parent taxa, and excellent agreement between genome-scale ancestry estimates and a priori pedigree, life history, and morphology-based estimates (r2=0.899). However, even putatively unadmixed Amboseli yellow individuals carried a substantial proportion of anubis ancestry, presumably due to historical admixture. Further, the distribution of shared versus fixed differences between a putatively unadmixed Amboseli yellow baboon and an unadmixed anubis baboon, both sequenced at high coverage, are inconsistent with simple isolation-migration or equilibrium migration models. Our findings suggest a complex process of intermittent contact that has occurred multiple times in baboon evolutionary history, despite no obvious fitness costs to hybrids or major geographic or behavioral barriers. In combination with the extensive phenotypic data available for baboon hybrids, our results provide valuable context for understanding the history of admixture in primates, including in our own lineage. PMID:27145036

  12. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    PubMed

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major grove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  13. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  14. Quantum skew divergence

    NASA Astrophysics Data System (ADS)

    Audenaert, Koenraad M. R.

    2014-11-01

    In this paper, we study the quantum generalisation of the skew divergence, which is a dissimilarity measure between distributions introduced by Lee in the context of natural language processing. We provide an in-depth study of the quantum skew divergence, including its relation to other state distinguishability measures. Finally, we present a number of important applications: new continuity inequalities for the quantum Jensen-Shannon divergence and the Holevo information, and a new and short proof of Bravyi's Small Incremental Mixing conjecture.

  15. Quantum skew divergence

    SciTech Connect

    Audenaert, Koenraad M. R.

    2014-11-15

    In this paper, we study the quantum generalisation of the skew divergence, which is a dissimilarity measure between distributions introduced by Lee in the context of natural language processing. We provide an in-depth study of the quantum skew divergence, including its relation to other state distinguishability measures. Finally, we present a number of important applications: new continuity inequalities for the quantum Jensen-Shannon divergence and the Holevo information, and a new and short proof of Bravyi's Small Incremental Mixing conjecture.

  16. Fluorescence energy transfer as a probe for nucleic acid structures and sequences.

    PubMed Central

    Mergny, J L; Boutorine, A S; Garestier, T; Belloc, F; Rougée, M; Bulychev, N V; Koshkin, A A; Bourson, J; Lebedev, A V; Valeur, B

    1994-01-01

    The primary or secondary structure of single-stranded nucleic acids has been investigated with fluorescent oligonucleotides, i.e., oligonucleotides covalently linked to a fluorescent dye. Five different chromophores were used: 2-methoxy-6-chloro-9-amino-acridine, coumarin 500, fluorescein, rhodamine and ethidium. The chemical synthesis of derivatized oligonucleotides is described. Hybridization of two fluorescent oligonucleotides to adjacent nucleic acid sequences led to fluorescence excitation energy transfer between the donor and the acceptor dyes. This phenomenon was used to probe primary and secondary structures of DNA fragments and the orientation of oligodeoxynucleotides synthesized with the alpha-anomers of nucleoside units. Fluorescence energy transfer can be used to reveal the formation of hairpin structures and the translocation of genes between two chromosomes. PMID:8152922

  17. Amino acid sequence of two neurotoxins from the venom of the Egyptian black snake (Walterinnesia aegyptia).

    PubMed

    Samejima, Y; Aoki-Tomomatsu, Y; Yanagisawa, M; Mebs, D

    1997-02-01

    The venom of the Egyptian black snake Walterinnesia aegyptia contains at least three toxins, which act postsynaptically to block the neuromuscular transmission of isolated rat phrenic nerve-diaphragm and chicken biventer cervicis muscle. The complete amino acid sequence of the two toxins, W-III and W-IV, consisting of 62 amino acid residues, was elucidated by Edman degradation of fragments obtained after Staphylococcus aureus protease and prolylpeptidase digestion. Although the toxins exhibit close structural homology to other short-chain postsynaptic neurotoxins from Elapidae venoms, toxin IV is unique by having a free SH-group (cysteine) at position 16. In position 35 of W-III, which is located at the tip of the central loop, threonine is replaced by lysine, which may alter the interaction of the toxin with the acetylcholine receptor, since the toxin is seven times less lethal than toxin W-IV.

  18. Complete genome sequence of Lactococcus lactis IO-1, a lactic acid bacterium that utilizes xylose and produces high levels of L-lactic acid.

    PubMed

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-04-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly L-lactic acid at high xylose concentrations. From ortholog analysis with other five L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  19. Complete genome sequence of Bacillus amyloliquefaciens LL3, which exhibits glutamic acid-independent production of poly-γ-glutamic acid.

    PubMed

    Geng, Weitao; Cao, Mingfeng; Song, Cunjiang; Xie, Hui; Liu, Li; Yang, Chao; Feng, Jun; Zhang, Wei; Jin, Yinghong; Du, Yang; Wang, Shufang

    2011-07-01

    Bacillus amyloliquefaciens is one of most prevalent Gram-positive aerobic spore-forming bacteria with the ability to synthesize polysaccharides and polypeptides. Here, we report the complete genome sequence of B. amyloliquefaciens LL3, which was isolated from fermented food and presents the glutamic acid-independent production of poly-γ-glutamic acid.

  20. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have important role in environmental issues not only on the Earth but also other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. We can see various iron- bearing alteration products in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrence including alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of core and the crust. Reddish core is characterized by occurrence of crystalline hematite by XRD. Another hand specimen has light gray core, which represents reduced conditions, and white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material shows iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. Variations of alteration

  1. Design, synthesis, and characterization of a protein sequencing reagent yielding amino acid derivatives with enhanced detectability by mass spectrometry.

    PubMed Central

    Aebersold, R.; Bures, E. J.; Namchuk, M.; Goghari, M. H.; Shushan, B.; Covey, T. C.

    1992-01-01

    We report the design, chemical synthesis, and structural and functional characterization of a novel reagent for protein sequence analysis by the Edman degradation, yielding amino acid derivatives rapidly detectable at high sensitivity by ion-evaporation mass spectrometry. We demonstrate that the reagent 3-[4'(ethylene-N,N,N-trimethylamino)phenyl]-2-isothiocyanate is chemically stable and shows coupling and cyclization/cleavage yields comparable to phenylisothiocyanate, the standard reagent in chemical sequence analysis, under conditions typically encountered in manual or automated sequence analysis. Amino acid derivatives generated with this reagent were detectable by ion-evaporation mass spectrometry at the subfemtomole sensitivity level at a pace of one sample per minute. Furthermore, derivatives were identified by their mass, thus permitting the rapid and highly sensitive determination of the molecular nature of modified amino acids. Derivatives of amino acids with acidic, basic, polar, or hydrophobic side chains were reproducibly detectable at comparable sensitivities. The polar nature of the reagent required covalent immobilization of polypeptides prior to automated sequence analysis. This reagent, used in automated sequence analysis, has the potential for overcoming the limitations in sensitivity, speed, and the ability to characterize modified amino acid residues inherent in the chemical sequencing methods that are currently used. PMID:1304351

  2. [Genomic structure of the autotetraploid oat species Avena macrostachya inferred from comparative analysis of the ITS1 and ITS2 sequences: on the oat karyotype evolution during the early stages of the Avena species divergence].

    PubMed

    Rodionov, A V; Tiupa, N B; Kim, E S; Machs, E M; Loskutov, I G

    2005-05-01

    To examine the genomic structure of Avena macrostachya, internal transcribed spacers, ITS1 and ITS2, as well as nuclear 5.8S tRNA genes from three oat species with AsAs karyotype (A. wiestii, A. hirtula, and A. atlantica), and those from A. longiglumis (AlAl), A. canariensis (AcAc), A. ventricosa (CvCv), A. pilosa, and A. clauda (CpCp) were sequenced. All species of the genus Avena examined represented a monophyletic group (bootstrap index = 98), within which two branches, i.e., species with A- and C-genomes, were distinguished (bootstrap indices = 100). The subject of our study, A. macrostachya, albeit belonging to the phylogenetic branch of C-genome oat species (karyotype with submetacentic and subacrocentric chromosomes), has preserved an isobrachyal karyotype, (i.e., that containing metacentric chromosomes), probably typical of the common Avena ancestor. It was suggested to classify the A. macrostachya genome as a specific form of C-genome, Cm-genome. Among the species from other genera studied, Arrhenatherum elatius was found to be the closest to Avena in ITS1 and ITS structure. Phylogenetic relationships between Avena and Helictotrichon remain intriguingly uncertain. The HPR389153 sequence from H. pratense genome was closest to the ITS1 sequences specific to the Avena A-genomes (p-distance = 0.0237), while the differences of this sequence from the ITS1 of A. macrostachya reached 0.1221. On the other hand, HAD389117 from H. adsurgens was close to the ITS1 specific to Avena C-genomes (p-distance = 0.0189), while its differences from the A-genome specific ITS1 sequences reached 0.1221. It seems likely that the appearance of highly polyploid (2n = 12-21x) species of H. pratense and H. adsurgens could be associated with interspecific hybridization involving Mediterranean oat species carrying A- and C-genomes. A hypothesis on the pathways of Avena chromosomes evolution during the early stages the oat species divergence is proposed.

  3. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production.

    PubMed

    Coulson, Thomas J D; Patten, Cheryl L

    2015-08-06

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences.

  4. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production

    PubMed Central

    Coulson, Thomas J. D.

    2015-01-01

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences. PMID:26251488

  5. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic acid bacterium used in the production of many fermented foods, such as dairy products. Here, we report the genome sequence of L. lactis subsp. lactis TOMSC161, isolated from nonscalded curd pressed cheese. This genome sequence provides information in relation to dairy environment adaptation. PMID:25377704

  6. Deoxyribonucleic acid sequence of araBAD promoter mutants of Escherichia coli.

    PubMed

    Horwitz, A H; Morandi, C; Wilcox, G

    1980-05-01

    The controlling site region for the araBAD operon is defined, in part, by two classes of cis-acting constitutive mutations. The aralc mutations allow low-level constitutive expression of ara-BAD in the absence of the positive regulatory protein coded for by the araC gene, whereas the araXc mutations allow expression of araBAD in the absence of the cyclic adenosine monophosphate receptor protein. Six independently isolated aralc mutations and three independently isolated araXc mutations were cloned onto the plasmid pBR322 using in vitro recombinant deoxyribonucleic acid techniques and in vivo recombination between plasmid and chromosomal deoxyribonucleic acid. The location of these mutations was determined by deoxyribonucleic acid sequence analysis. All of the aralc mutations occurred at position -35 within the araBAD promoter (+1 = messenger ribonucleic acid start for araBAD) and resulted from an AT leads to GC transition. All of the araXc mutations occurred at position -10 within the araBAD promoter and resulted from a GC leads to AT transition. Models are presented to explain the mode of action of the aralc and araXc mutations.

  7. In the TTF-1 homeodomain the contribution of several amino acids to DNA recognition depends on the bound sequence.

    PubMed Central

    Fabbro, D; Tell, G; Leonardi, A; Pellizzari, L; Pucillo, C; Lonigro, R; Formisano, S; Damante, G

    1996-01-01

    The thyroid transcription factor-1 homeodomain (TTF-1HD) shows a peculiar DNA binding specificity, preferentially recognizing sequences containing the 5'-CAAG-3' core motif. Most other homeodomains instead recognize sites containing the 5'-TAAT-3' core motif. Here, we show that TTF-1HD efficiently recognizes another sequence, called D1, devoid of the 5'-CAAG-3' core motif. Different experimental approaches indicate that TTF-1HD contacts the D1 sequence in a manner which is different to that used to interact with sequences containing the 5'-CAAG-3' core motif. The binding activities that mutants of TTF-1HD display with the D1 sequence or with the sequence containing the 5'-CAAG-3' core motif indicate that the role of several DNA-contacting amino acids is different. In particular, during recognition of the D1 sequence, backbone-interacting amino acids not relevant in binding to sequences containing the 5'-CAAG-3' core motif play an important role. In the TTF-1HD, therefore, the contribution of several amino acids to DNA recognition depends on the bound sequence. These data indicate that although a common bonding network exists in all of the HD/DNA complexes, peculiarities important for DNA recognition may occur in single cases. PMID:8811078

  8. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed Central

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-01-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses. Images PMID:3025846

  9. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are a coagulase-negative staphylococci considered for a long time as unable to cause infections. This situation changed recently and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of respective hemolysins from S. cohnii ssp. cohnii (named as H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.

  10. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    PubMed

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  11. The Sequence-Specific Cellular Uptake of Spherical Nucleic Acid Nanoparticle Conjugates

    PubMed Central

    Narayan, Suguna P.; Choi, Chung Hang J.; Hao, Liangliang; Calabrese, Colin M.; Auyeung, Evelyn; Zhang, Chuan; Goor, Olga J.G.M.

    2015-01-01

    We investigated the sequence-dependent cellular uptake of spherical nucleic acid nanoparticle conjugates (SNAs). This process occurs by interaction with class A scavenger receptors (SR-A) and caveolae-mediated endocytosis. It is known that linear poly(guanine) (poly G) is a natural ligand for SR-A, and it has been proposed that interaction of poly G with SR-A is dependent on the formation of G-quadruplexes. Since G-rich oligonucleotides are known to interact strongly with SR-A, we hypothesized that SNAs with higher G contents would be able to enter cells in larger amounts than SNAs composed of other nucleotides, and as such we measured cellular internalization of SNAs as a function of constituent oligonucleotide sequence. Indeed, SNAs with enriched G content show the highest cellular uptake. Using this hypothesis, we chemically conjugated a small molecule (camptothecin) with SNAs to create drug-SNA conjugates and observed that poly G SNAs deliver the most camptothecin to cells and have the highest cytotoxicity in cancer cells. Our data elucidate important design considerations for enhancing the intracellular delivery of spherical nucleic acids. PMID:26097111

  12. Partial amino acid sequences around sulfhydryl groups of soybean beta-amylase.

    PubMed

    Nomura, K; Mikami, B; Morita, Y

    1987-08-01

    Sulfhydryl (SH) groups of soybean beta-amylase were modified with 5-(iodoaceto-amidoethyl)aminonaphthalene-1-sulfonate (IAEDANS) and the SH-containing peptides exhibiting fluorescence were purified after chymotryptic digestion of the modified enzyme. The sequence analysis of the peptides derived from the modification of all SH groups in the denatured enzyme revealed the existence of six SH groups, in contrast to five reported previously. One of them was found to have extremely low reactivity toward SH-reagents without reduction. In the native state, IAEDANS reacted with 2 mol of SH groups per mol of the enzyme (SH1 and SH2) accompanied with inactivation of the enzyme owing to the modification of SH2 located near the active site of this enzyme. The selective modification of SH2 with IAEDANS was attained after the blocking of SH1 with 5,5'-dithiobis-(2-nitrobenzoic acid). The amino acid sequences of the peptides containing SH1 and SH2 were determined to be Cys-Ala-Asn-Pro-Gln and His-Gln-Cys-Gly-Gly-Asn-Val-Gly-Asp-Ile-Val-Asn-Ile-Pro-Ile-Pro-Gln-Trp, respectively.

  13. Genome Sequence of Lactobacillus rhamnosus Strain CASL, an Efficient l-Lactic Acid Producer from Cheap Substrate Cassava

    PubMed Central

    Yu, Bo; Su, Fei; Wang, Limin; Zhao, Bo; Qin, Jiayang; Ma, Cuiqing; Xu, Ping; Ma, Yanhe

    2011-01-01

    Lactobacillus rhamnosus is a type of probiotic bacteria with industrial potential for l-lactic acid production. We announce the draft genome sequence of L. rhamnosus CASL (2,855,156 bp with a G+C content of 46.6%), which is an efficient producer of l-lactic acid from cheap, nonfood substrate cassava with a high production titer. PMID:22123765

  14. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  15. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed Central

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-01-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus. PMID:3355530

  16. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    PubMed Central

    2010-01-01

    Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C. sticklandii genome and

  17. Amino acid sequence of neurotoxin III of the scorpion Androctonus austrialis Hector.

    PubMed

    Kopeyan, C; Martinez, G; Rochat, H

    1979-03-01

    The amino acid sequence of neurotoxin III, purified from the venom of the North African scorpion Androctonus australis Hector, has been determined by Edman degradation using a liquid-phase sequencer. Carboxypeptidase A hydrolyses confirmed not only the sequence of the five last residues but also the presence of a free alpha-carboxylic group at the C-terminus. Edman degradation was conducted on one hand with the Quadrol [N,N,N',N'-tetrakis(2-hydroxypropyl)ethylene diamine] program and S-alkylated protein before or after coupling with sulfophenylisothiocynate (the first 34 residues were thus identified), on the other hand on tryptic and chymotryptic peptides with a dimethylbenzylamine program (residues 1--23 and 31--34 were confirmed, the positions of residues 35-64 were established). Neurotoxin III was found to belong to the same group of scorpion toxins active on mammals as neurotoxin I purified from the same venom (50 homologous positions exist in the two proteins).

  18. Purification, amino acid sequence and characterisation of kangaroo IGF-I.

    PubMed

    Yandell, C A; Francis, G L; Wheldrake, J F; Upton, Z

    1998-01-01

    Insulin-like growth factor-I (IGF-I) and IGF-II have been purified to homogeneity from kangaroo (Macropus fuliginosus) serum, thus this represents the first report of the purification, sequencing and characterisation of marsupial IGFs. N-Terminal protein sequencing reveals that there are six amino acid differences between kangaroo and human IGF-I. Kangaroo IGF-II has been partially sequenced and no differences were found between human and kangaroo IGF-II in the 53 residues identified. Thus the IGFs appear to be remarkably structurally conserved during mammalian radiation. In addition, in vitro characterisation of kangaroo IGF-I demonstrated that the functional properties of human, kangaroo and chicken IGF-I are very similar. In an assay measuring the ability of the proteins to stimulate protein synthesis in rat L6 myoblasts, all IGF-I proteins were found to be equally potent. The ability of all three proteins to compete for binding with radiolabelled human IGF-I to type-1 IGF receptors in L6 myoblasts and in Sminthopsis crassicaudata transformed lung fibroblasts, a marsupial cell line, was comparable. Furthermore, kangaroo and human IGF-I react equally in a human IGF-I RIA using a human reference standard, radiolabelled human IGF-I and a polyclonal antibody raised against recombinant human IGF-I. This study indicates that not only is the primary structure of eutherian and metatherian IGF-I conserved, but also the proteins appear to be functionally similar.

  19. Complete Genome Sequence of the Prototype Lactic Acid Bacterium Lactococcus lactis subsp. cremoris MG1363▿

    PubMed Central

    Wegmann, Udo; O'Connell-Motherway, Mary; Zomer, Aldert; Buist, Girbe; Shearman, Claire; Canchaya, Carlos; Ventura, Marco; Goesmann, Alexander; Gasson, Michael J.; Kuipers, Oscar P.; van Sinderen, Douwe; Kok, Jan

    2007-01-01

    Lactococcus lactis is of great importance for the nutrition of hundreds of millions of people worldwide. This paper describes the genome sequence of Lactococcus lactis subsp. cremoris MG1363, the lactococcal strain most intensively studied throughout the world. The 2,529,478-bp genome contains 81 pseudogenes and encodes 2,436 proteins. Of the 530 unique proteins, 47 belong to the COG (clusters of orthologous groups) functional category “carbohydrate metabolism and transport,” by far the largest category of novel proteins in comparison with L. lactis subsp. lactis IL1403. Nearly one-fifth of the 71 insertion elements are concentrated in a specific 56-kb region. This integration hot-spot region carries genes that are typically associated with lactococcal plasmids and a repeat sequence specifically found on plasmids and in the “lateral gene transfer hot spot” in the genome of Streptococcus thermophilus. Although the parent of L. lactis MG1363 was used to demonstrate lysogeny in Lactococcus, L. lactis MG1363 carries four remnant/satellite phages and two apparently complete prophages. The availability of the L. lactis MG1363 genome sequence will reinforce its status as the prototype among lactic acid bacteria through facilitation of further applied and fundamental research. PMID:17307855

  20. The ABRF Edman Sequencing Research Group 2008 Study: Investigation into Homopolymeric Amino Acid N-Terminal Sequence Tags and Their Effects on Automated Edman Degradation

    PubMed Central

    Thoma, R. S.; Smith, J. S.; Sandoval, W.; Leone, J. W.; Hunziker, P.; Hampton, B.; Linse, K. D.; Denslow, N. D.

    2009-01-01

    The Edman Sequence Research Group (ESRG) of the Association of Biomolecular Resource designs and executes interlaboratory studies investigating the use of automated Edman degradation for protein and peptide analysis. In 2008, the ESRG enlisted the help of core sequencing facilities to investigate the effects of a repeating amino acid tag at the N-terminus of a protein. Commonly, to facilitate protein purification, an affinity tag containing a polyhistidine sequence is conjugated to the N-terminus of the protein. After expression, polyhistidine-tagged protein is readily purified via chelation with an immobilized metal affinity resin. The addition of the polyhistidine tag presents unique challenges for the determination of protein identity using Edman degradation chemistry. Participating laboratories were asked to sequence one protein engineered in three configurations: with an N-terminal polyhistidine tag; with an N-terminal polyalanine tag; or with no tag. Study participants were asked to return a data file containing the uncorrected amino acid picomole yields for the first 17 cycles. Initial and repetitive yield (R.Y.) information and the amount of lag were evaluated. Information about instrumentation and sample treatment was also collected as part of the study. For this study, the majority of participating laboratories successfully called the amino acid sequence for 17 cycles for all three test proteins. In general, laboratories found it more difficult to call the sequence containing the polyhistidine tag. Lag was observed earlier and more consistently with the polyhistidine-tagged protein than the polyalanine-tagged protein. Histidine yields were significantly less than the alanine yields in the tag portion of each analysis. The polyhistidine and polyalanine protein-R.Y. calculations were found to be equivalent. These calculations showed that the nontagged portion from each protein was equivalent. The terminal histidines from the tagged portion of the protein

  1. The amino acid sequence around the active-site cysteine and histidine residues, and the buried cysteine residue in ficin.

    PubMed

    Husain, S S; Lowe, G

    1970-04-01

    Ficin that had been prepared from the latex of Ficus glabrata by salt fractionation and chromatography on carboxymethylcellulose was completely and irreversibly inhibited with 1,3-dibromo[2-(14)C]acetone and then treated with N-(4-dimethylamino-3,5-dinitrophenyl)maleimide in 6m-guanidinium chloride. After reduction and carboxymethylation of the labelled protein, it was digested with trypsin and alpha-chymotrypsin. Two radioactive peptides and two coloured peptides were isolated chromatographically and their sequences determined. The radioactive peptides revealed the amino acid sequences around the active-site cysteine and histidine residues and showed a high degree of homology with the omino acid sequence around the active-site cysteine and histidine residues in papain. The coloured peptides allowed the amino acid sequence around the buried cysteine residue in ficin to be determined.

  2. The `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis: isolation of the gene, nucleotide and amino acid sequence

    PubMed Central

    Michel, H.; Weyer, K. A.; Gruenberg, H.; Lottspeich, F.

    1985-01-01

    The gene coding for the `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis was isolated in an expression vector. Expression of the heavy subunit in Escherichia coli was detected with antibodies raised against crystalline reaction centres. The entire subunit, and not a fusion protein, was expressed in E. coli. The protein coding region of the gene was sequenced and the amino acid sequence derived. Part of the amino acid sequence was confirmed by chemical sequence analysis of the protein. The heavy subunit consists of 258 amino acids and its mol. wt. is 28 345. It possesses one membrane-spanning α-helical segment, as was revealed by the concomitant X-ray structure analysis. ImagesFig. 1.Fig. 2. PMID:16453623

  3. Purification, amino acid sequence and immunological characterization of Ole e 6, a cysteine-enriched allergen from olive tree pollen.

    PubMed

    Batanero, E; Ledesma, A; Villalba, M; Rodríguez, R

    1997-06-30

    The Ole e 6 allergen from olive tree pollen has been isolated by combining gel permeation and reverse-phase chromatographies. It is a single and highly acidic (pI 4.2) polypeptide chain protein. Its NH2-terminal amino acid sequence has been determined by Edman degradation. Total RNA from the olive tree pollen was isolated, and a specific cDNA was amplified by the polymerase chain reaction using a degenerate oligonucleotide primer designed according to the NH2-terminal sequence of the protein. The nucleotide sequencing of the cDNA rendered an open reading frame encoding a 50 amino acid polypeptide chain, in which two sets of the sequential motif Cys-X3-Cys-X3-Cys are present. No sequence similarity has been found between this protein and other previously described polypeptides.

  4. Nucleotide and derived amino acid sequences of the major porin of Comamonas acidovorans and comparison of porin primary structures.

    PubMed Central

    Gerbl-Rieger, S; Peters, J; Kellermann, J; Lottspeich, F; Baumeister, W

    1991-01-01

    The DNA sequence of the gene which codes for the major outer membrane porin (Omp32) of Comamonas acidovorans has been determined. The structural gene encodes a precursor consisting of 351 amino acid residues with a signal peptide of 19 amino acid residues. Comparisons with amino acid sequences of outer membrane proteins and porins from several other members of the class Proteobacteria and of the Chlamydia trachomatis porin and the Neurospora crassa mitochondrial porin revealed a motif of eight regions of local homology. The results of this analysis are discussed with regard to common structural features of porins. PMID:1848840

  5. The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.

    PubMed

    White, S H

    1994-04-01

    This paper continues an examination of the hypothesis that modern proteins evolved from random heteropeptide sequences. In support of the hypothesis, White and Jacobs (1993, J Mol Evol 36:79-95) have shown that any sequence chosen randomly from a large collection of nonhomologous proteins has a 90% or better chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. The goal of the present study was to investigate the possibility that the random-origin hypothesis could explain the lengths of modern protein sequences without invoking specific mechanisms such as gene duplication or exon splicing. The sets of sequences examined were taken from the 1989 PIR database and consisted of 1,792 "super-family" proteins selected to have little sequence identity, 623 E. coli sequences, and 398 human sequences. The length distributions of the proteins could be described with high significance by either of two closely related probability density functions: The gamma distribution with parameter 2 or the distribution for the sum of two exponential random independent variables. A simple theory for the distributions was developed which assumes that (1) protoprotein sequences had exponentially distributed random independent lengths, (2) the length dependence of protein stability determined which of these protoproteins could fold into compact primitive proteins and thereby attain the potential for biochemical activity, (3) the useful protein sequences were preserved by the primitive genome, and (4) the resulting distribution of sequence lengths is reflected by modern proteins. The theory successfully predicts the two observed distributions which can be distinguished by the functional form of the dependence of protein stability on length. The theory leads to three interesting conclusions. First, it predicts that a tetra-nucleotide was the signal for primitive translation termination. This prediction is

  6. Fragmentation Characteristics of Deprotonated N-linked Glycopeptides: Influences of Amino Acid Composition and Sequence

    NASA Astrophysics Data System (ADS)

    Nishikaze, Takashi; Kawabata, Shin-ichirou; Tanaka, Koichi

    2014-06-01

    Glycopeptide structural analysis using tandem mass spectrometry is becoming a common approach for elucidating site-specific N-glycosylation. The analysis is generally performed in positive-ion mode. Therefore, fragmentation of protonated glycopeptides has been extensively investigated; however, few studies are available on deprotonated glycopeptides, despite the usefulness of negative-ion mode analysis in detecting glycopeptide signals. Here, large sets of glycopeptides derived from well-characterized glycoproteins were investigated to understand the fragmentation behavior of deprotonated N-linked glycopeptides under low-energy collision-induced dissociation (CID) conditions. The fragment ion species were found to be significantly variable depending on their amino acid sequence and could be classified into three types: (i) glycan fragment ions, (ii) glycan-lost fragment ions and their secondary cleavage products, and (iii) fragment ions with intact glycan moiety. The CID spectra of glycopeptides having a short peptide sequence were dominated by type (i) glycan fragments (e.g., 2,4AR, 2,4AR-1, D, and E ions). These fragments define detailed structural features of the glycan moiety such as branching. For glycopeptides with medium or long peptide sequences, the major fragments were type (ii) ions (e.g., [peptide + 0,2X0-H]- and [peptide-NH3-H]-). The appearance of type (iii) ions strongly depended on the peptide sequence, and especially on the presence of Asp, Asn, and Glu. When a glycosylated Asn is located on the C-terminus, an interesting fragment having an Asn residue with intact glycan moiety, [glycan + Asn-36]-, was abundantly formed. Observed fragments are reasonably explained by a combination of existing fragmentation rules suggested for N-glycans and peptides.

  7. An amino acid sequence motif sufficient for subnuclear localization of an arginine/serine-rich splicing factor.

    PubMed

    Hedley, M L; Amrein, H; Maniatis, T

    1995-12-05

    We have identified an amino acid sequence in the Drosophila Transformer (Tra) protein that is capable of directing a heterologous protein to nuclear speckles, regions of the nucleus previously shown to contain high concentrations of spliceosomal small nuclear RNAs and splicing factors. This sequence contains a nucleoplasmin-like bipartite nuclear localization signal (NLS) and a repeating arginine/serine (RS) dipeptide sequence adjacent to a short stretch of basic amino acids. Sequence comparisons from a number of other splicing factors that colocalize to nuclear speckles reveal the presence of one or more copies of this motif. We propose a two-step subnuclear localization mechanism for splicing factors. The first step is transport across the nuclear envelope via the nucleoplasmin-like NLS, while the second step is association with components in the speckled domain via the RS dipeptide sequence.

  8. Purification and partial amino acid sequence of the chloroplast cytochrome b-559.

    PubMed

    Widger, W R; Cramer, W A; Hermodson, M; Meyer, D; Gullifor, M

    1984-03-25

    The hydrophobic cytochrome b-559, purified from unstacked, ethanol-washed spinach thylakoid membranes, using extraction with 2% Triton X-100 in 4 M urea and three chromatographic steps in the presence of protease inhibitors, has a dominant band on sodium dodecyl sulfate-urea gels corresponding to Mr = 10,000. The yield of this preparation is 30-50% (5-10 mg) starting with 600 mg of chlorophyll. The heme content yields a calculated molecular weight of no more than 17,500/heme, and perhaps somewhat smaller after correction for impurities. The Mr = 10,000 band is stained by the tetramethylbenzidine-H2O2 heme reagent on lithium dodecyl sulfate gels run at 0 degrees C. The Mr = 10,000 protein, further separated by high performance liquid chromatography, contains a unique NH2 terminus that is not blocked, and the amino acid sequence for the first 27 residues is NH2-Ser-Gly-Ser-Thr-Gly-Glu-Arg-Ser-Phe-Ala-Asp-Ile-Ile-Thr-Ser-Ile-Arg-Tyr-Trp -Val-Ile-X-Ser-Ile-Thr-Ile-Pro. . . COOH. Approximately 55% of the amino acids are hydrophobic, based on amino acid analysis of the Mr = 10,000 peptide, which also indicated the presence of at least one histidine. Only one cytochrome b-559 component could be identified, whose yield indicated that it arises from a single b-559 protein in chloroplasts corresponding to the in situ high potential cytochrome of the chloroplast photosystem II.

  9. Evolutionary Divergence of Arabidopsis thaliana Classical Peroxidases.

    PubMed

    Kupriyanova, E V; Mamoshina, P O; Ezhova, T A

    2015-10-01

    Polymorphisms of 62 peroxidase genes derived from Arabidopsis thaliana were investigated to evaluate evolutionary dynamics and divergence of peroxidase proteins. By comparing divergence of duplicated genes AtPrx53-AtPrx54 and AtPrx36-AtPrx72 and their products, nucleotide and amino acid substitutions were identified that were apparently targets of positive selection. These substitutions were detected among paralogs of 461 ecotypes from Arabidopsis thaliana. Some of these substitutions are conservative and matched paralogous peroxidases in other Brassicaceae species. These results suggest that after duplication, peroxidase genes evolved under the pressure of positive selection, and amino acid substitutions identified during our study provided divergence of properties and physiological functions in peroxidases. Our predictions regarding functional significance for amino acid residues identified in variable sites of peroxidases may allow further experimental assessment of evolution of peroxidases after gene duplication.

  10. Alignment of 700 globin sequences: extent of amino acid substitution and its correlation with variation in volume.

    PubMed Central

    Kapp, O. H.; Moens, L.; Vanfleteren, J.; Trotman, C. N.; Suzuki, T.; Vinogradov, S. N.

    1995-01-01

    Seven-hundred globin sequences, including 146 nonvertebrate sequences, were aligned on the basis of conservation of secondary structure and the avoidance of gap penalties. Of the 182 positions needed to accommodate all the globin sequences, only 84 are common to all, including the absolutely conserved PheCD1 and HisF8. The mean number of amino acid substitutions per position ranges from 8 to 13 for all globins and 5 to 9 for internal positions. Although the total sequence volumes have a variation approximately 2-3%, the variation in volume per position ranges from approximately 13% for the internal to approximately 21% for the surface positions. Plausible correlations exist between amino acid substitution and the variation in volume per position for the 84 common and the internal but not the surface positions. The amino acid substitution matrix derived from the 84 common positions was used to evaluate sequence similarity within the globins and between the globins and phycocyanins C and colicins A, via calculation of pairwise similarity scores. The scores for globin-globin comparisons over the 84 common positions overlap the globin-phycocyanin and globin-colicin scores, with the former being intermediate. For the subset of internal positions, overlap is minimal between the three groups of scores. These results imply a continuum of amino acid sequences able to assume the common three-on-three alpha-helical structure and suggest that the determinants of the latter include sites other than those inaccessible to solvent. PMID:8535255

  11. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    SciTech Connect

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-07-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO/sub 4//PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  12. Real-Time Nucleic Acid Sequence-Based Amplification Assay for Detection of Hepatitis A Virus

    PubMed Central

    Abd El Galil, Khaled H.; El Sokkary, M. A.; Kheira, S. M.; Salazar, Andre M.; Yates, Marylynn V.; Chen, Wilfred; Mulchandani, Ashok

    2005-01-01

    A nucleic acid sequence-based amplification (NASBA) assay in combination with a molecular beacon was developed for the real-time detection and quantification of hepatitis A virus (HAV). A 202-bp, highly conserved 5′ noncoding region of HAV was targeted. The sensitivity of the real-time NASBA assay was tested with 10-fold dilutions of viral RNA, and a detection limit of 1 PFU was obtained. The specificity of the assay was demonstrated by testing with other environmental pathogens and indicator microorganisms, with only HAV positively identified. When combined with immunomagnetic separation, the NASBA assay successfully detected as few as 10 PFU from seeded lake water samples. Due to its isothermal nature, its speed, and its similar sensitivity compared to the real-time RT-PCR assay, this newly reported real-time NASBA method will have broad applications for the rapid detection of HAV in contaminated food or water. PMID:16269748

  13. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes , cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  14. Sequence-defined shuttles for targeted nucleic acid and protein delivery.

    PubMed

    Röder, Ruth; Wagner, Ernst

    2014-01-01

    Molecular medicine opens into a space of novel specific therapeutic agents: intracellularly active drugs such as peptides, proteins or nucleic acids, which are not able to cross cell membranes and enter the intracellular space on their own. Through the development of cell-targeted shuttles for specific delivery, this restriction in delivery has the potential to be converted into an advantage. On the one hand, due to the multiple extra- and intracellular barriers, such carrier systems need to be multifunctional. On the other hand, they must be precise and reproducibly manufactured due to pharmaceutical reasons. Here we review the design of precise sequence-defined delivery carriers, including solid-phase synthesized peptides and nonpeptidic oligomers, or nucleotide-based carriers such as aptamers and origami nanoboxes.

  15. Identification of amino acid sequences in the polyomavirus capsid proteins that serve as nuclear localization signals

    NASA Technical Reports Server (NTRS)

    Chang, D.; Haynes, J. I. Jr; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    The molecular mechanism participating in the transport of newly synthesized proteins from the cytoplasm to the nucleus in mammalian cells is poorly understood. Recently, the nuclear localization signal sequences (NLS) of many nuclear proteins have been identified, and most have been found to be composed of a highly basic amino acid stretch. A genetic "subtractive" and a biochemical "additive" approach were used in our studies to identify the NLS's of the polyomavirus structural capsid proteins. An NLS was identified at the N-terminus (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) of the major capsid protein VP1 and at the C-terminus (Glu307 -Glu-Asp-Gly-Pro-Glu-Lys-Lys-Lys-Arg-Arg-Leu318) of the VP2/VP3 minor capsid proteins.

  16. The amino acid sequence of a carbohydrate-containing fragment of hen ovotransferrin.

    PubMed Central

    Kingston, I B; Williams, J

    1975-01-01

    1. Hen ovotransferrin was treated with CNBr and fractionated by gel filtration. 2. After further treatment by reduction and carboxymethylation a carbohydrate-containing fragment of molecular weight 11990 was obtained (fragment BCd). 3. The amino acid sequence of this fragment was determined. It consists of a single chain of 94 residues. 4. The structure of a tryptic glycopeptide derived from whole ovotransferrin permitted a further eight residues to be assigned at the N-terminus of fragment BCd. 5. Heterogeneity was found at two positions. 6. Further evidence has been deposited as Supplementary Publication SUP 50045 (19 pages) at the British Library (Lending Division), Boston Spa, Wetherby, W. Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1975), 145, 5. PMID:1172663

  17. Phylogenetic analysis of beta-papillomaviruses as inferred from nucleotide and amino acid sequence data.

    PubMed

    Gottschling, Marc; Köhler, Anja; Stockfleth, Eggert; Nindl, Ingo

    2007-01-01

    Human papillomaviruses (HPV) of the beta-group seem to be involved in the pathogenesis of non-melanoma skin cancer. Papillomaviruses are host specific and are considered closely co-evolving with their hosts. Evolutionary incongruence between early genes and late genes has been reported among oncogenic genital alpha-papillomaviruses and considerably challenge phylogenetic reconstructions. We investigated the relationships of 29 beta-HPV (25 types plus four putative new types, subtypes, or variants) as inferred from codon aligned and amino acid sequence data of the genes E1, E2, E6, E7, L1, and L2 using likelihood, distance, and parsimony approaches. An analysis of a L1 fragment included additional nucleotide and amino acid sequences from seven non-human beta-papillomaviruses. Early genes and late genes evolution did not conflict significantly in beta-papillomaviruses based on partition homogeneity tests (p > or = 0.001). As inferred from the complete genome analyses, beta-papillomaviruses were monophyletic and segregated into four highly supported monophyletic assemblages corresponding to the species 1, 2, 3, and fused 4/5. They basically split into the species 1 and the remainder of beta-papillomaviruses, whose species 3, 4, and 5 constituted the sistergroup of species 2. beta-Papillomaviruses have been isolated from humans, apes, and monkeys, and phylogenetic analyses of the L1 fragment showed non-human papillomaviruses highly polyphyletic nesting within the HPV species. Thus, host and virus phylogenies were not congruent in beta-papillomaviruses, and multiple invasions across species borders may contribute (additionally to host-linked evolution) to their diversification.

  18. Amino acid sequence homology between rat and human C-reactive protein.

    PubMed Central

    Taylor, J A; Bruton, C J; Anderson, J K; Mole, J E; De Beer, F C; Baltz, M L; Pepys, M B

    1984-01-01

    The rat serum protein that undergoes Ca2+-dependent binding to pneumococcal C-polysaccharide and to phosphocholine residues, and that is evidently a member of the pentraxin family of proteins by virtue of its appearance under the electron microscope, has been variously designated as rat C-reactive protein (CRP) [de Beer, Baltz, Munn, Feinstein, Taylor, Bruton, Clamp & Pepys (1982) Immunology 45, 55-70], 'phosphoryl choline-binding protein' [Nagpurkar & Mookerjea (1981) J. Biol. Chem. 256, 7440-7448] and rat serum amyloid P component (SAP) [Pontet, D'Asnieres, Gache, Escaig & Engler (1981) Biochim. Biophys. Acta 671, 202-210]. The partial amino acid sequence (45 residues) towards the C-terminus of this protein was determined, and it showed 71.7% identity with the known sequence of human CRP but only 54.3% identity with human SAP. Since human CRP and SAP are themselves approximately 50% homologous, the level of identity between the rat protein and human SAP is evidence only of membership of the pentraxin family. In contrast, the much greater resemblance to human CRP confirms that the rat C-polysaccharide-binding/phosphocholine-binding protein is in fact rat CRP. PMID:6477504

  19. Amino acid sequences of alpha-helical segments from S-carbosymethylkerateine-A. Complete sequence of a type-I segment.

    PubMed Central

    Gough, K H; Inglis, A S; Crewther, W G

    1978-01-01

    The amino acid sequence of a type-I helical segment from the low-sulphur protein (S-carboxymethylkerateine-A) of wool was determined by combining automatic and manual-sequencing data. Whereas in the type-II helical segment most of the cationic groups occur in pairs, 11 of the 22 anionic residues in the sequence of the type-I segment were situated next to a second anionic residue. This suggests possible interactions between type-I and type-II helical segments in alpha-keratin. As observed with the sequence of a type-II helical segment a model constructed on 3.6 residues per turn of helix shows a line of hydrophobic residues along the helix, thereby supporting the physicochemical evidence that the molecule is predominantly helical and forms part of a coiled-coil structure. Examination of the sequence data by predictive methods indicates the possibilty of extensive sections of alpha-helix interspersed with discontinuities. The molecule contains a number of regions with peptide sequences identical with those found by other workers after enzymic digestion of fractions from oxidized wool. Images Fig. 1. PMID:697725

  20. Spermatogenesis of the lizard Lacerta vivipara: histological studies and amino acid sequence of a protamine lacertine 1.

    PubMed

    Martinage, A; Depeiges, A; Wouters, D; Morel, L; Sautière, P

    1996-06-01

    The lizard Lacerta vivipara is a seasonal breeder with a well characterized reproductive cycle. An histological study of the lizard testis has been performed at different stages of spermatogenesis and the nuclear basic proteins content was assessed by electrophoretical analysis. Two protamines, lacertines 1 and 2, are present in spermatozoa in April and May. We have isolated lacertine1 and characterized a protamine with a mass of 4,963.7 Da. Amino acid sequence of this protamine (41 residues) was established from data provided by automated Edman degradation. It is characterized by a basic amino acid stretch in the N- and C-terminal regions and by a central part which only consists of 3 different intermingled amino acids. This protamine presents 62% homology with scylliorhinine Z3 from dog-fish Scylliorhinus caniculus and 58% homology with quail protamine. The reported lizard protamine sequence is the first reptilian protamine sequence available so far.

  1. The amino acid sequence of the cytochrome c-554(547) from the chemolithotrophic bacterium Thiobacillus neapolitanus.

    PubMed Central

    Ambler, R P; Meyer, T E; Trudinger, P A; Kamen, M D

    1985-01-01

    An amino acid sequence is proposed for the cytochrome c-554(547) from the bacterium Thiobacillus neapolitanus N.C.I.B. 8539). It consists of a polypeptide chain of 91 residues, with a pair of haem-attachment cysteine residues at positions 15 and 18. There is similarity in sequence with each of the halves of the sequence of the dihaem cytochromes c4 and with a cytochrome c-554(548) from a halophilic strain of Paracoccus. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50127 (11 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1985) 225, 5. PMID:2988504

  2. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  3. Opsin gene duplication and divergence in ray-finned fish.

    PubMed

    Rennison, Diana J; Owens, Gregory L; Taylor, John S

    2012-03-01

    Opsin gene sequences were first reported in the 1980s. The goal of that research was to test the hypothesis that human opsins were members of a single gene family and that variation in human color vision was mediated by mutations in these genes. While the new data supported both hypotheses, the greatest contribution of this work was, arguably, that it provided the data necessary for PCR-based surveys in a diversity of other species. Such studies, and recent whole genome sequencing projects, have uncovered exceptionally large opsin gene repertoires in ray-finned fishes (taxon, Actinopterygii). Guppies and zebrafish, for example, have 10 visual opsin genes each. Here we review the duplication and divergence events that have generated these gene collections. Phylogenetic analyses revealed that large opsin gene repertories in fish have been generated by gene duplication and divergence events that span the age of the ray-finned fishes. Data from whole genome sequencing projects and from large-insert clones show that tandem duplication is the primary mode of opsin gene family expansion in fishes. In some instances gene conversion between tandem duplicates has obscured evolutionary relationships among genes and generated unique key-site haplotypes. We mapped amino acid substitutions at so-called key-sites onto phylogenies and this exposed many examples of convergence. We found that dN/dS values were higher on the branches of our trees that followed gene duplication than on branches that followed speciation events, suggesting that duplication relaxes constraints on opsin sequence evolution. Though the focus of the review is opsin sequence evolution, we also note that there are few clear connections between opsin gene repertoires and variation in spectral environment, morphological traits, or life history traits.

  4. Nucleic acid sequence of an internal image-bearing monoclonal anti-idiotype and its comparison to the sequence of the external antigen.

    PubMed Central

    Bruck, C; Co, M S; Slaoui, M; Gaulton, G N; Smith, T; Fields, B N; Mullins, J I; Greene, M I

    1986-01-01

    The monoclonal anti-idiotypic antibody (mAb2) 87.92.6 directed against the 9B.G5 antibody specific for the virus neutralizing epitope on the mammalian reovirus type 3 hemagglutinin was previously demonstrated to express an internal image of the receptor binding epitope of the reovirus type 3. Furthermore, this mAb2 has autoimmune reactivity to the cell surface receptor of the reovirus. The nucleotide and deduced amino acid sequences of the 87.92.6 mAb2 heavy and light chains are described in this report. The sequence analysis reveals that the same heavy chain variable and joining (VH and JH) gene segments are used by the 87.92.6 anti-idiotypic mAb2 and by the dominant idiotypes of the BALB/c anti-GAT (cGAT) and anti-NP (NPa) responses. [GAT; random polymer that is 60% glutamic acid, 30% alanine, and 10% tyrosine. NP; (4-hydroxy-3-nitrophenyl)-acetyl.] Despite extensive homology at the level of the heavy chain variable regions, the NPa positive BALB/c anti-NP monoclonal antibody 17.2.25 binds neither 9B.G5 nor the cellular receptor for the hemagglutinin. Amino acid sequence comparison between the viral hemagglutinin and the 87.92.6 mAb2 light chain "internal image," reveals an area of significant homology indicating that antigen mimicry by antibodies may be achieved by sharing primary structure. PMID:2428036

  5. Draft Genome Sequence of Escherichia coli O157:H7 ATCC 35150 and a Nalidixic Acid-Resistant Mutant Derivative

    PubMed Central

    Markell, James A.; Koziol, Adam G.

    2015-01-01

    Shiga toxin-producing Escherichia coli strains, occasionally isolated from food, are of public health importance. Here, we report on the 5.30-Mbp draft genome sequence of E. coli O157:H7 EDL931 (strain ATCC 35150) and the 5.32-Mbp draft genome sequence of a nalidixic acid-resistant mutant derivative used as a distinguishable control strain in food-testing laboratories. PMID:26205873

  6. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    PubMed

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.

  7. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  8. Purification, characterization, gene cloning and nucleotide sequencing of D: -stereospecific amino acid amidase from soil bacterium: Delftia acidovorans.

    PubMed

    Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa

    2005-12-01

    The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+ . The NH2 terminal and internal amino acid sequences of the enzyme were determined; and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da); and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.

  9. Effects of Acidic Peptide Size and Sequence on Trivalent Praseodymium Adduction and Electron Transfer Dissociation Mass Spectrometry.

    PubMed

    Commodore, Juliette J; Cassady, Carolyn J

    2017-02-07

    Using the lanthanide ion praseodymium, Pr(III), metallated ion formation and electron transfer dissociation (ETD) were studied for 25 biological and model acidic peptides. For chain lengths of seven or more residues, even highly acidic peptides that can be difficult to protonate by electrospray ionization will metallate and undergo abundant ETD fragmentation. Peptides composed of predominantly acidic residues form only the deprotonated ion, [M + Pr - H](2+) ; this ion yields near complete ETD sequence coverage for larger peptides. Peptides with a mixture of acidic and neutral residues, generate [M + Pr](3+) , which cleaves between every residue for many peptides. Acidic peptides that contain at least one residue with a basic side chain also produce the protonated ion, [M + Pr + H](4+) ; this ion undergoes the most extensive sequence coverage by ETD. Primarily metallated and non-metallated c- and z-ions form for all peptides investigated. Metal adducted product ions are only present when at least half of the peptide sequence can be incorporated into the ion; this suggests that the metal ion simultaneously attaches to more than one acidic site. The only site consistently lacking dissociation is at the N-terminal side of a proline residue. Increasing peptide chain length generates more backbone cleavage for metal-peptide complexes with the same charge state. For acidic peptides with the same length, increasing the precursor ion charge state from 2+ to 3+ also leads to more cleavage. The results of this study indicate that highly acidic peptides can be sequenced by ETD of complexes formed with Pr(III).

  10. Complete mitochondrial genomes of three neobatrachian anurans: a case study of divergence time estimation using different data and calibration settings.

    PubMed

    Igawa, Takeshi; Kurabayashi, Atsushi; Usuki, Chisako; Fujii, Tamotsu; Sumida, Masayuki

    2008-01-15

    We sequenced the whole mitochondrial (mt) genomes of three neobatrachian species: Japanese tree frog Hyla japonica, Japanese common toad Bufo japonicus, and narrow-mouthed toad Microhyla okinavensis. The gene arrangements of these genomes diverged from that of basal anurans (suborder Archaeobatrachia), but are the same as that of the members of derived frogs (i.e., superfamily Hyloidae and Ranoidae in suborder Neobatrachia), suggesting the one-time occurrence of a gene rearrangement event in an ancestral lineage of derived anurans. Furthermore, several distinct repeat motifs including putative termination-associated sequences (TASs) and conserved sequence blocks (CSBs) were observed in the control regions (CRs) of B. japonicus and H. japonica, while no repeat motifs were found in that of M. okinavensis. Phylogenetic analyses using both nucleotide and amino acid data of mt genes support monophyly of neobatrachians. The estimated divergence time based on amino acid data with multiple reference points suggests that the three living amphibian orders may have originated in the Carboniferous period, and that the divergences of anurans had occurred between the Permian and Tertiary periods. We also checked the influence of the data types and the settings of reference times on divergence time estimation. The resultant divergence times estimated from several datasets and reference time settings suggest that the substitution saturation of nucleotide data may lead to overestimated (i.e., older) branching times, especially for early divergent taxa. We also found a highly accelerated substitution rate in neobatrachian mt genes, and fast substitution possibly resulted in overestimation. To correct this erroneous estimation, it is efficient to apply several reference points among neobatrachians.

  11. Gene duplication and divergence affecting drug content in Cannabis sativa.

    PubMed

    Weiblen, George D; Wenger, Jonathan P; Craft, Kathleen J; ElSohly, Mahmoud A; Mehmedic, Zlatko; Treiber, Erin L; Marks, M David

    2015-12-01

    Cannabis sativa is an economically important source of durable fibers, nutritious seeds, and psychoactive drugs but few economic plants are so poorly understood genetically. Marijuana and hemp were crossed to evaluate competing models of cannabinoid inheritance and to explain the predominance of tetrahydrocannabinolic acid (THCA) in marijuana compared with cannabidiolic acid (CBDA) in hemp. Individuals in the resulting F2 population were assessed for differential expression of cannabinoid synthase genes and were used in linkage mapping. Genetic markers associated with divergent cannabinoid phenotypes were identified. Although phenotypic segregation and a major quantitative trait locus (QTL) for the THCA/CBDA ratio were consistent with a simple model of codominant alleles at a single locus, the diversity of THCA and CBDA synthase sequences observed in the mapping population, the position of enzyme coding loci on the map, and patterns of expression suggest multiple linked loci. Phylogenetic analysis further suggests a history of duplication and divergence affecting drug content. Marijuana is distinguished from hemp by a nonfunctional CBDA synthase that appears to have been positively selected to enhance psychoactivity. An unlinked QTL for cannabinoid quantity may also have played a role in the recent escalation of drug potency.

  12. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  13. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    PubMed

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin.

  14. Molecular cloning and sequencing of a cDNA encoding the thioesterase domain of the rat fatty acid synthetase.

    PubMed

    Naggert, J; Witkowski, A; Mikkelsen, J; Smith, S

    1988-01-25

    A cloned cDNA containing the entire coding sequence for the long-chain S-acyl fatty acid synthetase thioester hydrolase (thioesterase I) component as well as the 3'-noncoding region of the fatty acid synthetase has been isolated using an expression vector and domain-specific antibodies. The coding region was assigned to the thioesterase I domain by identification of sequences coding for characterized peptide fragments, amino-terminal analysis of the isolated thioesterase I domain and the presence of the serine esterase active-site sequence motif. The thioesterase I domain is 306 amino acids long with a calculated molecular mass of 33,476 daltons; its DNA is flanked at the 5'-end by a region coding for the acyl carrier protein domain and at the 3'-end by a 1,537-base pairs-long noncoding sequence with a poly(A) tail. The thioesterase I domain exhibits a low, albeit discernible, homology with the discrete medium-chain S-acyl fatty acid synthetase thioester hydrolases (thioesterase II) from rat mammary gland and duck uropygial gland, suggesting a distant but common evolutionary ancestry for these proteins.

  15. Human parainfluenza type 3 virus hemagglutinin-neuraminidase glycoprotein: nucleotide sequence of mRNA and limited amino acid sequence of the purified protein.

    PubMed Central

    Elango, N; Coligan, J E; Jambou, R C; Venkatesan, S

    1986-01-01

    The nucleotide sequence of mRNA for the hemagglutinin-neuraminidase (HN) protein of human parainfluenza type 3 virus obtained from the corresponding cDNA clone had a single long open reading frame encoding a putative protein of 64,254 daltons consisting of 572 amino acids. The deduced protein sequence was confirmed by limited N-terminal amino acid microsequencing of CNBr cleavage fragments of native HN that was purified by immunoprecipitation. The HN protein is moderately hydrophobic and has four potential sites (Asn-X-Ser/Thr) of N-glycosylation in the C-terminal half of the molecule. It is devoid of both the N-terminal signal sequence and the C-terminal membrane anchorage domain characteristic of the hemagglutinin of influenza virus and the fusion (F0) protein of the paramyxoviruses. Instead, it has a single prominent hydrophobic region capable of membrane insertion beginning at 32 residues from the N terminus. This N-terminal membrane insertion is similar to that of influenza virus neuraminidase and the recently reported structures of HN proteins of Sendai virus and simian virus 5. Images PMID:3003381

  16. Sequence dependent N-terminal rearrangement and degradation of peptide nucleic acid (PNA) in aqueous solution

    NASA Technical Reports Server (NTRS)

    Eriksson, M.; Christensen, L.; Schmidt, J.; Haaima, G.; Orgel, L.; Nielsen, P. E.

    1998-01-01

    The stability of the PNA (peptide nucleic acid) thymine monomer inverted question markN-[2-(thymin-1-ylacetyl)]-N-(2-aminoaminoethyl)glycine inverted question mark and those of various PNA oligomers (5-8-mers) have been measured at room temperature (20 degrees C) as a function of pH. The thymine monomer undergoes N-acyl transfer rearrangement with a half-life of 34 days at pH 11 as analyzed by 1H NMR; and two reactions, the N-acyl transfer and a sequential degradation, are found by HPLC analysis to occur at measurable rates for the oligomers at pH 9 or above. Dependent on the amino-terminal sequence, half-lives of 350 h to 163 days were found at pH 9. At pH 12 the half-lives ranged from 1.5 h to 21 days. The results are discussed in terms of PNA as a gene therapeutic drug as well as a possible prebiotic genetic material.

  17. Solubility Challenges in High Concentration Monoclonal Antibody Formulations: Relationship with Amino Acid Sequence and Intermolecular Interactions.

    PubMed

    Pindrus, Mariya; Shire, Steven J; Kelley, Robert F; Demeule, Barthélemy; Wong, Rita; Xu, Yiren; Yadav, Sandeep

    2015-11-02

    The purpose of this work was to elucidate the molecular interactions leading to monoclonal antibody self-association and precipitation and utilize biophysical measurements to predict solubility behavior at high protein concentration. Two monoclonal antibodies (mAb-G and mAb-R) binding to overlapping epitopes were investigated. Precipitation of mAb-G solutions was most prominent at high ionic strength conditions and demonstrated strong dependence on ionic strength, as well as slight dependence on solution pH. At similar conditions no precipitation was observed for mAb-R solutions. Intermolecular interactions (interaction parameter, kD) related well with high concentration solubility behavior of both antibodies. Upon increasing buffer ionic strength, interactions of mAb-R tended to weaken, while those of mAb-G became more attractive. To investigate the role of amino acid sequence on precipitation behavior, mutants were designed by substituting the CDR of mAb-R into the mAb-G framework (GM-1) or deleting two hydrophobic residues in the CDR of mAb-G (GM-2). No precipitation was observed at high ionic strength for either mutant. The molecular interactions of mutants were similar in magnitude to those of mAb-R. The results suggest that presence of hydrophobic groups in the CDR of mAb-G may be responsible for compromising its solubility at high ionic strength conditions since deleting these residues mitigated the solubility issue.

  18. Frequencies of amino acid strings in globular protein sequences indicate suppression of blocks of consecutive hydrophobic residues

    PubMed Central

    Schwartz, Russell; Istrail, Sorin; King, Jonathan

    2001-01-01

    Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20–22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of consecutive hydrophobic residues promoted aggregation within the model, even controlling for overall hydrophobic content. We report here on an analysis of the frequencies of different lengths of contiguous blocks of hydrophobic residues in a database of amino acid sequences of proteins of known structure. Sequences of three or more consecutive hydrophobic residues are found to be significantly less common in actual globular proteins than would be predicted if residues were selected independently. The result may reflect selection against long blocks of hydrophobic residues within globular proteins relative to what would be expected if residue hydrophobicities were independent of those of nearby residues in the sequence. PMID:11316883

  19. Amino acid sequence of rabbit kidney neutral endopeptidase 24.11 (enkephalinase) deduced from a complementary DNA.

    PubMed Central

    Devault, A; Lazure, C; Nault, C; Le Moual, H; Seidah, N G; Chrétien, M; Kahn, P; Powell, J; Mallet, J; Beaumont, A

    1987-01-01

    Neutral endopeptidase (EC 3.4.24.11) is a major constituent of kidney brush border membranes. It is also present in the brain where it has been shown to be involved in the inactivation of opioid peptides, methionine- and leucine-enkephalins. For this reason this enzyme is often called 'enkephalinase'. In order to characterize the primary structure of the enzyme, oligonucleotide probes were designed from partial amino acid sequences and used to isolate clones from kidney cDNA libraries. Sequencing of the cDNA inserts revealed the complete primary structure of the enzyme. Neutral endopeptidase consists of 750 amino acids. It contains a short N-terminal cytoplasmic domain (27 amino acids), a single membrane-spanning segment (23 amino acids) and an extracellular domain that comprises most of the protein mass. The comparison of the primary structure of neutral endopeptidase with that of thermolysin, a bacterial Zn-metallopeptidase, indicates that most of the amino acid residues involved in Zn coordination and catalytic activity in thermolysin are found within highly honmologous sequences in neutral endopeptidase. Images Fig. 1. Fig. 3. PMID:2440677

  20. DNA-binding and transactivation properties of Pax-6: three amino acids in the paired domain are responsible for the different sequence recognition of Pax-6 and BSAP (Pax-5).

    PubMed Central

    Czerny, T; Busslinger, M

    1995-01-01

    Pax-6 is known to be a key regulator of vertebrate eye development. We have now isolated cDNA for an invertebrate Pax-6 protein from sea urchin embryos. Transcripts of this gene first appear during development at the gastrula stage and are later expressed at high levels in the tube foot of the adult sea urchin. The sea urchin Pax-6 protein is highly homologous throughout the whole protein to its vertebrate counterpart with the paired domain and homeodomain being virtually identical. Consequently, we found that the DNA-binding and transactivation properties of the sea urchin and mouse Pax-6 proteins are very similar, if not identical. A potent activation domain capable of stimulating transcription from proximal promoter and distal enhancer positions was localized within the C-terminal sequences of both the sea urchin and mouse Pax-6 proteins. The homeodomain of Pax-6 was shown to cooperatively dimerize on DNA sequences consisting of an inverted repeat of the TAAT motif with a preferred spacing of 3 nucleotides. The consensus recognition sequence of the Pax-6 paired domain deviates primarily only at one position from that of BSAP (Pax-5), and yet the two proteins exhibit largely different binding specificities for individual, naturally occurring sites. By creating Pax-6-BSAP fusion proteins, we were able to identify a short amino acid stretch in the N-terminal part of the paired domain which is responsible for these differences in DNA-binding specificity. Mutation of three Pax-6-specific residues in this region (at positions 42, 44, and 47 of the paired domain) to the corresponding amino acids of BSAP resulted in a complete switch of the DNA-binding specificity from Pax-6 to BSAP. These three amino acids were furthermore shown to discriminate between the Pax-6- and BSAP-specific nucleotide at the divergent position of the two consensus recognition sequences. PMID:7739566

  1. Classifying nucleic acid sub-sequences as introns or exons using genetic programming

    SciTech Connect

    Handley, S.

    1995-12-31

    An evolutionary computation technique, genetic programming, created programs that classify messenger RNA sequences into one of two classes: (1) the sequence is expressed as (part of) a protein (an exon), or (2) not expressed as protein (an intron).

  2. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  3. Clickable Nucleic Acids: Sequence-Controlled Periodic Copolymer/Oligomer Synthesis by Orthogonal Thiol-X Reactions.

    PubMed

    Xi, Weixian; Pattanayak, Sankha; Wang, Chen; Fairbanks, Benjamin; Gong, Tao; Wagner, Justine; Kloxin, Christopher J; Bowman, Christopher N

    2015-11-23

    Synthetic polymer approaches generally lack the ability to control the primary sequence, with sequence control referred to as the holy grail. Two click chemistry reactions were now combined to form nucleobase-containing sequence-controlled polymers in simple polymerization reactions. Two distinct approaches are used to form these click nucleic acid (CNA) polymers. These approaches employ thiol-ene and thiol-Michael reactions to form homopolymers of a single nucleobase (e.g., poly(A)n ) or homopolymers of specific repeating nucleobase sequences (e.g., poly(ATC)n). Furthermore, the incorporation of monofunctional thiol-terminated polymers into the polymerization system enables the preparation of multiblock copolymers in a single reaction vessel; the length of the diblock copolymer can be tuned by the stoichiometric ratio and/or the monomer functionality. These polymers are also used for organogel formation where complementary CNA-based polymers form reversible crosslinks.

  4. Converging or Diverging Lens?

    ERIC Educational Resources Information Center

    Branca, Mario

    2013-01-01

    Why does a lens magnify? Why does it shrink objects? Why does this happen? The activities that we propose here are useful in helping us to understand how lenses work, and they show that the same lens can have different magnification capabilities. A converging lens can also act as a diverging lens. (Contains 4 figures.)

  5. Sequence Comparison and Phylogeny of Nucleotide Sequence of Coat Protein and Nucleic Acid Binding Protein of a Distinct Isolate of Shallot virus X from India.

    PubMed

    Majumder, S; Baranwal, V K

    2011-06-01

    Shallot virus X (ShVX), a type species in the genus Allexivirus of the family Alfaflexiviridae has been associated with shallot plants in India and other shallot growing countries like Russia, Germany, Netherland, and New Zealand. Coat protein (CP) and nucleic acid binding protein (NB) region of the virus was obtained by reverse transcriptase polymerase chain reaction from scales leaves of shallot bulbs. The partial cDNA contained two open reading frames encoding proteins of molecular weights of 28.66 and 14.18 kDa belonging to Flexi_CP super-family and viral NB super-family, respectively. The percent identity and phylogenetic analysis of amino acid sequences of CP and NB region of the virus associated with shallot indicated that it was a distinct isolate of ShVX.

  6. Differentiation of acetic acid bacteria based on sequence analysis of 16S-23S rRNA gene internal transcribed spacer sequences.

    PubMed

    González, Angel; Mas, Albert

    2011-06-30

    The 16S-23S gene internal transcribed spacer sequence of sixty-four strains belonging to different acetic acid bacteria genera were analyzed, and phylogenetic trees were generated for each genera. The topologies of the different trees were in accordance with the 16S rRNA gene trees, although the similarity percentages obtained between the species was shown to be much lower. These values suggest the usefulness of including the 16S-23S gene internal transcribed spacer region as a part of the polyphasic approach required for the further classification of acetic acid bacteria. Furthermore, the region could be a good target for primer and probe design. It has also been validated for use in the identification of unknown samples of this bacterial group from wine vinegar and fruit condiments.

  7. A rapid method for manual or automated purification of fluorescently labeled nucleic acids for sequencing, genotyping, and microarrays.

    PubMed

    Springer, Amy L; Booth, Lisa R; Braid, Michael D; Houde, Christiane M; Hughes, Karin A; Kaiser, Robert J; Pedrak, Casandra; Spicer, Douglas A; Stolyar, Sergey

    2003-03-01

    Fluorescent dyes provide specific, sensitive, and multiplexed detection of nucleic acids. To maximize sensitivity, fluorescently labeled reaction products (e.g., cycle sequencing or primer extension products) must be purified away from residual dye-labeled precursors. Successful high-throughput analyses require that this purification be reliable, rapid, and amenable to automation. Common methods for purifying reaction products involve several steps and require processes that are not easily automated. Prolinx, Inc. has devel oped RapXtract superparamagnetic separation technology affording rapid and easy-to-perform methods that yield high-quality product and are easily automated. The technology uses superparamagnetic particles that specifically remove unincorporated dye-labeled precursors. These particles are efficiently pelleted in the presence of a magnetic field, making them ideal for purification because of the rapid separations that they allow. RapXtract-purified sequencing reactions yield data with good signal and high Phred quality scores, and they work with various sequencing dye chemistries, including BigDye and near-infrared fluorescence IRDyes. RapXtract technology can also be used to purify dye primer sequencing reactions, primer extension reactions for genotyping analysis, and nucleic acid labeling reactions for microarray hybridization. The ease of use and versatility of RapXtract technology makes it a good choice for manual or automated purification of fluorescently labeled nucleic acids.

  8. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    PubMed

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only DM20 isoform is expressed in Latimeria. The structure of proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  9. Evolutionary Divergence of Aggregatibacter actinomycetemcomitans.

    PubMed

    Kittichotirat, W; Bumgarner, R E; Chen, C

    2016-01-01

    Gram-negative facultative Aggregatibacter actinomycetemcomitans is an oral pathogen associated with periodontitis. The genetic heterogeneity among A. actinomycetemcomitans strains has been long recognized. This study provides a comprehensive genomic analysis of A. actinomycetemcomitans and the closely related nonpathogenic Aggregatibacter aphrophilus. Whole genome sequencing by Illumina MiSeq platform was performed for 31 A. actinomycetemcomitans and 2 A. aphrophilus strains. Sequence similarity analysis shows a total of 3,220 unique genes across the 2 species, where 1,550 are core genes present in all genomes and 1,670 are variable genes (accessory genes) missing in at least 1 genome. Phylogenetic analysis based on 397 concatenated core genes distinguished A. aphrophilus and A. actinomycetemcomitans. The latter was in turn divided into 5 clades: clade b (serotype b), clade c (serotype c), clade e/f (serotypes e and f), clade a/d (serotypes a and d), and clade e' (serotype e strains). Accessory genes accounted for 14.1% to 23.2% of the A. actinomycetemcomitans genomes, with a majority belonging to the category of poorly characterized by Cluster of Orthologous Groups classification. These accessory genes were often organized into genomic islands (n = 387) with base composition biases, suggesting their acquisitions via horizontal gene transfer. There was a greater degree of similarity in gene content and genomic islands among strains within clades than between clades. Strains of clade e' isolated from human were found to be missing the genomic island that carries genes encoding cytolethal distending toxins. Taken together, the results suggest a pattern of sequential divergence, starting from the separation of A. aphrophilus and A. actinomycetemcomitans through gain and loss of genes and ending with the divergence of the latter species into distinct clades and serotypes. With differing constellations of genes, the A. actinomycetemcomitans clades may have evolved

  10. Cloning and sequencing of the Bet v 1-homologous allergen Fra a 1 in strawberry (Fragaria ananassa) shows the presence of an intron and little variability in amino acid sequence.

    PubMed

    Musidlowska-Persson, Anna; Alm, Rikard; Emanuelsson, Cecilia

    2007-02-01

    The Fra a 1 allergen in strawberry (Fragaria ananassa) is homologous to the major birch pollen allergen Bet v 1, which has numerous isoforms differing in terms of amino acid sequence and immunological impact. To map the extent of sequence differences in the Fra a 1 allergen, PCR cloning and sequencing was applied. Several genomic sequences of Fra a 1, with a length of either 584, 591 or 594 nucleotides, were obtained from three different strawberry varieties. All contained one intron, with the length of either 101 or 110 nucleotides. By sequencing 30 different clones, eight different DNA sequences were obtained, giving in total five potential Fra a 1 protein isoforms, with high sequence similarity (>97% sequence identity) and only seven positions of amino acid variability, which were largely confirmed by mass spectrometry of expressed proteins. We conclude that the sequence variability in the strawberry allergen Fra a 1 is small, within and between strawberry varieties, and that multiple spots, previously detected in 2DE, are presumably due to differences in post-translational modification rather than differences in amino acid sequence. The most abundant Fra a 1 isoform sequence, recombinantly expressed in Escherichia coli after removal of the intron, was recognized by IgE from strawberry allergic patients. It cross-reacted with antibodies to Bet v 1 and the homologous apple allergen Mal d 1 (61 and 78% sequence identity, respectively), and will be used in further analyses of variation in Fra a 1-expression.

  11. K-Pax2: Bayesian identification of cluster-defining amino acid positions in large sequence datasets

    PubMed Central

    Grad, Yonatan; Cobey, Sarah; Puranen, Juha Santeri; Corander, Jukka

    2015-01-01

    The recent growth in publicly available sequence data has introduced new opportunities for studying microbial evolution and spread. Because the pace of sequence accumulation tends to exceed the pace of experimental studies of protein function and the roles of individual amino acids, statistical tools to identify meaningful patterns in protein diversity are essential. Large sequence alignments from fast-evolving micro-organisms are particularly challenging to dissect using standard tools from phylogenetics and multivariate statistics because biologically relevant functional signals are easily masked by neutral variation and noise. To meet this need, a novel computational method is introduced that is easily executed in parallel using a cluster environment and can handle thousands of sequences with minimal subjective input from the user. The usefulness of this kind of machine learning is demonstrated by applying it to nearly 5000 haemagglutinin sequences of influenza A/H3N2.Antigenic and 3D structural mapping of the results show that the method can recover the major jumps in antigenic phenotype that occurred between 1968 and 2013 and identify specific amino acids associated with these changes. The method is expected to provide a useful tool to uncover patterns of protein evolution. PMID:28348810

  12. The amino acid sequence of Ole e I, the major allergen from olive tree (Olea europaea) pollen.

    PubMed

    Villalba, M; Batanero, E; López-Otín, C; Sánchez, L M; Monsalve, R I; González de la Peña, M A; Lahoz, C; Rodríguez, R

    1993-09-15

    The complete primary structure of the major allergen from Olea europaea (olive tree) pollen, Ole e I (IUIS nomenclature), has been determined. The amino acid sequence was established by automated Edman degradation of the reduced and alkylated molecule as well as of selected fragments obtained by proteolytic digestions. Ole e I contains a single polypeptide chain of 145 amino acid residues with a calculated molecular mass of 16331 Da. No free sulfhydryl groups have been detected in the native protein. The molecule contains a putative glycosylation site. A high degree of microheterogeneity has been observed, mainly centered in the first 33% of the molecule. Comparison of Ole e I sequence with protein sequence databases showed no similarity with other known allergens. However, it has a 36% and 38% sequence identity with the putative polypeptide structures, deduced, respectively, from nucleotide sequences of genes isolated from tomato anthers and corn pollen, which have been suggested to be involved in the growing of the pollen tube. Therefore, the olive tree allergen may be a constitutive protein of the pollen involved in reproductive functions.

  13. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    PubMed

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified-one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.

  14. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    PubMed

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered.

  15. Isolation and a partial amino acid sequence of insulin from the islet tissue of cod (Gadus callarias)

    PubMed Central

    Grant, P. T.; Reid, K. B. M.

    1968-01-01

    1. Insulin has been isolated by gel filtration and ion-exchange chromatography from extracts of the discrete islet tissue of cod. The final preparation yielded a single band on electrophoresis at two pH values. The biological potency was 11·5 international units/mg. in mouse-convulsion and other assay procedures. 2. Glycine and methionine were shown to be the N-terminal amino acids of the A and B chains respectively. An estimate of the molecular weight together with amino acid analyses indicated that cod insulin, like the bovine hormone, consists of 51 amino acid residues. In contrast, the amino acid composition differs markedly from bovine insulin. 3. Oxidation of insulin with performic acid yielded the A and B peptide chains, which were separated by ion-exchange chromatography. Sequence studies on smaller peptides isolated from enzymic digests or from dilute acetic acid hydrolysates of the two chains have established the sequential order of 14 of the 21 amino acid residues of the A chain and 25 of the 30 amino acid residues of the B chain. PMID:4866431

  16. Oxygen affinity and amino acid sequence of myoglobins from endothermic and ectothermic fish.

    PubMed

    Marcinek, D J; Bonaventura, J; Wittenberg, J B; Block, B A

    2001-04-01

    Myoglobin (Mb) buffers intracellular O2 and facilitates diffusion of O2 through the cell. These functions of Mb will be most effective when intracellular PO2 is near the partial pressure of oxygen at which Mb is half saturated (P50) of the molecule. We test the hypothesis that Mb oxygen affinity has evolved such that it is conserved when adjusted for body temperature among closely related animals. We measure oxygen P50s tonometrically and oxygen dissociation rate constants with stopped flow and generate amino acid sequence from cDNA of Mbs from fish with different body temperatures. P50s for the endothermic bluefin tuna, skipjack tuna, and blue marlin at 20 degrees C were 0.62 +/- 0.02, 0.59 +/- 0.01, 0.58 +/- 0.04 mmHg, respectively, and were significantly lower than those for ectothermic bonito (1.03 +/- 0.07 mmHg) and mackerel (1.39 +/- 0.03 mmHg). Because the oxygen affinity of Mb decreases with increasing temperature, the above differences in oxygen affinity between endothermic and ectothermic fish are reduced when adjusted for the in vivo muscle temperature of the animal. Oxygen dissociation rate constants at 20 degrees C for the endothermic species ranged from 34.1 to 49.3 s(-1), whereas those for mackerel and bonito were 102 and 62 s(-1), respectively. Correlated with the low oxygen affinity and fast dissociation kinetics of mackerel Mb is a substitution of alanine for proline that would likely result in a more flexible mackerel protein.

  17. Molecular cloning, nucleotide sequence, and abscisic acid induction of a suberization-associated highly anionic peroxidase.

    PubMed

    Roberts, E; Kolattukudy, P E

    1989-06-01

    A highly anionic peroxidase induced in suberizing cells was suggested to be the key enzyme involved in polymerization of phenolic monomers to generate the aromatic matrix of suberin. The enzyme encoded by a potato cDNA was found to be highly homologous to the anionic peroxidase induced in suberizing tomato fruit. A tomato genomic library was screened using the potato anionic peroxidase cDNA and one genomic clone was isolated that contained two tandemly oriented anionic peroxidase genes. These genes were sequenced and were 96% and 87% identical to the mRNA for potato anionic peroxidase. Both genes consist of three exons with the relative positions of their two introns being conserved between the two genes. Primer extension analysis showed that only one of the genes is expressed in the periderm of 3 day wound-healed tomato fruits. Southern blot analyses suggested that there are two copies each of the two highly homologous genes per haploid genome in both potato and tomato. Abscisic acid (ABA) induced the accumulation of the anionic peroxidase transcripts in potato and tomato callus tissues. Northern blots showed that peroxidase mRNA was detectable at 2 days and was maximal at 8 days after transfer of potato callus to solid agar media containing 10(-4) M ABA. The transcripts induced by ABA in both potato and tomato callus were identical in size to those induced in wound-healing potato tuber and tomato fruit. The anionic peroxidase peptide was detected in extracts of potato callus grown on the ABA-containing media by western blot analysis. The results support the suggestion that stimulation of suberization by ABA involves the induction of the highly anionic peroxidase.

  18. Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM

    PubMed Central

    Altermann, Eric; Russell, W. Michael; Azcarate-Peril, M. Andrea; Barrangou, Rodolphe; Buck, B. Logan; McAuliffe, Olivia; Souther, Nicole; Dobson, Alleson; Duong, Tri; Callanan, Michael; Lick, Sonja; Hamrick, Alice; Cano, Raul; Klaenhammer, Todd R.

    2005-01-01

    Lactobacillus acidophilus NCFM is a probiotic bacterium that has been produced commercially since 1972. The complete genome is 1,993,564 nt and devoid of plasmids. The average GC content is 34.71% with 1,864 predicted ORFs, of which 72.5% were functionally classified. Nine phage-related integrases were predicted, but no complete prophages were found. However, three unique regions designated as potential autonomous units (PAUs) were identified. These units resemble a unique structure and bear characteristics of both plasmids and phages. Analysis of the three PAUs revealed the presence of two R/M systems and a prophage maintenance system killer protein. A spacers interspersed direct repeat locus containing 32 nearly perfect 29-bp repeats was discovered and may provide a unique molecular signature for this organism. In silico analyses predicted 17 transposase genes and a chromosomal locus for lactacin B, a class II bacteriocin. Several mucus- and fibronectin-binding proteins, implicated in adhesion to human intestinal cells, were also identified. Gene clusters for transport of a diverse group of carbohydrates, including fructooligosaccharides and raffinose, were present and often accompanied by transcriptional regulators of the lacI family. For protein degradation and peptide utilization, the organism encoded 20 putative peptidases, homologs for PrtP and PrtM, and two complete oligopeptide transport systems. Nine two-component regulatory systems were predicted, some associated with determinants implicated in bacteriocin production and acid tolerance. Collectively, these features within the genome sequence of L. acidophilus are likely to contribute to the organisms' gastric survival and promote interactions with the intestinal mucosa and microbiota. PMID:15671160

  19. Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

    PubMed Central

    Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

    1986-01-01

    A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461

  20. Peptide Mass Fingerprinting and N-Terminal Amino Acid Sequencing of Glycosylated Cysteine Protease of Euphorbia nivulia Buch.-Ham.

    PubMed Central

    Badgujar, Shamkant B.; Mahajan, Raghunath T.

    2013-01-01

    A new cysteine protease named Nivulian-II has been purified from the latex of Euphorbia nivulia Buch.-Ham. The apparent molecular mass of Nivulian-II is 43670.846 Da (MALDI TOF/MS). Peptide mass fingerprint analysis revealed peptide matches to Maturase K (Q52ZV1_9MAGN) of Banksia quercifolia. The N-terminal sequence (DFPPNTCCCICC) showed partial homology with those of other cysteine proteinases of biological origin. This is the first paper to characterize a Nivulian-II of E. nivulia latex with respect to amino acid sequencing. PMID:23476742

  1. A phylogenomic approach to resolve the basal pterygote divergence.

    PubMed

    Simon, Sabrina; Strauss, Sascha; von Haeseler, Arndt; Hadrys, Heike

    2009-12-01

    One of the most fascinating Bauplan transitions in the animal kingdom was the invention of insect wings, a change that also contributed to the success and enormous diversity of this animal group. However, the origin of insect flight and the relationships of basal winged insect orders are still controversial. Three hypotheses have been proposed to explain the phylogeny of winged insects: 1) the traditional Palaeoptera hypothesis (Ephemeroptera + Odonata, Neoptera), 2) the Metapterygota hypothesis (Ephemeroptera, Odonata + Neoptera), and 3) the Chiastomyaria hypothesis (Odonata, Ephemeroptera + Neoptera). Neither phylogenetic analyses of single genes nor even multiple marker systems (e.g., molecular markers + morphological characters) have yet been able to conclusively resolve basal pterygote divergences. A possible explanation for the lack of resolution is that the divergences took place in the mid-Devonian within a short period of time and attempts to solve this problem have been confounded by the major challenge of finding molecular markers to accurately track these short ancient internodes. Although phylogenomic data are available for Neoptera and some wingless (apterygote) orders, they are lacking for the crucial Odonata and Ephemeroptera orders. We adopt a multigene approach including data from two new expressed sequence tag projects-from the orders Ephemeroptera (Baetis sp.) and Odonata (Ischnura elegans)-to evaluate the potential of phylogenomic analyses in clarifying this unresolved issue. We analyzed two data sets that differed in represented taxa, genes, and overall sequence lengths: maxspe (15 taxa, 125 genes, and 31,643 amino acid positions) and maxgen (8 taxa, 150 genes, and 42,541 amino acid positions). Maximum likelihood and Bayesian inference analyses both place the Odonata at the base of the winged insects. Furthermore, statistical hypotheses testing rejected both the Palaeoptera and the Metapterygota hypotheses. The comprehensive molecular data set

  2. Divergence Boundary Conditions for Vector Helmholtz Equations with Divergence Constraints

    NASA Technical Reports Server (NTRS)

    Kangro, Urve; Nicolaides, Roy

    1997-01-01

    The idea of replacing a divergence constraint by a divergence boundary condition is investigated. The connections between the formulations are considered in detail. It is shown that the most common methods of using divergence boundary conditions do not always work properly. Necessary and sufficient conditions for the equivalence of the formulations are given.

  3. DNA Sequence and Expression Variation of Hop (Humulus lupulus) Valerophenone Synthase (VPS), a Key Gene in Bitter Acid Biosynthesis

    PubMed Central

    Castro, Consuelo B.; Whittock, Lucy D.; Whittock, Simon P.; Leggett, Grey; Koutoulis, Anthony

    2008-01-01

    Background The hop plant (Humulus lupulus) is a source of many secondary metabolites, with bitter acids essential in the beer brewing industry and others having potential applications for human health. This study investigated variation in DNA sequence and gene expression of valerophenone synthase (VPS), a key gene in the bitter acid biosynthesis pathway of hop. Methods Sequence variation was studied in 12 varieties, and expression was analysed in four of the 12 varieties in a series across the development of the hop cone. Results Nine single nucleotide polymorphisms (SNPs) were detected in VPS, seven of which were synonymous. The two non-synonymous polymorphisms did not appear to be related to typical bitter acid profiles of the varieties studied. However, real-time quantitative reverse-transcription polymerase chain reaction (qRT-PCR) analysis of VPS expression during hop cone development showed a clear link with the bitter acid content. The highest levels of VPS expression were observed in two triploid varieties, ‘Symphony’ and ‘Ember’, which typically have high bitter acid levels. Conclusions In all hop varieties studied, VPS expression was lowest in the leaves and an increase in expression was consistently observed during the early stages of cone development. PMID:18519445

  4. A knowledge engineering approach to recognizing and extracting sequences of nucleic acids from scientific literature.

    PubMed

    García-Remesal, Miguel; Maojo, Victor; Crespo, José

    2010-01-01

    In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. For such set, we achieved 87.76% precision and 97.70% recall respectively. This method can facilitate different research tasks. These include text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences.

  5. Cloning, sequence, and developmental expression of a type 5, tartrate-resistant, acid phosphatase of rat bone.

    PubMed

    Ek-Rylander, B; Bill, P; Norgård, M; Nilsson, S; Andersson, G

    1991-12-25

    Tartrate-resistant acid phosphatase (TRAP) is a characteristic constituent of osteoclasts and some mononuclear preosteoclasts and, therefore, used as a histochemical and biochemical marker for osteoclasts and bone resorption. We now report the isolation of a 1397-base pair (bp) full-length TRAP/tartrate-resistant acid ATPase (TrATPase) cDNA clone from a neonatal rat calvaria lambda gt11 cDNA library. The cDNA clone consists of a 92-bp untranslated 5'-flank, an open reading frame of 981 bp and a 324-bp untranslated 3'-poly(A)-containing region. The deduced protein sequence of 327 amino acids contains a putative cleavable signal sequence of 21 amino acids. The mature polypeptide of 306 amino acids has a calculated Mr of 34,350 Da and a pI of 9.18, and it contains two potential N-glycosylation sites and the lysosomal targeting sequence DKRFQ. At the protein level, the sequence displays 89-94% homology to TRAP enzymes from human placenta, beef spleen, and uteroferrin and identity to the N terminus of purified rat bone TRAP/TrATPase. An N-terminal amino acid segment is strikingly homologous to the corresponding region in lysosomal and prostatic acid phosphatases. The cDNA recognized a 1.5-kilobase mRNA in long bones and calvaria, and in vitro translation using, as template, mRNA transcribed from the full-length insert yielded an immunoprecipitated product of 34 kDa. In neonatal rats, TRAP/TrATPase mRNA was highly expressed in skeletal tissues, with much lower (less than 10%) levels detected in spleen, thymus, liver, skin, brain, kidney, brain, lung, and heart. In situ hybridization demonstrated specific labeling of osteoclasts at endostal surfaces and bone trabeculae of long bones. Thus, despite the apparent similarity of this osteoclastic TRAP/TrATPase with type 5, tartrate-resistant and purple, acid phosphatases expressed in other mammalian tissues, this gene appears to be preferentially expressed at skeletal sites.

  6. Snake venoms. The amino-acid sequence of protein S5C4 from Dendroaspis jamesoni kaimosae (Jameson's mamba) venom.

    PubMed

    Joubert, F J; Strydom, A J; Taljaard, N

    1978-06-01

    A major component (S5C4) was purified from Jameson's mamba by gel filtration on Sephadex G-50 and by ion-exchange chromotography on CM-cellulose. Protein S5C4 contains 60 amino acid residues and is cross-linked by four intrachain disulphide bridges. The complete primary structure of the protein has been elucidated. The toxicities, the immunochemical properties, the sequence and the invariant amino acid residues of protein S5C4 resemble subgroup II of the angusticeps-type proteins.

  7. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  8. Complete amino acid sequence of an acidic, cardiotoxic phospholipase A2 from the venom of Ophiophagus hannah (King Cobra): a novel cobra venom enzyme with "pancreatic loop".

    PubMed

    Huang, M Z; Gopalakrishnakone, P; Chung, M C; Kini, R M

    1997-02-15

    A phospholipase A2 (OHV A-PLA2) from the venom of Ophiophagus hannah (King cobra) is an acidic protein exhibiting cardiotoxicity, myotoxicity, and antiplatelet activity. The complete amino acid sequence of OHV A-PLA2 has been determined using a combination of Edman degradation and mass spectrometric techniques. OHV A-PLA2 is composed of a single chain of 124 amino acid residues with 14 cysteines and a calculated molecular weight of 13719 Da. It contains the loop of residues (62-66) found in pancreatic PLA2s and hence belongs to class IB enzymes. This pancreatic loop is between two proline residues (Pro 59 and Pro 68) and contains several hydrophilic amino acids (Ser and Asp). This region has high degree of conformational flexibility and is on the surface of the molecule, and hence it may be a potential protein-protein interaction site. A relatively low sequence homology is found between OHV A-PLA2 and other known cardiotoxic PLA2s, and hence a contiguous segment could not be identified as a site responsible for the cardiotoxic activity.

  9. Snake venoms. The amino-acid sequence of trypsin inhibitor E of Dendroaspis polylepis polylepis (Black Mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1978-06-01

    Trypsin inhibitor E from black mamba venom comprises 59 amino acid residues in a single polypeptide chain, cross-linked by three intrachain disulphide bridges. The complete primary structure of inhibitor E was elucidated. The sequence is homologous with trypsin inhibitors from different sources. Unique among this homologous series of proteinase inhibitors, inhibitor E has an affinity for transition metal ions, exemplified here by Cu2 and Co2+.

  10. The amino acid sequence of the zinc-requiring beta-lactamase II from the bacterium Bacillus cereus 569.

    PubMed

    Ambler, R P; Daniel, M; Fleming, J; Hermoso, J M; Pang, C; Waley, S G

    1985-09-23

    The amino acid sequence of the zinc-requiring beta-lactamase II from Bacillus cereus strain 569 has been determined. It consists of a single polypeptide chain of 227 residues. It is the only example so far fully characterized of a class B beta-lactamase, and is structurally and mechanistically distinct from both the widely distributed class A beta-lactamases (such as the Escherichia coli RTEM enzyme) and from the chromosomally encoded class C enzymes from Gram-negative bacteria.

  11. A simple ligation-based method to increase the information density in sequencing reactions used to deconvolute nucleic acid selections

    PubMed Central

    Childs-Disney, Jessica L.; Disney, Matthew D.

    2008-01-01

    Herein, a method is described to increase the information density of sequencing experiments used to deconvolute nucleic acid selections. The method is facile and should be applicable to any selection experiment. A critical feature of this method is the use of biotinylated primers to amplify and encode a BamHI restriction site on both ends of a PCR product. After amplification, the PCR reaction is captured onto streptavidin resin, washed, and digested directly on the resin. Resin-based digestion affords clean product that is devoid of partially digested products and unincorporated PCR primers. The product's complementary ends are annealed and ligated together with T4 DNA ligase. Analysis of ligation products shows formation of concatemers of different length and little detectable monomer. Sequencing results produced data that routinely contained three to four copies of the library. This method allows for more efficient formulation of structure-activity relationships since multiple active sequences are identified from a single clone. PMID:18065718

  12. The isolation, purification and amino-acid sequence of insulin from the teleost fish Cottus scorpius (daddy sculpin).

    PubMed

    Cutfield, J F; Cutfield, S M; Carne, A; Emdin, S O; Falkmer, S

    1986-07-01

    Insulin from the principal islets of the teleost fish, Cottus scorpius (daddy sculpin), has been isolated and sequenced. Purification involved acid/alcohol extraction, gel filtration, and reverse-phase high-performance liquid chromatography to yield nearly 1 mg pure insulin/g wet weight islet tissue. Biological potency was estimated as 40% compared to porcine insulin. The sculpin insulin crystallised in the absence of zinc ions although zinc is known to be present in the islets in significant amounts. Two other hormones, glucagon and pancreatic polypeptide, were copurified with the insulin, and an N-terminal sequence for pancreatic polypeptide was determined. The primary structure of sculpin insulin shows a number of sequence changes unique so far amongst teleost fish. These changes occur at A14 (Arg), A15 (Val), and B2 (Asp). The B chain contains 29 amino acids and there is no N-terminal extension as seen with several other fish. Presumably as a result of the amino acid substitutions, sculpin insulin does not readily form crystals containing zinc-insulin hexamers, despite the presence of the coordinating B10 His.

  13. The amino acid sequence of a carbohydrate-containing immunoglobulin-light-chain-type amyloid-fibril protein.

    PubMed Central

    Tveteraas, T; Sletten, K; Westermark, P

    1985-01-01

    The amino acid sequence of an amyloid-fibril protein Es492 of immunoglobulin-lambda-light-chain origin (AL) was elucidated. The amyloid fibrils were obtained from the spleen of a patient who died from systemic amyloidosis. The amino acid sequence was elucidated from structural studies of peptides derived from digestion of the protein with trypsin, thermolysin, chymotrypsin and Staphylococcus aureus V8 proteinase and from cleavage of the protein with CNBr and BNPS-skatole. A heterogeneity in the length of the polypeptide was seen in the C-terminal region. The protein was by sequence homology to other lambda-chains shown to be of the V lambda II subgroup. Although an extensive homology was seen, some amino acid residues in positions 26, 31, 32, 40, 44, 93, 97, 98 and 99 have not previously been reported in these positions of V lambda II proteins. The significance of these residues in the fibril formation is unclear. The protein was found to contain carbohydrate, with glycosylation sites in two of the hypervariable regions. PMID:3936482

  14. Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis.

    PubMed Central

    Gorbalenya, A E; Koonin, E V; Donchenko, A P; Blinov, V M

    1989-01-01

    Amino acid sequences of 2 giant non-structural polyproteins (F1 and F2) of infectious bronchitis virus (IBV), a member of Coronaviridae, were compared, by computer-assisted methods, to sequences of a number of other positive strand RNA viral and cellular proteins. By this approach, juxtaposed putative RNA-dependent RNA polymerase, nucleic acid binding ("finger"-like) and RNA helicase domains were identified in F2. Together, these domains might constitute the core of the protein complex involved in the primer-dependent transcription, replication and recombination of coronaviruses. In F1, two cysteine protease-like domains and a growth factor-like one were revealed. One of the putative proteases of IBV is similar to 3C proteases of picornaviruses and related enzymes of como- nepo- and potyviruses. Search of IBV F1 and F2 sequences for sites similar to those cleaved by the latter proteases and intercomparison of the surrounding sequence stretches revealed 13 dipeptides Q/S(G) which are probably cleaved by the coronavirus 3C-like protease. Based on these observations, a partial tentative scheme for the functional organization and expression strategy of the non-structural polyproteins of IBV was proposed. It implies that, despite the general similarity to other positive strand RNA viruses, and particularly to potyviruses, coronaviruses possess a number of unique structural and functional features. PMID:2526320

  15. Recognition of 5'-YpG-3' sequences by coupled stacking/hydrogen bonding interactions with amino acid residues.

    PubMed

    Lamoureux, Jason S; Maynes, Jason T; Glover, J N Mark

    2004-01-09

    The combined biochemical and structural study of hundreds of protein-DNA complexes has indicated that sequence-specific interactions are mediated by two mechanisms termed direct and indirect readout. Direct readout involves direct interactions between the protein and base-specific atoms exposed in the major and minor grooves of DNA. For indirect readout, the protein recognizes DNA by sensing conformational variations in the structure dependent on nucleotide sequence, typically through interactions with the phosphodiester backbone. Based on our recent structure of Ndt80 bound to DNA in conjunction with a search of the existing PDB database, we propose a new method of sequence-specific recognition that utilizes both direct and indirect readout. In this mode, a single amino acid side-chain recognizes two consecutive base-pairs. The 3'-base is recognized by canonical direct readout, while the 5'-base is recognized through a variation of indirect readout, whereby the conformational flexibility of the particular dinucleotide step, namely a 5'-pyrimidine-purine-3' step, facilitates its recognition by the amino acid via cation-pi interactions. In most cases, this mode of DNA recognition helps explain the sequence specificity of the protein for its target DNA.

  16. Kranz and single-cell forms of C4 plants in the subfamily Suaedoideae show kinetic C4 convergence for PEPC and Rubisco with divergent amino acid substitutions

    PubMed Central

    Rosnow, Josh J.; Evans, Marc A.; Kapralov, Maxim V.; Cousins, Asaph B.; Edwards, Gerald E.; Roalson, Eric H.

    2015-01-01

    The two carboxylation reactions performed by phosphoenolpyruvate carboxylase (PEPC) and ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) are vital in the fixation of inorganic carbon for C4 plants. The abundance of PEPC is substantially elevated in C4 leaves, while the location of Rubisco is restricted to one of two chloroplast types. These differences compared with C3 leaves have been shown to result in convergent enzyme optimization in some C4 species. Investigation into the kinetic properties of PEPC and Rubisco from Kranz C4, single cell C4, and C3 species in Chenopodiaceae s. s. subfamily Suaedoideae showed that these major carboxylases in C4 Suaedoideae species lack the same mutations found in other C4 systems which have been examined; but still have similar convergent kinetic properties. Positive selection analysis on the N-terminus of PEPC identified residues 364 and 368 to be under positive selection with a posterior probability >0.99 using Bayes empirical Bayes. Compared with previous analyses on other C4 species, PEPC from C4 Suaedoideae species have different convergent amino acids that result in a higher K m for PEP and malate tolerance compared with C3 species. Kinetic analysis of Rubisco showed that C4 species have a higher catalytic efficiency of Rubisco (k catc in mol CO2 mol–1 Rubisco active sites s–1), despite lacking convergent substitutions in the rbcL gene. The importance of kinetic changes to the two-carboxylation reactions in C4 leaves related to amino acid selection is discussed. PMID:26417023

  17. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance

    PubMed Central

    Baranzoni, Gian Marco; Reichenberger, Erin R.; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  18. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance.

    PubMed

    Baranzoni, Gian Marco; Fratamico, Pina M; Reichenberger, Erin R; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-07-28

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here.

  19. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  20. Amino acid sequence and some properties of phytolacain G, a cysteine protease from growing fruit of pokeweed, Phytolacca americana.

    PubMed

    Uchikoba, T; Arima, K; Yonezawa, H; Shimada, M; Kaneda, M

    2000-10-18

    A protease, phytolacain G, has been found to appear on CM-Sepharose ion-exchange chromatography of greenish small-size fruits of pokeweed, Phytolacca americana L, from ca. 2 weeks after flowering, and increases during fruit enlargement. Reddish ripe fruit of the pokeweed contained both phytolacain G and R. The molecular mass of phytolacain G was estimated to be 25.5 kDa by SDS-PAGE. Its amino acid sequence was reconstructed by automated sequence analysis of the peptides obtained after cleavage with Achromobacter protease I, chymotrypsin, and cyanogen bromide. The enzyme is composed of 216 amino acid residues, of which it shares 152 identical amino acid residues (70%) with phytolacain R, 126 (58%) with melain G, 108 (50%) with papain, 106 (49%) with actinidain, and 96 (44%) with stem bromelain. The amino acid residues forming the substrate binding S(2) pocket of papain, Tyr67, Pro68, Trp69, Val133, and Phe207, were predicted to be replaced by Trp, Met, His, Ala, and Ser in phytolacain G, respectively. As a consequence of these substitutions, the S(2) pocket is expected to be less hydrophobic in phytolacain G than in papain.

  1. Sequence divergence and loss-of-function phenotypes of S locus F-box brothers genes are consistent with non-self recognition by multiple pollen determinants in self-incompatibility of Japanese pear (Pyrus pyrifolia).

    PubMed

    Kakui, Hiroyuki; Kato, Masaki; Ushijima, Koichiro; Kitaguchi, Miyoko; Kato, Shu; Sassa, Hidenori

    2011-12-01

    The S-RNase-based gametophytic self-incompatibility (SI) of Rosaceae, Solanaceae, and Plantaginaceae is controlled by at least two tightly linked genes located at the complex S locus; the highly polymorphic S-RNase for pistil specificity and the F-box gene (SFB/SLF) for pollen. Self-incompatibility in Prunus (Rosaceae) is considered to represent a 'self recognition by a single factor' system, because loss-of-function of SFB is associated with self-compatibility, and allelic divergence of SFB is high and comparable to that of S-RNase. In contrast, Petunia (Solanaceae) exhibits 'non-self recognition by multiple factors'. However, the distribution of 'self recognition' and 'non-self recognition' SI systems in different taxa is not clear. In addition, in 'non-self recognition' systems, a loss-of-function phenotype of pollen S is unknown. Here we analyze the divergence of SFBB genes, the multiple pollen S candidates, of a rosaceous plant Japanese pear (Pyrus pyrifolia) and show that intrahaplotypic divergence is high and comparable to the allelic diversity of S-RNase while interhaplotypic divergence is very low. Next, we analyzed loss-of-function of the SFBB1 type gene. Genetic analysis showed that pollen with the mutant haplotype S(4sm) lacking SFBB1-S(4) is rejected by pistils with an otherwise compatible S(1) while it is accepted by other non-self pistils. We found that the S(5) haplotype encodes a truncated SFBB1 protein, even though S(5) pollen is accepted normally by pistils with S(1) and other non-self haplotypes. These findings suggest that Japanese pear has a 'non-self recognition by multiple factors' SI system, although it is a species of Rosaceae to which Prunus also belongs.

  2. Three Divergent Subpopulations of the Malaria Parasite Plasmodium knowlesi

    PubMed Central

    Lin, Lee C.; Rovie-Ryan, Jeffrine J.; Kadir, Khamisah A.; Anderios, Fread; Hisam, Shamilah; Sharma, Reuben S.K.; Singh, Balbir; Conway, David J.

    2017-01-01

    Multilocus microsatellite genotyping of Plasmodium knowlesi isolates previously indicated 2 divergent parasite subpopulations in humans on the island of Borneo, each associated with a different macaque reservoir host species. Geographic divergence was also apparent, and independent sequence data have indicated particularly deep divergence between parasites from mainland Southeast Asia and Borneo. To resolve the overall population structure, multilocus microsatellite genotyping was conducted on a new sample of 182 P. knowlesi infections (obtained from 134 humans and 48 wild macaques) from diverse areas of Malaysia, first analyzed separately and then in combination with previous data. All analyses confirmed 2 divergent clusters of human cases in Malaysian Borneo, associated with long-tailed macaques and pig-tailed macaques, and a third cluster in humans and most macaques in peninsular Malaysia. High levels of pairwise divergence between each of these sympatric and allopatric subpopulations have implications for the epidemiology and control of this zoonotic species. PMID:28322705

  3. A profile-based method for identifying functional divergence of orthologous genes in bacterial genomes

    PubMed Central

    Wheeler, Nicole E.; Barquist, Lars; Kingsley, Robert A.; Gardner, Paul P.

    2016-01-01

    Motivation: Next generation sequencing technologies have provided us with a wealth of information on genetic variation, but predicting the functional significance of this variation is a difficult task. While many comparative genomics studies have focused on gene flux and large scale changes, relatively little attention has been paid to quantifying the effects of single nucleotide polymorphisms and indels on protein function, particularly in bacterial genomics. Results: We present a hidden Markov model based approach we call delta-bitscore (DBS) for identifying orthologous proteins that have diverged at the amino acid sequence level in a way that is likely to impact biological function. We benchmark this approach with several widely used datasets and apply it to a proof-of-concept study of orthologous proteomes in an investigation of host adaptation in Salmonella enterica. We highlight the value of the method in identifying functional divergence of genes, and suggest that this tool may be a better approach than the commonly used dN/dS metric for identifying functionally significant genetic changes occurring in recently diverged organisms. Availability and Implementation: A program implementing DBS for pairwise genome comparisons is freely available at: https://github.com/UCanCompBio/deltaBS. Contact: nicole.wheeler@pg.canterbury.ac.nz or lars.barquist@uni-wuerzburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27503221

  4. The mitochondrial genome of spotted green pufferfish Tetraodon nigroviridis (Teleostei: Tetraodontiformes) and divergence time estimation among model organisms in fishes.

    PubMed

    Yamanoue, Yusuke; Miya, Masaki; Inoue, Jun G; Matsuura, Keiichi; Nishida, Mutsumi

    2006-02-01

    We determined the whole mitochondrial genome sequence for spotted green pufferfish, Tetraodon nigroviridis (Teleostei: Tetraodontiformes). The genome (16,488 bp) contained 37 genes (two ribosomal RNA genes, 22 transfer RNA genes, and 13 protein-coding genes) plus control region as found in other vertebrates, with the gene order identical to that of typical vertebrates. The sequence was used to estimate phylogenetic relationships and divergence times among major lineages of fishes, including representative model organisms in fishes. We employed partitioned Bayesian approaches for these two analyses using two datasets that comprised concatenated amino acid sequences from 12 protein-coding genes (excluding the ND6 gene) and concatenated nucleotide sequences from the 12 protein-coding genes (without 3rd codon positions), 22 transfer RNA genes, and two ribosomal RNA genes. The resultant trees from the two datasets were well resolved and largely congruent with those from previous studies, with spotted green pufferfish being placed in a reasonable phylogenetic position. The approximate divergence times between spotted green pufferfish and model organisms in fishes were 85 million years ago (MYA) vs. torafugu, 183 MYA vs. three-spined stickleback, 191 MYA vs. medaka, and 324 MYA vs. zebrafish, all of which were about twice as old as the divergence times estimated by their earliest occurrences in fossil records.

  5. Modulation of anti-endotoxin property of Temporin L by minor amino acid substitution in identified phenylalanine zipper sequence.

    PubMed

    Srivastava, Saurabh; Kumar, Amit; Tripathi, Amit Kumar; Tandon, Anshika; Ghosh, Jimut Kanti

    2016-11-01

    A 13-residue frog antimicrobial peptide Temporin L (TempL) possesses versatile antimicrobial activities and is considered a lead molecule for the development of new antimicrobial agents. To find out the amino acid sequences that influence the anti-microbial property of TempL, a phenylalanine zipper-like sequence was identified in it which was not reported earlier. Several alanine-substituted analogs and a scrambled peptide having the same composition of TempL were designed for evaluating the role of this motif. To investigate whether leucine residues instead of phenylalanine residues at 'a' and/or 'd' position(s) of the heptad repeat sequence could alter its antimicrobial property, several TempL analogs were synthesized after replacing these phenylalanine residues with leucine residues. Replacing phenylalanine residues with alanine residues in the phenylalanine zipper sequence significantly compromised the anti-endotoxin property of TempL. This is evident from the higher production of tumor necrosis factor-α and interleukin-6 in lipopolysaccharide (LPS)-stimulated rat bone-marrow-derived macrophage cells in the presence of its alanine-substituted analogs than TempL itself. However, replacement of these phenylalanine residues with leucine residues significantly augmented anti-endotoxin property of TempL. A single alanine-substituted TempL analog (F8A-TempL) showed significantly reduced cytotoxicity but retained the antibacterial activity of TempL, while the two single leucine-substituted analogs (F5L-TempL and F8L-TempL), although exhibiting lower cytotoxicity, were able to retain the antibacterial activity of the parent peptide. The results demonstrate how minor amino acid substitutions in the identified phenylalanine zipper sequence in TempL could yield analogs with better antibacterial and/or anti-endotoxin properties with their plausible mechanism of action.

  6. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms

    PubMed Central

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/ PMID:26424080

  7. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    PubMed

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/.

  8. The human erythrocyte anion-transport protein. Partial amino acid sequence, conformation and a possible molecular mechanism for anion exchange.

    PubMed Central

    Brock, C J; Tanner, M J; Kempf, C

    1983-01-01

    The N-terminal 72 residues of an integral membrane fragment, P5, of the human erythrocyte anion-transport protein, which is known to be directly involved in the anion-exchange process, was shown to have the following amino acid sequence: Met-Val-Pro-Lys-Pro-Gln-Gly-Pro-Leu-Pro-Asn-Thr-Ala-Leu-Leu-Ser-Leu-Val-Leu-Met -Ala-Gly-Thr-Phe-Phe-Phe-Ala-Met-Met-Leu-Arg-Lys-Phe-Lys-Asn-Ser-Ser-Tyr-Phe-Pro-Gly-Lys-Leu-Arg-Arg-Val-Ile-Gly-Asp-Phe-Gly-Val-Pro-Ile-Ser-Ile-Leu-Ile-Met-Val-Leu-Val-Asp-Phe-Phe-Ile-Gln-Asp-Thr-Tyr-Thr-Gln- The structure of this fragment was analysed, with account being taken of the constraints that apply to the folding of integral membrane proteins and the topographical locations of various sites in the sequence. It was concluded that this sequence forms two transmembrane alpha-helices. These are probably part of a cluster of amphipathic transmembrane alpha-helices, which could comprise that part of the protein responsible for transport activity. The presently available evidence relating to the anion-exchange process was considered with the structural features noted in this study and a possible molecular mechanism is proposed. In this model the rearrangement of a network of intramembranous charged pairs mediates the translocation of an anion between anion-binding regions at each surface of the membrane, which are composed of clusters of positively charged amino acids. This model imposes a sequential exchange mechanism on the system. Supplementary material, including Tables and Figures describing the compositions of peptides determined by amino acid analysis and sequence studies, quantitative and qualitative data that provide a residue-by-residue justification for the sequence assignment and a description of modifications to and use of the solid-phase sequencer has been deposited as Supplementary Publication SUP 50123 (12 pages) with the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be

  9. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia

    PubMed Central

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O’Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2013-01-01

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project. PMID:25197440

  10. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia.

    PubMed

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  11. Cloning, sequence analysis and expression of the F1F0-ATPase beta-subunit from wine lactic acid bacteria.

    PubMed

    Sievers, Martin; Uermösi, Christina; Fehlmann, Marc; Krieger, Sibylle

    2003-09-01

    The nucleotide sequences of the genes encoding the F1F0-ATPase beta-subunit from Oenococcus oeni, Leuconostoc mesenteroides subsp. mesenteroides, Pediococcus damnosus, Pediococcus parvulus, Lactobacillus brevis and Lactobacillus hilgardii were determined. Their deduced amino acid sequences showed homology values of 79-98%. Data from the alignment and ATPase tree indicated that O. oeni and L. mesenteroides subsp. mesenteroides formed a group well-separated from P. damnosus and P. parvulus and from the group comprises L. brevis and L. hilgardii. The N-terminus of the F1F0-ATPase beta-subunit of O. oeni contains a stretch of additional 38 amino acid residues. The catalytic site of the ATPase beta-subunit of the investigated strains is characterized by the two conserved motifs GGAGVGKT and GERTRE. The amplified atpD coding sequences were inserted into the pCRT7/CT-TOPO vector using TA-cloning strategy and transformed in Escherichia coli. SDS-PAGE and Western blot analyses confirmed that O. oeni has an ATPase beta-subunit protein which is larger in size than the corresponding molecules from the investigated strains.

  12. The amino acid sequence of protein SCMK-B2C from the high-sulphur fraction of wool keratin

    PubMed Central

    Elleman, T. C.

    1972-01-01

    1. The amino acid sequence of a protein from the reduced and carboxymethylated high-sulphur fraction of wool has been determined. 2. The sequence of this S-carboxymethylkerateine (SCMK-B2C) of 151 amino acid residues displays much internal homology and an unusual residue distribution. Thus a ten-residue sequence occurs four times near the N-terminus and five times near the C-terminus with few changes. These regions contain much of the molecule's half-cystine, whereas between them there is a region of 19 residues that are mainly small and devoid of cystine and proline. 3. Certain models of the wool fibre based on its mechanical and physical properties propose a matrix of small compact globular units linked together to form beaded chains. The unusual distribution of the component residues of protein SCMK-B2C suggests structures in the wool-fibre matrix compatible with certain features of the proposed models. PMID:4678578

  13. "De-novo" amino acid sequence elucidation of protein G'e by combined "Top-Down" and "Bottom-Up" mass spectrometry

    NASA Astrophysics Data System (ADS)

    Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F. M.; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L.; Glocker, Michael O.

    2015-03-01

    Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein Ǵ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α- N-gluconoylation and α- N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α- N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant ( K d ) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.

  14. In Silico Characterization of Functional Divergence of Two Cathelicidin Variants in Indian Sheep

    PubMed Central

    Dhaliwal, Kamaljeet K; Arora, Jaspreet S; Mukhopadhyay, Chandra S; Dubey, Prem P

    2015-01-01

    The present work focuses on the in silico characterization of functional divergence of two ovine cathelicidin coding sequence (cds) variants (ie, Cath1 and Cath2) of Indian sheep. Overlapping partial cds of both the cathelicidin variants were cloned in pJet1.2/blunt vector and sequenced. Evolutionary analysis of the Cath2 and Cath1 indicated that the mammalian cathelicidins clustered separately from avian fowlicidins. The avian fowlicidins, which are very different from mammalian cathelicidins (Caths), clearly displayed signatures of purifying selection. The pairwise sequence alignments of translated amino acid sequences of these two sheep cathelicidins showed gaps in the antimicrobial domain of Cath1 variant; however, the amino terminal cathelin regions of both the Caths were conserved. Amino acid sequence analysis of full-length cathelicidins available at public database revealed that Cath1, Cath2, and Cath7 of different ruminant species (including our Cath1 and Cath2 variants) formed individual clads, suggesting that these types have evolved to target specific types of microbes. In silico analysis of Cath1 and Cath2 peptide sequences indicated that the C-terminal antimicrobial peptide domain of Cath2 is more immunogenic than that of the ovine Cath1 due to its higher positive antigenic index, making Cath1 a promising antigen for production of monoclonal antibodies. PMID:26380546

  15. Defining sequence space and reaction products within the cyanuric acid hydrolase (AtzD)/barbiturase protein family.

    PubMed

    Seffernick, Jennifer L; Erickson, Jasmine S; Cameron, Stephan M; Cho, Seunghee; Dodge, Anthony G; Richman, Jack E; Sadowsky, Michael J; Wackett, Lawrence P

    2012-09-01

    Cyanuric acid hydrolases (AtzD) and barbiturases are homologous, found almost exclusively in bacteria, and comprise a rare protein family with no discernible linkage to other protein families or an X-ray structural class. There has been confusion in the literature and in genome projects regarding the reaction products, the assignment of individual sequences as either cyanuric acid hydrolases or barbiturases, and spurious connection of this family to another protein family. The present study has addressed those issues. First, the published enzyme reaction products of cyanuric acid hydrolase are incorrectly identified as biuret and carbon dioxide. The current study employed (13)C nuclear magnetic resonance (NMR) spectroscopy and mass spectrometry to show that cyanuric acid hydrolase releases carboxybiuret, which spontaneously decarboxylates to biuret. This is significant because it revealed that homologous cyanuric acid hydrolases and barbiturases catalyze completely analogous reactions. Second, enzymes that had been annotated incorrectly in genome projects have been reassigned here by bioinformatics, gene cloning, and protein characterization studies. Third, the AtzD/barbiturase family has previously been suggested to consist of members of the amidohydrolase superfamily, a large class of metallohydrolases. Bioinformatics and the lack of bound metals both argue against a connection to the amidohydrolase superfamily. Lastly, steady-state kinetic measurements and observations of protein stability suggested that the AtzD/barbiturase family might be an undistinguished protein family that has undergone some resurgence with the recent introduction of industrial s-triazine compounds such as atrazine and melamine into the environment.

  16. A molecular mechanism realizing sequence-specific recognition of nucleic acids by TDP-43

    PubMed Central

    Furukawa, Yoshiaki; Suzuki, Yoh; Fukuoka, Mami; Nagasawa, Kenichi; Nakagome, Kenta; Shimizu, Hideaki; Mukaiyama, Atsushi; Akiyama, Shuji

    2016-01-01

    TAR DNA-binding protein 43 (TDP-43) is a DNA/RNA-binding protein containing two consecutive RNA recognition motifs (RRM1 and RRM2) in tandem. Functional abnormality of TDP-43 has been proposed to cause neurodegeneration, but it remains obscure how the physiological functions of this protein are regulated. Here, we show distinct roles of RRM1 and RRM2 in the sequence-specific substrate recognition of TDP-43. RRM1 was found to bind a wide spectrum of ssDNA sequences, while no binding was observed between RRM2 and ssDNA. When two RRMs are fused in tandem as in native TDP-43, the fused construct almost exclusively binds ssDNA with a TG-repeat sequence. In contrast, such sequence-specificity was not observed in a simple mixture of RRM1 and RRM2. We thus propose that the spatial arrangement of multiple RRMs in DNA/RNA binding proteins provides steric effects on the substrate-binding site and thereby controls the specificity of its substrate nucleotide sequences. PMID:26838063

  17. The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

    PubMed

    Mir, Rafia; Jallu, Shais; Singh, T P

    2015-06-01

    The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.

  18. Nucleotide sequence and spatial expression pattern of a drought- and abscisic Acid-induced gene of tomato.

    PubMed

    Plant, A L; Cohen, A; Moses, M S; Bray, E A

    1991-11-01

    The nucleotide sequence of le16, a tomato (Lycopersicon esculentum Mill.) gene induced by drought stress and regulated by abscisic acid specifically in aerial vegetative tissue, is presented. The single open reading frame contained within the gene has the capacity to encode a polypeptide of 12.7 kilodaltons and is interrupted by a small intron. The predicted polypeptide is rich in leucine, glycine, and alanine and has an isoelectric point of 8.7. The amino terminus is hydrophobic and characteristic of signal sequences that target polypeptides for export from the cytoplasm. There is homology (47.2% identity) between the amino terminus of the LE 16 polypeptide and the corresponding amino terminal domain of the maize phospholipid transfer protein. le16 was expressed in drought-stressed leaf, petiole, and stem tissue and to a much lower extent in the pericarp of mature green tomato fruit and developing seeds. No expression was detected in the pericarp of red fruit or in drought-stressed roots. Expression of le16 was also induced in leaf tissue by a variety of other abiotic stresses including polyethylene glycol-mediated water deficit, salinity, cold stress, and heat stress. None of these stresses or direct applications of abscisic acid induced the expression of le16 in the roots of the same plants. The unique expression characteristics of this gene indicates that novel regulatory mechanisms, in addition to endogenous abscisic acid, are involved in controlling gene expression.

  19. Characterization of fatty acid-producing wastewater microbial communities using next generation sequencing technologies

    EPA Science Inventory

    While wastewater represents a viable source of bacterial biodiesel production, very little is known on the composition of these microbial communities. We studied the taxonomic diversity and succession of microbial communities in bioreactors accumulating fatty acids using 454-pyro...

  20. A reliable and sensitive bead-based fluorescence assay for identification of nucleic acid sequences

    NASA Astrophysics Data System (ADS)

    Klamp, Tobias; Yahiatène, Idir; Lampe, André; Schüttpelz, Mark; Sauer, Markus

    2011-03-01

    The sensitive and rapid detection of pathogenic DNA is of tremendous importance in the field of diagnostics. We demonstrate the ability of detecting and quantifying single- and double-stranded pathogenic DNA with picomolar sensitivity in a bead-based fluorescence assay. Selecting appropriate capturing and detection sequences enables rapid (2 h) and reliable DNA quantification. We show that synthetic sequences of S. pneumoniae and M. luteus can be quantified in very small sample volumes (20 μL) across a linear detection range over four orders of magnitude from 1 nM to 1 pM, using a miniaturized wide-field fluorescence microscope without amplification steps. The method offers single molecule detection sensitivity without using complex setups and thus volunteers as simple, robust, and reliable method for the sensitive detection of DNA and RNA sequences.

  1. Using Dendritic Heat Maps to Simultaneously Display Genotype Divergence with Phenotype Divergence

    PubMed Central

    Kellom, Matthew; Raymond, Jason

    2016-01-01

    The advancement of techniques to visualize and analyze large-scale sequencing datasets is an area of active research and is rooted in traditional techniques such as heat maps and dendrograms. We introduce dendritic heat maps that display heat map results over aligned DNA sequence clusters for a range of clustering cutoffs. Dendritic heat maps aid in visualizing the effects of group differences on clustering hierarchy and relative abundance of sampled sequences. Here, we artificially generate two separate datasets with simplified mutation and population growth procedures with GC content group separation to use as example phenotypes. In this work, we use the term phenotype to represent any feature by which groups can be separated. These sequences were clustered in a fractional identity range of 0.75 to 1.0 using agglomerative minimum-, maximum-, and average-linkage algorithms, as well as a divisive centroid-based algorithm. We demonstrate that dendritic heat maps give freedom to scrutinize specific clustering levels across a range of cutoffs, track changes in phenotype inequity across multiple levels of sequence clustering specificity, and easily visualize how deeply rooted changes in phenotype inequity are in a dataset. As genotypes diverge in sample populations, clusters are shown to break apart into smaller clusters at higher identity cutoff levels, similar to a dendrogram. Phenotype divergence, which is shown as a heat map of relative abundance bin response, may or may not follow genotype divergences. This joined view highlights the relationship between genotype and phenotype divergence for treatment groups. We discuss the minimum-, maximum-, average-, and centroid-linkage algorithm approaches to building dendritic heat maps and make a case for the divisive “top-down” centroid-based clustering methodology as being the best option visualize the effects of changing factors on clustering hierarchy and relative abundance. PMID:27536963

  2. Lactose synthesis in a monotreme, the echidna (Tachyglossus aculeatus): isolation and amino acid sequence of echidna alpha-lactalbumin.

    PubMed

    Messer, M; Griffiths, M; Rismiller, P D; Shaw, D C

    1997-10-01

    alpha-Lactalbumin and lysozyme were each isolated from echidna (Tachyglossus aculeatus) milk by gel permeation and ion exchange chromatography. The alpha-lactalbumin modified the action of echidna milk galactosyltransferase to promote the synthesis of lactose but had very little effect on bovine galactosyltransferase. Echidna alpha-lactalbumin is a glycosylated protein with an apparent molecular weight of 20,000 (SDS-PAGE) whose concentration in the milk is very low compared with the concentrations of alpha-lactalbumin in the milk of other species. Its amino acid sequence is more similar to that of another monotreme, the platypus (Ornithorhynchus anatinus), than to the sequences of eutherian or marsupial alpha-lactalbumins. Echidna milk lysozyme, even at high concentrations, did not promote the synthesis of lactose by either echidna or bovine galactosyltransferase. We conclude that lactose synthesis in the echidna occurs by the same mechanism as that found in the platypus and other mammals.

  3. Effects of the amino acid sequence on thermal conduction through β-sheet crystals of natural silk protein.

    PubMed

    Zhang, Lin; Bai, Zhitong; Ban, Heng; Liu, Ling

    2015-11-21

    Recent experiments have discovered very different thermal conductivities between the spider silk and the silkworm silk. Decoding the molecular mechanisms underpinning the distinct thermal properties may guide the rational design of synthetic silk materials and other biomaterials for multifunctionality and tunable properties. However, such an understanding is lacking, mainly due to the complex structure and phonon physics associated with the silk materials. Here, using non-equilibrium molecular dynamics, we demonstrate that the amino acid sequence plays a key role in the thermal conduction process through β-sheets, essential building blocks of natural silks and a variety of other biomaterials. Three representative β-sheet types, i.e. poly-A, poly-(GA), and poly-G, are shown to have distinct structural features and phonon dynamics leading to different thermal conductivities. A fundamental understanding of the sequence effects may stimulate the design and engineering of polymers and biopolymers for desired thermal properties.

  4. The Role of HIV-1 gp41 Glycoprotein in Infectious Tropism Inferred from Physico-Chemical Properties of its Amino Acid Sequence

    NASA Astrophysics Data System (ADS)

    Figueroa, E.; Villarreal, C.; Huerta, L.; Cocho, G.

    2006-09-01

    We performed a statistical analysis of the amino acid sequence of the gp41 ectodomain of the Human Immunodeficiency Virus type 1. We foun