Science.gov

Sample records for acid sequence divergence

  1. Inferences from protein and nucleic acid sequences - Early molecular evolution, divergence of kingdoms and rates of change

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.; Barker, W. C.; Mclaughlin, P. J.

    1974-01-01

    Description of new sensitive, objective methods for establishing the probable common ancestry of very distantly related sequences and the quantitative evolutionary change which has taken place. These methods are applied to four families of proteins and nucleic acids, and evolutionary trees are derived where possible. Of the three families containing duplications of genetic material, two are nucleic acids: transfer RNA and 5S ribosomal RNA. Both of these structures are functional in the synthesis of coded proteins, and prototypes must have been present in the cell at the inception of the fundamental coding process that all living things share. There are many types of tRNA which recognize the various nucleotide triplets and the 20 amino acids. These types are thought to have arisen as a result of many gene duplications. Relationships among these types are discussed. The 5S ribosomal RNA, presently functional in both eukaryotes and prokaryotes, is very likely descended from an early form incorporating almost a complete duplication of genetic material. The amount of evolution in the various lines can again be compared. The other two families are proteins: ferredoxin and cytochrome c.

  2. Variation in seed fatty acid composition and sequence divergence in the FAD2 gene coding region between wild and cultivated sesame.

    PubMed

    Chen, Zhenbang; Tonnis, Brandon; Morris, Brad; Wang, Richard B; Zhang, Amy L; Pinnow, David; Wang, Ming Li

    2014-12-01

    Sesame germplasm harbors genetic diversity which can be useful for sesame improvement in breeding programs. Seven accessions with different levels of oleic acid were selected from the entire USDA sesame germplasm collection (1232 accessions) and planted for morphological observation and re-examination of fatty acid composition. The coding region of the FAD2 gene for fatty acid desaturase (FAD) in these accessions was also sequenced. Cultivated sesame accessions flowered and matured earlier than the wild species. The cultivated sesame seeds contained a significantly higher percentage of oleic acid (40.4%) than the seeds of the wild species (26.1%). Nucleotide polymorphisms were identified in the FAD2 gene coding region between wild and cultivated species. Some nucleotide polymorphisms led to amino acid changes, one of which was located in the enzyme active site and may contribute to the altered fatty acid composition. Based on the morphological observation, chemical analysis, and sequence analysis, it was determined that two accessions were misnamed and need to be reclassified. The results obtained from this study are useful for sesame improvement in molecular breeding programs.

  3. Independence divergence-generated binary trees of amino acids.

    PubMed

    Tusnády, G E; Tusnády, G; Simon, I

    1995-05-01

    The discovery of the relationship between amino acids is important in terms of the replacement ability, as used in protein engineering homology studies, and gaining a better understanding of the roles which various properties of the residues play in the creation of a unique, stable, 3-D protein structure. Amino acid sequences of proteins edited by evolution are anything but random. The measure of nonrandomness, i.e. the level of editing, can be characterized by an independence divergence value. This parameter is used to generate binary tree relationships between amino acids. The relationships of residues presented in this paper are based on protein building features and not on the physico-chemical characteristics of amino acids. This approach is not biased by the tautology present in all sequence similarity-based relationship studies. The roles which various physico-chemical characteristics play in the determination of the relationships between amino acids are also discussed.

  4. DIVEIN: a web server to analyze phylogenies, sequence divergence, diversity, and informative sites.

    PubMed

    Deng, Wenjie; Maust, Brandon S; Nickle, David C; Learn, Gerald H; Liu, Yi; Heath, Laura; Kosakovsky Pond, Sergei L; Mullins, James I

    2010-05-01

    DIVEIN is a web interface that performs automated phylogenetic and other analyses of nucleotide and amino acid sequences. Starting with a set of aligned sequences, DIVEIN estimates evolutionary parameters and phylogenetic trees while allowing the user to choose from a variety of evolutionary models; it then reconstructs the consensus (CON), most recent common ancestor (MRCA), and center of tree (COT) sequences. DIVEIN also provides tools for further analyses, including condensing sequence alignments to show only informative sites or private mutations; computing phylogenetic or pairwise divergence from any user-specified sequence (CON, MRCA, COT, or existing sequence from the alignment); computing and outputting all genetic distances in column format; calculating summary statistics of diversity and divergence from pairwise distances; and graphically representing the inferred tree and plots of divergence, diversity, and distance distribution histograms. DIVEIN is available at http://indra.mullins.microbiol.washington.edu/DIVEIN.
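    The diversity and divergence summaries mentioned in this record can be illustrated with a minimal sketch. It assumes a simple proportion-of-differences distance (p-distance) and a majority-rule consensus; DIVEIN itself estimates distances under user-chosen evolutionary models, so the function names and the distance measure below are illustrative assumptions, not the server's implementation.

```python
from collections import Counter
from itertools import combinations

def p_distance(x, y):
    """Proportion of differing sites between two aligned sequences, ignoring gap columns."""
    sites = [(a, b) for a, b in zip(x, y) if a != "-" and b != "-"]
    return sum(a != b for a, b in sites) / len(sites)

def consensus(alignment):
    """Majority-rule consensus: the most common character in each alignment column."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*alignment))

def diversity(alignment):
    """Mean pairwise distance among all sequences in the alignment."""
    pairs = list(combinations(alignment, 2))
    return sum(p_distance(x, y) for x, y in pairs) / len(pairs)

def divergence(alignment, reference):
    """Mean distance of every sequence from a chosen reference sequence."""
    return sum(p_distance(s, reference) for s in alignment) / len(alignment)

# Toy alignment (hypothetical data, for illustration only).
alignment = ["ACGT", "ACGA", "ACGT"]
ref = consensus(alignment)
print(ref, diversity(alignment), divergence(alignment, ref))
```

    In this sketch the reference is the consensus, but any sequence of the same aligned length (for example an inferred ancestor) could be passed to `divergence` instead.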

  5. Kinetoplast DNA minicircles: regions of extensive sequence divergence.

    PubMed Central

    Rogers, W O; Wirth, D F

    1987-01-01

    Previous work has shown that the kinetoplast minicircle DNA of Leishmania species exhibits species-specific sequence divergence and this observation has led to the development of a DNA probe-based diagnostic test for leishmaniasis. In the work reported here, we demonstrate that the minicircle is composed of three types of DNA sequences with differing specificities reflecting different rates of DNA sequence change. A library of cloned fragments of kinetoplast DNA (kDNA) from Leishmania mexicana amazonensis was prepared and the cloned subfragments were found to contain DNA sequences with different taxonomic specificities based on hybridization analysis with various species of Leishmania. Four groups of subfragments were found, those that hybridized with a large number of Leishmania sp. as well as sequences unique to the species, subspecies, or isolate. Analysis of nested deletions of a single, full-length minicircle demonstrates that these different taxonomic specificities are contained within a single minicircle. This implies that different regions of a single minicircle have DNA sequences that diverge at different rates. These sequences represent potentially valuable tools in diagnostic, epidemiologic, and ecological studies of leishmaniasis and provide the basis for a model of kDNA sequence evolution. PMID:3025880

  6. Variation in Seed Fatty Acid Composition, and Sequence Divergence in the FAD2 Gene Coding Region between Wild and Cultivated Sesame

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Sesame germplasm harbors genetic diversity which can be useful for sesame improvement in breeding programs. Seven accessions with different levels of oleic acid were selected from the entire USDA sesame germplasm collection (1232 accessions) and planted for morphological observation and re-examinati...

  7. Generalization of Entropy Based Divergence Measures for Symbolic Sequence Analysis

    PubMed Central

    Ré, Miguel A.; Azad, Rajeev K.

    2014-01-01

    Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms. PMID:24728338
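    As a point of reference for the measure this record generalizes, here is a minimal sketch of the standard (Shannon) weighted Jensen-Shannon divergence between symbol compositions. It does not implement the Tsallis, Markovian, or Tsallis-Markovian generalizations described above; the function names and the toy sequences are assumptions made for illustration.

```python
import math
from collections import Counter

def composition(seq, alphabet="ACGT"):
    """Relative frequency of each alphabet symbol in seq."""
    counts = Counter(seq)
    total = sum(counts[s] for s in alphabet)
    return [counts[s] / total for s in alphabet]

def shannon_entropy(p):
    """Shannon entropy in bits; zero-probability terms contribute nothing."""
    return -sum(x * math.log2(x) for x in p if x > 0)

def jensen_shannon(dists, weights=None):
    """Weighted JSD: H(sum_i w_i * p_i) - sum_i w_i * H(p_i), for any number of distributions."""
    if weights is None:
        weights = [1 / len(dists)] * len(dists)
    mixture = [
        sum(w * p[k] for w, p in zip(weights, dists))
        for k in range(len(dists[0]))
    ]
    return shannon_entropy(mixture) - sum(
        w * shannon_entropy(p) for w, p in zip(weights, dists)
    )

# Toy segments (hypothetical): identical compositions give 0, disjoint ones give 1 bit.
left, right = composition("AAAA"), composition("CCCC")
print(jensen_shannon([left, left]), jensen_shannon([left, right]))
```

    The `weights` argument reflects the property noted in the abstract that weights can be associated with the distributions; a common choice is to weight each segment by its relative length.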

  8. Generalization of entropy based divergence measures for symbolic sequence analysis.

    PubMed

    Ré, Miguel A; Azad, Rajeev K

    2014-01-01

    Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms. PMID:24728338

  9. PhyPA: Phylogenetic method with pairwise sequence alignment outperforms likelihood methods in phylogenetics involving highly diverged sequences.

    PubMed

    Xia, Xuhua

    2016-09-01

    While pairwise sequence alignment (PSA) by dynamic programming is guaranteed to generate one of the optimal alignments, multiple sequence alignment (MSA) of highly divergent sequences often results in poorly aligned sequences, plaguing all subsequent phylogenetic analysis. One way to avoid this problem is to use only PSA to reconstruct phylogenetic trees, which can only be done with distance-based methods. I compared the accuracy of this new computational approach (named PhyPA for phylogenetics by pairwise alignment) against the maximum likelihood method using MSA (the ML+MSA approach), based on nucleotide, amino acid and codon sequences simulated with different topologies and tree lengths. I present a surprising discovery that the fast PhyPA method consistently outperforms the slow ML+MSA approach for highly diverged sequences even when all optimization options were turned on for the ML+MSA approach. Only when sequences are not highly diverged (i.e., when a reliable MSA can be obtained) does the ML+MSA approach outperform PhyPA. The true topologies are always recovered by ML with the true alignment from the simulation. However, with MSA derived from alignment programs such as MAFFT or MUSCLE, the recovered topology consistently has higher likelihood than that for the true topology. Thus, the failure to recover the true topology by the ML+MSA approach is due not to insufficient search of tree space, but to the distortion of phylogenetic signal by MSA methods. I have implemented PhyPA in DAMBE, along with two approaches that make use of multi-gene data sets to derive phylogenetic support for subtrees, equivalent to resampling techniques such as bootstrapping and jackknifing.
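    The core idea in this record, computing evolutionary distances from pairwise alignments only, can be sketched as follows. This is a minimal illustration under stated assumptions: a Needleman-Wunsch global alignment with arbitrary linear scores and a simple p-distance. It is not the DAMBE implementation, and the scoring values and function names are hypothetical. The resulting matrix would then be passed to a distance-based tree method, which is not shown.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment; returns one optimal alignment as two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right cell to recover the alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))

def p_distance(x, y):
    """Proportion of mismatches over aligned columns that contain no gap."""
    sites = [(c, d) for c, d in zip(x, y) if c != "-" and d != "-"]
    return sum(c != d for c, d in sites) / len(sites)

def distance_matrix(seqs):
    """Distances from independent pairwise alignments; no multiple alignment is built."""
    k = len(seqs)
    d = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1, k):
            d[i][j] = d[j][i] = p_distance(*global_align(seqs[i], seqs[j]))
    return d

print(global_align("ACGT", "AGT"))
```

    Because each pair is aligned on its own, a poorly alignable sequence affects only the distances that involve it rather than every column of a shared multiple alignment.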

  10. PhyPA: Phylogenetic method with pairwise sequence alignment outperforms likelihood methods in phylogenetics involving highly diverged sequences.

    PubMed

    Xia, Xuhua

    2016-09-01

    While pairwise sequence alignment (PSA) by dynamic programming is guaranteed to generate one of the optimal alignments, multiple sequence alignment (MSA) of highly divergent sequences often results in poorly aligned sequences, plaguing all subsequent phylogenetic analysis. One way to avoid this problem is to use only PSA to reconstruct phylogenetic trees, which can only be done with distance-based methods. I compared the accuracy of this new computational approach (named PhyPA for phylogenetics by pairwise alignment) against the maximum likelihood method using MSA (the ML+MSA approach), based on nucleotide, amino acid and codon sequences simulated with different topologies and tree lengths. I present a surprising discovery that the fast PhyPA method consistently outperforms the slow ML+MSA approach for highly diverged sequences even when all optimization options were turned on for the ML+MSA approach. Only when sequences are not highly diverged (i.e., when a reliable MSA can be obtained) does the ML+MSA approach outperform PhyPA. The true topologies are always recovered by ML with the true alignment from the simulation. However, with MSA derived from alignment programs such as MAFFT or MUSCLE, the recovered topology consistently has higher likelihood than that for the true topology. Thus, the failure to recover the true topology by the ML+MSA approach is due not to insufficient search of tree space, but to the distortion of phylogenetic signal by MSA methods. I have implemented PhyPA in DAMBE, along with two approaches that make use of multi-gene data sets to derive phylogenetic support for subtrees, equivalent to resampling techniques such as bootstrapping and jackknifing. PMID:27377322

  11. Sequence of inequalities among fuzzy mean difference divergence measures and their applications.

    PubMed

    Tomar, Vijay Prakash; Ohlan, Anshu

    2014-01-01

    This paper presents a sequence of fuzzy mean difference divergence measures. The validity of these fuzzy mean difference divergence measures is proved axiomatically. In addition, it introduces a sequence of inequalities among some of these fuzzy mean difference divergence measures. The applications of proposed fuzzy mean difference divergence measures in the context of pattern recognition have been presented using a numerical example. It is shown that the proposed fuzzy mean difference divergence measures are well suited to use with linguistic variables. Finally, on establishing inequalities, we find that our proposed measures are computationally much more efficient.

  12. Extraordinary sequence divergence at Tsga8, an X-linked gene involved in mouse spermiogenesis.

    PubMed

    Good, Jeffrey M; Vanderpool, Dan; Smith, Kimberly L; Nachman, Michael W

    2011-05-01

    The X chromosome plays an important role in both adaptive evolution and speciation. We used a molecular evolutionary screen of X-linked genes potentially involved in reproductive isolation in mice to identify putative targets of recurrent positive selection. We then sequenced five very rapidly evolving genes within and between several closely related species of mice in the genus Mus. All five genes were involved in male reproduction and four of the genes showed evidence of recurrent positive selection. The most remarkable evolutionary patterns were found at Testis-specific gene a8 (Tsga8), a spermatogenesis-specific gene expressed during postmeiotic chromatin condensation and nuclear transformation. Tsga8 was characterized by extremely high levels of insertion-deletion variation of an alanine-rich repetitive motif in natural populations of Mus domesticus and M. musculus, differing in length from the reference mouse genome by up to 89 amino acids (27% of the total protein length). This population-level variation was coupled with striking divergence in protein sequence and length between closely related mouse species. Although no clear orthologs had previously been described for Tsga8 in other mammalian species, we have identified a highly divergent hypothetical gene on the rat X chromosome that shares clear orthology with the 5' and 3' ends of Tsga8. Further inspection of this ortholog verified that it is expressed in rat testis and shares remarkable similarity with mouse Tsga8 across several general features of the protein sequence despite no conservation of nucleotide sequence across over 60% of the rat-coding domain. Overall, Tsga8 appears to be one of the most rapidly evolving genes to have been described in rodents. We discuss the potential evolutionary causes and functional implications of this extraordinary divergence and the possible contribution of Tsga8 and the other four genes we examined to reproductive isolation in mice. PMID:21186189

  13. Composition for nucleic acid sequencing

    SciTech Connect

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  14. Divergence of unique DNA sequences in Lepidoptera (insecta)

    SciTech Connect

    Ado, N.Yu.; Petrov, N.B.

    1985-07-01

    The authors contribute additional information about the interrelationship between lepidopteran taxa. Such information may be obtained by the techniques of genosystematics. DNA was extracted from the last larval instars of images by the usual methods. The labeled and unlabeled fragments were 150-250 nucleotide pairs (n.p.) long. DNA hybridization was performed under gentle conditions: 0.5 M Na phosphate buffer, pH 6.8, at 55 °C, incubation to 10,000 C₀t (product of the initial concentration of single-strand DNA molecules, in moles/liter, and the time of incubation, in seconds), permitting the comparison of genomes of species from taxa phylogenetically far apart. The ratio of labeled fragments of reference unique DNA sequences to unlabeled total DNA was 1:2500. The techniques of hybridization and determination of the thermal stability (Tₘ) of the duplexes, taking into account the auto-reassociation of the labeled DNA fragments. The radioactivity was determined in a dioxane-based scintillator, after partial acid hydrolysis of the DNA at 55 °C, in a counter. Under the experimental conditions chosen, reassociation of unique DNA sequences in homologous reactions was 70-85%.

  15. Molecular evolution and sequence divergence of plant chalcone synthase and chalcone synthase-Like genes.

    PubMed

    Han, Yingying; Zhao, Wenwen; Wang, Zhicui; Zhu, Jingying; Liu, Qisong

    2014-06-01

    Plant chalcone synthase (CHS) and CHS-Like (CHSL) proteins are polyketide synthases. In this study, we evaluated the molecular evolution of this gene family using representative types of CHSL genes, including stilbene synthase (STS), 2-pyrone synthase (2-PS), bibenzyl synthase (BBS), acridone synthase (ACS), biphenyl synthase (BIS), benzalacetone synthase, coumaroyl triacetic acid synthase (CTAS), and benzophenone synthase (BPS), along with their CHS homologs from the same species of both angiosperms and gymnosperms. A cDNA-based phylogeny indicated that CHSLs had diverse evolutionary patterns. STS, ACS, and 2-PS clustered with CHSs from the same species (late diverged pattern), while CTAS, BBS, BPS, and BIS were distant from their CHS homologs (early diverged pattern). The amino-acid phylogeny suggested that CHS and CHSL proteins formed clades according to enzyme function. The CHSs and CHSLs from Polygonaceae and Arachis had unique evolutionary histories. Synonymous mutation rates were lower in late diverged CHSLs than in early diverged ones, indicating that gene duplications occurred more recently in late diverged CHSLs than in early diverged ones. Relative rate tests proved that late diverged CHSLs had unequal rates to CHSs from the same species when using fatty acid synthase, which shares a common ancestor with the CHS superfamily, as the outgroup, while the early diverged lineages had equal rates. This indicated that late diverged CHSLs experienced more frequent mutation than early diverged CHSLs after gene duplication, allowing them to acquire new functions in a relatively short period of time.

  16. Functionally conserved enhancers with divergent sequences in distant vertebrates

    SciTech Connect

    Yang, Song; Oksenberg, Nir; Takayama, Sachiko; Heo, Seok -Jin; Poliakov, Alexander; Ahituv, Nadav; Dubchak, Inna; Boffelli, Dario

    2015-10-30

    To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. In conclusion, our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.

  17. Structure and sequence divergence of two archaebacterial genes.

    PubMed Central

    Cue, D; Beckler, G S; Reeve, J N; Konisky, J

    1985-01-01

    The DNA sequences of a region that includes the hisA gene of two related methanogenic archaebacteria, Methanococcus voltae and Methanococcus vannielii, have been compared. Both organisms show a similar genome organization in this region, displaying three open reading frames (ORFs) separated by regions of very high A + T content. Two of the ORFs, including ORFHisA, show significant DNA sequence homology. As might be expected for organisms having a genome that is A + T-rich, there is a high preference for A and U as the third base in codons. Although the regions upstream of the structural genes contain prokaryotic-like promoter sequences, it is not known whether they are recognized as promoters in these archaebacterial cells. A ribosome binding site, G-G-T-G, is located 6 base pairs preceding the ATG translation initiation sequence of both hisA genes. The sequences upstream of the two hisA genes show only limited sequence homology. The M. voltae intergenic region contains four tandemly arranged repetitions of an 11-base-pair sequence, whereas the M. vannielii sequence contains both direct and inverted repetitive sequences. Based on the degree of hisA sequence homology, we conclude that M. voltae and M. vannielii are less closely related taxonomically than are members of the enteric group of eubacteria. PMID:3923489

  18. The role of Drosophila mismatch repair in suppressing recombination between diverged sequences.

    PubMed

    Do, Anthony T; LaRocque, Jeannine R

    2015-11-30

    DNA double-strand breaks (DSBs) must be accurately repaired to maintain genomic integrity. DSBs can be repaired by homologous recombination (HR), which uses an identical sequence as a template to restore the genetic information lost at the break. Suppression of recombination between diverged sequences is essential to the repair of DSBs without aberrant and potentially mutagenic recombination between non-identical sequences, such as Alu repeats in the human genome. The mismatch repair (MMR) machinery has been found to suppress recombination between diverged sequences in murine cells. To test if this phenomenon is conserved in whole organisms, two DSB repair systems were utilized in Drosophila melanogaster. The DR-white and DR-white.mu assays provide a method of measuring DSB repair outcomes between identical and diverged sequences, respectively. msh6(-/-) flies, deficient in MMR, were not capable of suppressing recombination between sequences with 1.4% divergence, and the average gene conversion tract length did not differ between msh6(-/+) and msh6(-/-) flies. These findings suggest that MMR has an early role in suppressing recombination between diverged sequences that is conserved in Drosophila.

  19. Structure and sequence divergence of two archaebacterial genes

    SciTech Connect

    Cue, D.; Beckler, G.S.; Reeve, J.N.; Konisky, J.

    1985-06-01

    The DNA sequences of a region that includes the hisA gene of two related methanogenic archaebacteria, Methanococcus voltae and Methanococcus vannielii, have been compared. Both organisms show a similar genome organization in this region, displaying three open reading frames (ORFs) separated by regions of very high A+T content. Two of the ORFs, including ORFHisA, show significant DNA sequence homology. As might be expected for organisms having a genome that is A+T-rich, there is a high preference for A and U as the third base in codons. A ribosome binding site, G-G-T-G, is located 6 base pairs preceding the ATG translation initiation sequence of both hisA genes. The sequences upstream of the two hisA genes show only limited sequence homology. The M. voltae intergenic region contains four tandemly arranged repetitions of an 11-base-pair sequence, whereas the M. vannielii sequence contains both direct and inverted repetitive sequences. Based on the degree of hisA sequence homology, the authors conclude that M. voltae and M. vannielii are less closely related taxonomically than are members of the enteric group of eubacteria.

  20. High speed nucleic acid sequencing

    SciTech Connect

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  1. Comparative Species Divergence across Eight Triplets of Spiny Lizards (Sceloporus) Using Genomic Sequence Data

    PubMed Central

    Leaché, Adam D.; Harris, Rebecca B.; Maliska, Max E.; Linkem, Charles W.

    2013-01-01

    Species divergence is typically thought to occur in the absence of gene flow, but many empirical studies are discovering that gene flow may be more pervasive during species formation. Although many examples of divergence with gene flow have been identified, few clades have been investigated in a comparative manner, and fewer have been studied using genome-wide sequence data. We contrast species divergence genetic histories across eight triplets of North American Sceloporus lizards using a maximum likelihood implementation of the isolation–migration (IM) model. Gene flow at the time of species divergence is modeled indirectly as variation in species divergence time across the genome or explicitly using a migration rate parameter. Likelihood ratio tests (LRTs) are used to test the null model of no gene flow at speciation against these two alternative gene flow models. We also use the Akaike information criterion to rank the models. Hundreds of loci are needed for the LRTs to have statistical power, and we use genome sequencing of reduced representation libraries to obtain DNA sequence alignments at many loci (between 340 and 3,478; mean = 1,678) for each triplet. We find that current species distributions are a poor predictor of whether a species pair diverged with gene flow. Interrogating the genome using the triplet method expedites the comparative study of species divergence history and the estimation of genetic parameters associated with speciation. PMID:24259316
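    The model comparison described in this record rests on two generic calculations, sketched below: the likelihood ratio statistic for nested models and the Akaike information criterion. The log-likelihood values are invented for illustration, the function names are hypothetical, and the chi-square reference distribution with one degree of freedom is an assumption of this sketch; the appropriate null distribution depends on the specific model comparison and is not taken from the study.

```python
import math

def aic(log_lik, n_params):
    """Akaike information criterion: 2k - 2 ln L (lower is better)."""
    return 2 * n_params - 2 * log_lik

def lrt_statistic(log_lik_null, log_lik_alt):
    """Likelihood ratio statistic for nested models: 2 (ln L_alt - ln L_null)."""
    return 2 * (log_lik_alt - log_lik_null)

def chi2_sf_df1(x):
    """Upper-tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2))

# Hypothetical log-likelihoods for a no-gene-flow model and a model
# with one extra migration-rate parameter (illustrative numbers only).
ll_null, k_null = -1052.3, 3
ll_alt, k_alt = -1048.1, 4

stat = lrt_statistic(ll_null, ll_alt)
print("LRT statistic:", stat, "p (df=1):", chi2_sf_df1(stat))
print("AIC null:", aic(ll_null, k_null), "AIC alt:", aic(ll_alt, k_alt))
```

    Ranking by AIC does not require the models to be nested, which is one reason to report it alongside the likelihood ratio tests.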

  2. Efficient algorithm for video sequence matching using the Hausdorff distance and the directed divergence

    NASA Astrophysics Data System (ADS)

    Kim, Sang Hyun; Park, Rae-Hong

    2000-12-01

    To manipulate large video databases, effective video indexing and retrieval are required. While most algorithms for video retrieval can be used for frame-wise user query or video content query, video sequence matching has not been investigated much. In this paper, we propose an efficient algorithm to match video sequences using the modified Hausdorff distance, and a video indexing method using the directed divergence of histograms between successive frames. To match the video sequences effectively and to reduce the computational complexity, we use the key frames extracted by the cumulative directed divergence, and compare the sets of key frames using the Hausdorff distance. Experimental results show that the proposed video sequence matching and video indexing algorithms using the Hausdorff distance and the directed divergence yield remarkably high accuracy and performance compared with conventional algorithms such as histogram difference or histogram intersection methods.
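    The two ingredients named in this record can be sketched as follows. The sketch makes several assumptions not stated in the abstract: the directed divergence is taken to be the Kullback-Leibler form on normalized histograms with a small smoothing constant, the "modified" Hausdorff distance is taken to be the mean-of-minimum-distances variant, frames are compared by an L1 histogram distance, and a key frame is emitted whenever the cumulative divergence passes a threshold. All function names and parameters are hypothetical.

```python
import math

def directed_divergence(p, q, eps=1e-9):
    """Directed (Kullback-Leibler) divergence sum p*log(p/q) for normalized histograms."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q) if pi > 0)

def l1(h1, h2):
    """L1 distance between two histograms of equal length."""
    return sum(abs(a - b) for a, b in zip(h1, h2))

def mean_min_distance(A, B, dist=l1):
    """Average, over items in A, of the distance to the closest item in B."""
    return sum(min(dist(a, b) for b in B) for a in A) / len(A)

def modified_hausdorff(A, B, dist=l1):
    """Symmetric set distance: the larger of the two directed mean-min distances."""
    return max(mean_min_distance(A, B, dist), mean_min_distance(B, A, dist))

def key_frames(histograms, threshold):
    """Indices of frames where the cumulative divergence since the last key frame exceeds threshold."""
    keys, total = [0], 0.0
    for t in range(1, len(histograms)):
        total += directed_divergence(histograms[t - 1], histograms[t])
        if total > threshold:
            keys.append(t)
            total = 0.0
    return keys

# Toy two-bin histograms (hypothetical): a scene change occurs at frame 2.
video = [[0.5, 0.5], [0.5, 0.5], [0.9, 0.1], [0.9, 0.1]]
print(key_frames(video, threshold=0.1))
```

    Comparing only the key-frame sets keeps the number of histogram comparisons proportional to the product of the key-frame counts rather than of the full frame counts.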

  3. Simple sequence repeat variations expedite phage divergence: Mechanisms of indels and gene mutations.

    PubMed

    Lin, Tiao-Yin

    2016-07-01

    Phages are the most abundant biological entities and influence prokaryotic communities on Earth. Comparing closely related genomes sheds light on molecular events shaping phage evolution. Simple sequence repeat (SSR) variations impart over half of the genomic changes between T7M and T3, indicating an important role of SSRs in accelerating phage genetic divergence. Differences in coding and noncoding regions of phages infecting different hosts, coliphages T7M and T3, Yersinia phage ϕYeO3-12, and Salmonella phage ϕSG-JL2, frequently arise from SSR variations. Such variations modify noncoding and coding regions; the latter efficiently changes multiple amino acids, thereby hastening protein evolution. Four classes of events are found to drive SSR variations: insertion/deletion of SSR units, expansion/contraction of SSRs without alteration of genome length, changes of repeat motifs, and generation/loss of repeats. The categorization demonstrates the ways SSRs mutate in genomes during phage evolution. Indels are common constituents of genome variations and human diseases, yet, how they occur without preexisting repeat sequence is less understood. Non-repeat-unit-based misalignment-elongation (NRUBME) is proposed to be one mechanism for indels without adjacent repeats. NRUBME or consecutive NRUBME may also change repeat motifs or generate new repeats. NRUBME invoking a non-Watson-Crick base pair explains insertions that initiate mononucleotide repeats. Furthermore, NRUBME successfully interprets many inexplicable human di- to tetranucleotide repeat generations. This study provides the first evidence of SSR variations expediting phage divergence, and enables insights into the events and mechanisms of genome evolution. NRUBME allows us to emulate natural evolution to design indels for various applications.

  4. Large sequence divergence among mitochondrial DNA genotypes within populations of eastern African black-backed jackals.

    PubMed

    Wayne, R K; Meyer, A; Lehman, N; Van Valkenburgh, B; Kat, P W; Fuller, T K; Girman, D; O'Brien, S J

    1990-03-01

    In discussions about the relative rate of molecular evolution, intraspecific variability in rate is rarely considered. An underlying assumption is that intraspecific sequence differences are small, and thus variations in rate would be difficult to detect or would not affect comparisons among distantly related taxa. However, several studies on mammalian mitochondrial DNA (mtDNA) have revealed considerable intraspecific sequence divergence. In this report, we test for differences in the rate of intraspecific evolution by comparing mtDNA sequences, as inferred from restriction site polymorphisms and direct sequencing, between mtDNA genotypes of the eastern African black-backed jackal, Canis mesomelas elongae, and those of two other sympatric jackal species. Our results are unusual for several reasons. First, mtDNA sequence divergence within several contiguous black-backed jackal populations is large (8.0%). Previous intraspecific studies of terrestrial mammals have generally found values of less than 5% within a single population, with larger divergence values most often occurring among mtDNA genotypes from geographically distant or isolated localities. Second, only 4 mtDNA genotypes were present in our sample of 64 jackals. The large sequence divergence observed among these mtDNA genotypes suggests there should be many more genotypes of intermediate sequence divergence if they had evolved in sympatry. Finally, estimates of the rate of mtDNA sequence evolution differ by approximately 2- to 4-fold among black-backed jackal mtDNA genotypes, thus indicating a substantial heterogeneity in the rate of sequence evolution. The results are difficult to reconcile with ideas of a constant molecular clock based on random fixation of selectively neutral or nearly neutral mtDNA sequence mutations.

  5. Expression Divergence Is Correlated with Sequence Evolution but Not Positive Selection in Conifers.

    PubMed

    Hodgins, Kathryn A; Yeaman, Sam; Nurkowski, Kristin A; Rieseberg, Loren H; Aitken, Sally N

    2016-06-01

    The evolutionary and genomic determinants of sequence evolution in conifers are poorly understood, and previous studies have found only limited evidence for positive selection. Using RNAseq data, we compared gene expression profiles to patterns of divergence and polymorphism in 44 seedlings of lodgepole pine (Pinus contorta) and 39 seedlings of interior spruce (Picea glauca × engelmannii) to elucidate the evolutionary forces that shape their genomes and their plastic responses to abiotic stress. We found that rapidly diverging genes tend to have greater expression divergence, lower expression levels, reduced levels of synonymous site diversity, and longer proteins than slowly diverging genes. Similar patterns were identified for the untranslated regions, but with some exceptions. We found evidence that genes with low expression levels had a larger fraction of nearly neutral sites, suggesting a primary role for negative selection in determining the association between evolutionary rate and expression level. There was limited evidence for differences in the rate of positive selection among genes with divergent versus conserved expression profiles and some evidence supporting relaxed selection in genes diverging in expression between the species. Finally, we identified a small number of genes that showed evidence of site-specific positive selection using divergence data alone. However, estimates of the proportion of sites fixed by positive selection (α) were in the range of other plant species with large effective population sizes suggesting relatively high rates of adaptive divergence among conifers. PMID:26873578

  6. Conservation patterns in different functional sequence categories of divergent Drosophila species

    SciTech Connect

    Papatsenko, Dmitri; Kislyuk, Andrey; Levine, Michael; Dubchak, Inna

    2005-10-01

    We have explored the distributions of fully conserved ungapped blocks in genome-wide pairwise alignments of recently completed species of Drosophila: D. yakuba, D. ananassae, D. pseudoobscura, D. virilis and D. mojavensis. Based on these distributions we have found that nearly every functional sequence category possesses its own distinctive conservation pattern, sometimes independent of the overall sequence conservation level. In the coding and regulatory regions, the ungapped blocks were longer than in introns, UTRs and non-functional sequences. At the same time, the blocks in the coding regions carried a 3N+2 signature characteristic of synonymous substitutions in the 3rd codon positions. Larger block sizes in transcription regulatory regions can be explained by the presence of conserved arrays of binding sites for transcription factors. We also have shown that the longest ungapped blocks, or 'ultraconserved' sequences, are associated with specific gene groups, including those encoding ion channels and components of the cytoskeleton. We discussed how restrained conservation patterns may help in mapping functional sequence categories and improving genome annotation.
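
    A fully conserved ungapped block is a maximal run of alignment columns in which both sequences carry the same base and neither has a gap. When substitutions fall only on 3rd codon positions, the conserved run between two substituted positions has length 3N+2. The sketch below shows one way to extract block lengths and measure that signature; the function names and the gap character are assumptions for illustration, not the authors' code.

```python
def conserved_block_lengths(seq_a, seq_b, gap="-"):
    """Lengths of maximal runs of identical, ungapped alignment columns."""
    lengths, run = [], 0
    for a, b in zip(seq_a, seq_b):
        if a == b and a != gap:
            run += 1
        else:
            if run:
                lengths.append(run)
            run = 0
    if run:
        lengths.append(run)
    return lengths

def fraction_3n_plus_2(lengths):
    """Share of blocks whose length leaves remainder 2 when divided by 3."""
    if not lengths:
        return 0.0
    return sum(1 for n in lengths if n % 3 == 2) / len(lengths)

# Toy aligned pair: one mismatch and one gap split the alignment into blocks.
blocks = conserved_block_lengths("ACGTACGT", "ACTTAC-T")
```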

  7. Genomic islands of divergence in hybridizing Heliconius butterflies identified by large-scale targeted sequencing

    PubMed Central

    Nadeau, Nicola J.; Whibley, Annabel; Jones, Robert T.; Davey, John W.; Dasmahapatra, Kanchon K.; Baxter, Simon W.; Quail, Michael A.; Joron, Mathieu; ffrench-Constant, Richard H.; Blaxter, Mark L.; Mallet, James; Jiggins, Chris D.

    2012-01-01

    Heliconius butterflies represent a recent radiation of species, in which wing pattern divergence has been implicated in speciation. Several loci that control wing pattern phenotypes have been mapped and two were identified through sequencing. These same gene regions play a role in adaptation across the whole Heliconius radiation. Previous studies of population genetic patterns at these regions have sequenced small amplicons. Here, we use targeted next-generation sequence capture to survey patterns of divergence across these entire regions in divergent geographical races and species of Heliconius. This technique was successful both within and between species for obtaining high coverage of almost all coding regions and sufficient coverage of non-coding regions to perform population genetic analyses. We find major peaks of elevated population differentiation between races across hybrid zones, which indicate regions under strong divergent selection. These ‘islands’ of divergence appear to be more extensive between closely related species, but there is less clear evidence for such islands between more distantly related species at two further points along the ‘speciation continuum’. We also sequence fosmid clones across these regions in different Heliconius melpomene races. We find no major structural rearrangements but many relatively large (greater than 1 kb) insertion/deletion events (including gain/loss of transposable elements) that are variable between races. PMID:22201164

  8. Sequence divergence in two tandemly located pilin genes of Eikenella corrodens.

    PubMed Central

    Tønjum, T; Weir, S; Bøvre, K; Progulske-Fox, A; Marrs, C F

    1993-01-01

    Eikenella corrodens normally inhabits the human respiratory and gastrointestinal tracts but is frequently the cause of abscesses at various sites. Using the N-terminal portion of the Moraxella nonliquefaciens pilin gene as a hybridization probe, we cloned two tandemly located pilin genes of E. corrodens 31745, ecpC and ecpD, and expressed the two pilin genes separately in Escherichia coli. A comparison of the predicted amino acid sequences of E. corrodens 31745 EcpC and EcpD revealed considerable divergence between the sequences of these two pilins and even less similarity to EcpA and EcpB of E. corrodens type strain ATCC 23834. EcpC from E. corrodens 31745 displayed high degrees of homology to the pilins of Neisseria gonorrhoeae and Pseudomonas aeruginosa. EcpD from E. corrodens 31745 showed the highest homologies with the pilin of one of the three P. aeruginosa classes, whereas EcpA and EcpB of strain ATCC 23834 most closely resemble Moraxella bovis pilins. These findings raise interesting questions about potential genetic transfer between different bacterial species, as opposed to convergent evolution. PMID:8478080

  9. Divergence between samples of chimpanzee and human DNA sequences is 5%, counting indels

    PubMed Central

    Britten, Roy J.

    2002-01-01

    Five chimpanzee bacterial artificial chromosome (BAC) sequences (described in GenBank) have been compared with the best matching regions of the human genome sequence to assay the amount and kind of DNA divergence. The conclusion is that the old saw that we share 98.5% of our DNA sequence with chimpanzee is probably in error. For this sample, a better estimate would be that 95% of the base pairs are exactly shared between chimpanzee and human DNA. In this sample of 779 kb, the divergence due to base substitution is 1.4%, and there is an additional 3.4% difference due to the presence of indels. The gaps in alignment are present in about equal amounts in the chimp and human sequences. They occur equally in repeated and nonrepeated sequences, as detected by repeatmasker (http://ftp.genome.washington.edu/RM/RepeatMasker.html). PMID:12368483
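
    The headline figure follows from adding the two components quoted in the abstract: 1.4% substitutions plus 3.4% indels gives 4.8%, or roughly 5%, leaving about 95% exactly shared. The sketch below reproduces that arithmetic and shows one simple way to split divergence into the two components for an aligned pair. Counting per alignment column is an assumption made for illustration and may differ from the normalisation used in the paper.

```python
def divergence_counting_indels(seq_a, seq_b, gap="-"):
    """Return (substitution_fraction, indel_fraction) over aligned columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    subs = gaps = cols = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap and b == gap:
            continue  # column carries no information for this pair
        cols += 1
        if a == gap or b == gap:
            gaps += 1
        elif a != b:
            subs += 1
    if cols == 0:
        raise ValueError("no comparable columns")
    return subs / cols, gaps / cols

# Figures quoted in the abstract (percent of the 779 kb sample).
substitution_pct = 1.4
indel_pct = 3.4
total_divergence_pct = substitution_pct + indel_pct
shared_pct = 100.0 - total_divergence_pct
```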

  10. Patterns of Y and X chromosome DNA sequence divergence during the Felidae radiation.

    PubMed Central

    Pecon Slattery, J; O'Brien, S J

    1998-01-01

    The 37 species of modern cats have evolved from approximately eight phylogenetic lineages within the past 10 to 15 million years. The Felidae family has been described with multiple measures of morphologic and molecular evolutionary methods that serve as a framework for tracking gene divergence during brief evolutionary periods. In this report, we compare the mode and tempo of evolution of noncoding sequences of a large intron within Zfy (783 bp) and Zfx (854 bp), homologous genes located on the felid Y and X chromosomes, respectively. Zfy sequence variation evolves at about twice the rate of Zfx, and both gene intron sequences track feline hierarchical topologies accurately. As homoplasies are infrequent in patterns of nucleotide substitution, the Y chromosome sequence displays a remarkable degree of phylogenetic consistency among cat species and provides a highly informative glimpse of divergence of sex chromosome sequences in Felidae. PMID:9539439

  11. Evaluation of intra- and interspecific divergence of satellite DNA sequences by nucleotide frequency calculation and pairwise sequence comparison

    PubMed Central

    2003-01-01

    Satellite DNA sequences are known to be highly variable and to have been subjected to concerted evolution that homogenizes member sequences within species. We have analyzed the mode of evolution of satellite DNA sequences in four fishes from the genus Diplodus by calculating the nucleotide frequency of the sequence array and the phylogenetic distances between member sequences. Calculation of nucleotide frequency and pairwise sequence comparison enabled us to characterize the divergence among member sequences in this satellite DNA family. The results suggest that the evolutionary rate of satellite DNA in D. bellottii is about two-fold greater than the average of the other three fishes, and that the sequence homogenization event occurred in D. puntazzo more recently than in the others. The procedures described here are effective to characterize mode of evolution of satellite DNA. PMID:12734555

  12. Evaluation of intra- and interspecific divergence of satellite DNA sequences by nucleotide frequency calculation and pairwise sequence comparison.

    PubMed

    Kato, Mikio

    2003-01-01

    Satellite DNA sequences are known to be highly variable and to have been subjected to concerted evolution that homogenizes member sequences within species. We have analyzed the mode of evolution of satellite DNA sequences in four fishes from the genus Diplodus by calculating the nucleotide frequency of the sequence array and the phylogenetic distances between member sequences. Calculation of nucleotide frequency and pairwise sequence comparison enabled us to characterize the divergence among member sequences in this satellite DNA family. The results suggest that the evolutionary rate of satellite DNA in D. bellottii is about two-fold greater than the average of the other three fishes, and that the sequence homogenization event occurred in D. puntazzo more recently than in the others. The procedures described here are effective to characterize mode of evolution of satellite DNA. PMID:12734555
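
    The two calculations named in the title can be illustrated with a short sketch. It assumes the member sequences are already aligned to equal length and uses the uncorrected proportion of differing sites (p-distance) as the pairwise measure; the distance measure actually used in the study is not stated in the abstract, so this is a stand-in.

```python
from collections import Counter
from itertools import combinations

def column_frequencies(aligned):
    """Per-column nucleotide frequencies for equal-length aligned sequences."""
    n = len(aligned)
    return [
        {base: count / n for base, count in Counter(col).items()}
        for col in zip(*aligned)
    ]

def p_distance(a, b, gap="-"):
    """Proportion of differing sites among columns where neither has a gap."""
    pairs = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)

def mean_pairwise_distance(aligned):
    """Average p-distance over all pairs of member sequences."""
    dists = [p_distance(a, b) for a, b in combinations(aligned, 2)]
    return sum(dists) / len(dists)

# Toy satellite repeat units (hypothetical data).
repeats = ["ACGT", "ACGA", "ACGT"]
freqs = column_frequencies(repeats)
```

    A low mean pairwise distance within a species alongside a high distance between species is the pattern expected under concerted evolution.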

  13. Alignment-free microbial phylogenomics under scenarios of sequence divergence, genome rearrangement and lateral genetic transfer

    PubMed Central

    Bernard, Guillaume; Chan, Cheong Xin; Ragan, Mark A.

    2016-01-01

    Alignment-free (AF) approaches have recently been highlighted as alternatives to methods based on multiple sequence alignment in phylogenetic inference. However, the sensitivity of AF methods to genome-scale evolutionary scenarios is little known. Here, using simulated microbial genome data we systematically assess the sensitivity of nine AF methods to three important evolutionary scenarios: sequence divergence, lateral genetic transfer (LGT) and genome rearrangement. Among these, AF methods are most sensitive to the extent of sequence divergence, less sensitive to low and moderate frequencies of LGT, and most robust against genome rearrangement. We describe the application of AF methods to three well-studied empirical genome datasets, and introduce a new application of the jackknife to assess node support. Our results demonstrate that AF phylogenomics is computationally scalable to multi-genome data and can generate biologically meaningful phylogenies and insights into microbial evolution. PMID:27363362
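
    As a rough illustration of what an alignment-free comparison looks like, the sketch below computes a k-mer based Jaccard distance between two sequences. It is a generic example and is not claimed to be one of the nine methods assessed in the study; the default k is arbitrary.

```python
from collections import Counter

def kmer_profile(seq, k):
    """Count every overlapping k-mer in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def jaccard_distance(seq_a, seq_b, k=4):
    """1 - |shared k-mers| / |all k-mers| for the two k-mer sets."""
    a, b = set(kmer_profile(seq_a, k)), set(kmer_profile(seq_b, k))
    if not a and not b:
        return 0.0
    return 1.0 - len(a & b) / len(a | b)
```

    Because only k-mer content is compared, reordering large segments changes few k-mers, which is consistent with the robustness to genome rearrangement reported above.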

  14. Using multi-locus allelic sequence data to estimate genetic divergence among four Lilium (Liliaceae) cultivars.

    PubMed

    Shahin, Arwa; Smulders, Marinus J M; van Tuyl, Jaap M; Arens, Paul; Bakker, Freek T

    2014-01-01

    Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among gene tree conflicts). Twenty six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium. PMID:25368628

  15. Coding sequence divergence between two closely related plant species: Arabidopsis thaliana and Brassica rapa ssp. pekinensis.

    PubMed

    Tiffin, Peter; Hahn, Matthew W

    2002-06-01

    To characterize the coding-sequence divergence of closely related genomes, we compared DNA sequence divergence between sequences from a Brassica rapa ssp. pekinensis EST library isolated from flower buds and genomic sequences from Arabidopsis thaliana. The specific objectives were (i) to determine the distribution of and relationship between K(a) and K(s), (ii) to identify genes with the lowest and highest K(a): K(s) values, and (iii) to evaluate how codon usage has diverged between two closely related species. We found that the distribution of K(a): K(s) was unimodal, and that substitution rates were more variable at nonsynonymous than synonymous sites, and detected no evidence that K(a) and K(s) were positively correlated. Several genes had K(a): K(s) values equal to or near zero, as expected for genes that have evolved under strong selective constraint. In contrast, there were no genes with K(a): K(s) >1 and thus we found no strong evidence that any of the 218 sequences we analyzed have evolved in response to positive selection. We detected a stronger codon bias but a lower frequency of GC at synonymous sites in A. thaliana than B. rapa. Moreover, there has been a shift in the profile of most commonly used synonymous codons since these two species diverged from one another. This shift in codon usage may have been caused by stronger selection acting on codon usage or by a shift in the direction of mutational bias in the B. rapa phylogenetic lineage.
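
    The interpretation of K(a):K(s) used in this abstract (values near zero indicate strong constraint, values above 1 suggest positive selection) can be written as a small screening helper. The cutoff for "near zero" is a hypothetical value chosen for illustration, and estimating K(a) and K(s) themselves is outside the scope of this sketch.

```python
def ka_ks_ratio(ka, ks):
    """Return Ka/Ks, or None when Ks is zero and the ratio is undefined."""
    if ka < 0 or ks < 0:
        raise ValueError("substitution rates must be non-negative")
    return None if ks == 0 else ka / ks

def classify(ratio, constraint_cutoff=0.05):
    """Label a gene by its Ka/Ks ratio (cutoff is an illustrative assumption)."""
    if ratio is None:
        return "undefined"
    if ratio > 1.0:
        return "candidate positive selection"
    if ratio <= constraint_cutoff:
        return "strong constraint"
    return "intermediate"
```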

  16. Mitotic crossovers between diverged sequences are regulated by mismatch repair proteins in Saccharomyces cerevisiae.

    PubMed Central

    Datta, A; Adjiri, A; New, L; Crouse, G F; Jinks Robertson, S

    1996-01-01

    Mismatch repair systems correct replication- and recombination-associated mispaired bases and influence the stability of simple repeats. These systems thus serve multiple roles in maintaining genetic stability in eukaryotes, and human mismatch repair defects have been associated with hereditary predisposition to cancer. In prokaryotes, mismatch repair systems also have been shown to limit recombination between diverged (homeologous) sequences. We have developed a unique intron-based assay system to examine the effects of yeast mismatch repair genes (PMS1, MSH2, and MSH3) on crossovers between homeologous sequences. We find that the apparent antirecombination effects of mismatch repair proteins in mitosis are related to the degree of substrate divergence. Defects in mismatch repair can elevate homeologous recombination between 91% homologous substrates as much as 100-fold while having only modest effects on recombination between 77% homologous substrates. These observations have implications for genome stability and general mechanisms of recombination in eukaryotes. PMID:8622653

  17. Novel non-parametric models to estimate evolutionary rates and divergence times from heterochronous sequence data

    PubMed Central

    2014-01-01

    Background: Early methods for estimating divergence times from gene sequence data relied on the assumption of a molecular clock. More sophisticated methods were created to model rate variation and used auto-correlation of rates, local clocks, or the so-called “uncorrelated relaxed clock” where substitution rates are assumed to be drawn from a parametric distribution. In the case of Bayesian inference methods the impact of the prior on branching times is not clearly understood, and if the amount of data is limited the posterior could be strongly influenced by the prior. Results: We develop a maximum likelihood method – Physher – that uses local or discrete clocks to estimate evolutionary rates and divergence times from heterochronous sequence data. Using two empirical data sets we show that our discrete clock estimates are similar to those obtained by other methods, and that Physher outperformed some methods in the estimation of the root age of an influenza virus data set. A simulation analysis suggests that Physher can outperform a Bayesian method when the real topology contains two long branches below the root node, even when evolution is strongly clock-like. Conclusions: These results suggest it is advisable to use a variety of methods to estimate evolutionary rates and divergence times from heterochronous sequence data. Physher and the associated data sets used here are available online at http://code.google.com/p/physher/. PMID:25055743

  18. Discovery of a divergent HPIV4 from respiratory secretions using second and third generation metagenomic sequencing

    PubMed Central

    Alquezar-Planas, David E.; Mourier, Tobias; Bruhn, Christian A. W.; Hansen, Anders J.; Vitcetz, Sarah Nathalie; Mørk, Søren; Gorodkin, Jan; Nielsen, Hanne Abel; Guo, Yan; Sethuraman, Anand; Paxinos, Ellen E.; Shan, Tongling; Delwart, Eric L.; Nielsen, Lars P.

    2013-01-01

    Molecular detection of viruses has been aided by high-throughput sequencing, permitting the genomic characterization of emerging strains. In this study, we comprehensively screened 500 respiratory secretions from children with upper and/or lower respiratory tract infections for viral pathogens. The viruses detected are described, including a divergent human parainfluenza virus type 4 from GS FLX pyrosequencing of 92 specimens. Complete full-genome characterization of the virus followed, using Single Molecule, Real-Time (SMRT) sequencing. Subsequent “primer walking” combined with Sanger sequencing validated the RS platform's utility in viral sequencing from complex clinical samples. Comparative genomics reveals that the divergent strain clusters with the only completely sequenced HPIV4a subtype. However, it also exhibits various structural features present in one of the HPIV4b reference strains, opening questions regarding the lifecycle of and evolutionary relationships among these viruses. Clinical data from patients infected with the strain, as well as viral prevalence estimates using real-time PCR, are also described. PMID:24002378

  19. Chip-based sequencing nucleic acids

    SciTech Connect

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  20. Evolutionary Analysis of Sequence Divergence and Diversity of Duplicate Genes in Aspergillus fumigatus

    PubMed Central

    Yang, Ence; Hulse, Amanda M.; Cai, James J.

    2012-01-01

    Gene duplication as a major source of novel genetic material plays an important role in evolution. In this study, we focus on duplicate genes in Aspergillus fumigatus, a ubiquitous filamentous fungus causing life-threatening human infections. We characterize the extent and evolutionary patterns of the duplicate genes in the genome of A. fumigatus. Our results show that A. fumigatus contains a large amount of duplicate genes with pronounced sequence divergence between two copies, and approximately 10% of them diverge asymmetrically, i.e. two copies of a duplicate gene pair diverge at significantly different rates. We use a Bayesian approach of the McDonald-Kreitman test to infer distributions of selective coefficients γ(=2Nes) and find that (1) the values of γ for two copies of duplicate genes co-vary positively and (2) the average γ for the two copies differs between genes from different gene families. This analysis highlights the usefulness of combining divergence and diversity data in studying the evolution of duplicate genes. Taken together, our results provide further support and refinement to the theories of gene duplication. Through characterizing the duplicate genes in the genome of A. fumigatus, we establish a computational framework, including parameter settings and methods, for comparative study of genetic redundancy and gene duplication between different fungal species. PMID:23225993

  1. Distinguishing proteins from arbitrary amino acid sequences.

    PubMed

    Yau, Stephen S-T; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criterion for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  2. The amino-acid sequence of kangaroo pancreatic ribonuclease.

    PubMed

    Gaastra, W; Welling, G W; Beintema, J J

    1978-05-01

    Red kangaroo (Macropus rufus) ribonuclease was isolated from pancreatic tissue by affinity chromatography. The amino acid sequence was determined by automatic sequencing of overlapping large fragments and by analysis of shorter peptides obtained by digestion with a number of proteolytic enzymes. The polypeptide chain consists of 122 amino acid residues. Compared to other ribonucleases, the N-terminal residue and residue 114 are deleted. In other pancreatic ribonucleases position 114 is occupied by a cis proline residue in an external loop at the surface of the molecule. Other remarkable substitutions are the presence of a tyrosine residue at position 123 instead of a serine which forms a hydrogen bond with the pyrimidine ring of a nucleotide substrate, and a number of hydrophobic-hydrophilic interchanges in the sequence 51-55, which forms part of an alpha-helix in bovine ribonuclease and exhibits few substitutions in the placental mammals. Kangaroo ribonuclease contains no carbohydrate, although the enzyme possesses a recognition site for carbohydrate attachment in the sequence Asn-Val-Thr (62-64). The enzyme differs at about 35-40% of the positions from all other mammalian pancreatic ribonucleases sequenced to date, which is in agreement with the early divergence between the marsupials and the placental mammals. From fragmentary data a tentative sequence of red-necked wallaby (Macropus rufogriseus) pancreatic ribonuclease has been derived. Eight differences with the kangaroo sequence were found.

  3. The amino-acid sequence of kangaroo pancreatic ribonuclease.

    PubMed

    Gaastra, W; Welling, G W; Beintema, J J

    1978-05-01

    Red kangaroo (Macropus rufus) ribonuclease was isolated from pancreatic tissue by affinity chromatography. The amino acid sequence was determined by automatic sequencing of overlapping large fragments and by analysis of shorter peptides obtained by digestion with a number of proteolytic enzymes. The polypeptide chain consists of 122 amino acid residues. Compared to other ribonucleases, the N-terminal residue and residue 114 are deleted. In other pancreatic ribonucleases position 114 is occupied by a cis proline residue in an external loop at the surface of the molecule. Other remarkable substitutions are the presence of a tyrosine residue at position 123 instead of a serine which forms a hydrogen bond with the pyrimidine ring of a nucleotide substrate, and a number of hydrophobic-hydrophilic interchanges in the sequence 51-55, which forms part of an alpha-helix in bovine ribonuclease and exhibits few substitutions in the placental mammals. Kangaroo ribonuclease contains no carbohydrate, although the enzyme possesses a recognition site for carbohydrate attachment in the sequence Asn-Val-Thr (62-64). The enzyme differs at about 35-40% of the positions from all other mammalian pancreatic ribonucleases sequenced to date, which is in agreement with the early divergence between the marsupials and the placental mammals. From fragmentary data a tentative sequence of red-necked wallaby (Macropus rufogriseus) pancreatic ribonuclease has been derived. Eight differences with the kangaroo sequence were found. PMID:658039

  4. Sequence divergence and chromosomal rearrangements during the evolution of human pseudoautosomal genes and their mouse homologs

    SciTech Connect

    Ellison, J.; Li, X.; Francke, U.

    1994-09-01

    The pseudoautosomal region (PAR) is an area of sequence identity between the X and Y chromosomes and is important for mediating X-Y pairing during male meiosis. Of the seven genes assigned to the human PAR, none of the mouse homologs have been isolated by a cross-hybridization strategy. Two of these homologs, Csfgmra and Il3ra, have been isolated using a functional assay for the gene products. These genes are quite different in sequence from their human homologs, showing only 60-70% sequence similarity. The Csfgmra gene has been found to further differ from its human homolog in being isolated not on the sex chromosomes, but on a mouse autosome (chromosome 19). Using a mouse-hamster somatic cell hybrid mapping panel, we have mapped the Il3ra gene to yet another mouse autosome, chromosome 14. Attempts to clone the mouse homolog of the ANT3 locus resulted in the isolation of two related genes, Ant1 and Ant2, but failed to yield the Ant3 gene. Southern blot analysis of the ANT/Ant genes showed the Ant1 and Ant2 sequences to be well-conserved among all of a dozen mammals tested. In contrast, the ANT3 gene only showed hybridization to non-rodent mammals, suggesting it is either greatly divergent or has been deleted in the rodent lineage. Similar experiments with other human pseudoautosomal probes likewise showed a lack of hybridization to rodent sequences. The results show a definite trend of extensive divergence of pseudoautosomal sequences in addition to chromosomal rearrangements involving X;autosome translocations and perhaps gene deletions. Such observations have interesting implications regarding the evolution of this important region of the sex chromosomes.

  5. Complete genome sequence of a divergent strain of Japanese yam mosaic virus from China

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A novel strain of Japanese yam mosaic virus (JYMV-CN) was identified in a yam plant with foliar mottle symptoms in China. The complete genomic sequence of JYMV-CN was determined. Its genomic sequence of 9701 nucleotides encodes a polyprotein of 3247 amino acids. Its organization was virtually identi...

  6. Sequence, expression divergence, and complementation of homologous ALCATRAZ loci in Brassica napus.

    PubMed

    Hua, Shuijin; Shamsi, Imran Haider; Guo, Yuan; Pak, Haksong; Chen, Mingxun; Shi, Congguang; Meng, Huabing; Jiang, Lixi

    2009-08-01

    The genomic era provides new perspectives in understanding polyploidy evolution, mostly on the genome-wide scale. In this paper, we show the sequence and expression divergence between the homologous ALCATRAZ (ALC) loci in Brassica napus, responsible for silique dehiscence. We cloned two homologous ALC loci, namely BnaC.ALC.a and BnaA.ALC.a in B. napus. Driven by the 35S promoter, both loci complemented the alc mutation of Arabidopsis thaliana, yet only the expression of BnaC.ALC.a was detectable in the siliques of B. napus. Sequence alignment indicated that BnaC.ALC.a and BolC.ALC.a, or BnaA.ALC.a and BraA.ALC.a, possess a high level of similarity. Understanding the sequence and expression divergence among homologous loci of a gene is important for effective gene manipulation and for TILLING (or ECOTILLING) analysis of allelic DNA variation at a given locus. PMID:19504267

  7. Drosophila bloom helicase maintains genome integrity by inhibiting recombination between divergent DNA sequences

    PubMed Central

    Kappeler, Michael; Kranz, Elisabeth; Woolcock, Katrina; Georgiev, Oleg; Schaffner, Walter

    2008-01-01

    DNA double strand breaks (DSB) can be repaired either via a sequence independent joining of DNA ends or via homologous recombination. We established a detection system in Drosophila melanogaster to investigate the impact of sequence constraints on the usage of the homology based DSB repair via single strand annealing (SSA), which leads to recombination between direct repeats with concomitant loss of one repeat copy. First of all, we find the SSA frequency to be inversely proportional to the spacer length between the repeats, for spacers up to 2.4 kb in length. We further show that SSA between divergent repeats (homeologous SSA) is suppressed in cell cultures and in vivo in a sensitive manner, recognizing sequence divergences smaller than 0.5%. Finally, we demonstrate that the suppression of homeologous SSA depends on the Bloom helicase (Blm), encoded by the Drosophila gene mus309. Suppression of homeologous recombination is a novel function of Blm in ensuring genomic integrity, not described to date in mammalian systems. Unexpectedly, distinct from its function in Saccharomyces cerevisiae, the mismatch repair factor Msh2 encoded by spel1 does not suppress homeologous SSA in Drosophila. PMID:18978019

  8. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  9. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  10. Concerted evolution at a multicopy locus in the protozoan parasite Theileria parva: extreme divergence of potential protein-coding sequences.

    PubMed Central

    Bishop, R; Musoke, A; Morzaria, S; Sohanpal, B; Gobright, E

    1997-01-01

    Concerted evolution of multicopy gene families in vertebrates is recognized as an important force in the generation of biological novelty but has not been documented for the multicopy genes of protozoa. A multicopy locus, Tpr, which consists of tandemly arrayed open reading frames (ORFs) containing several repeated elements has been described for Theileria parva. Herein we show that probes derived from the 5'/N-terminal ends of ORFs in the genomic DNAs of T. parva Uganda (1,108 codons) and Boleni (699 codons) hybridized with multicopy sequences in homologous DNA but did not detect similar sequences in the DNA of 14 heterologous T. parva stocks and clones. The probe sequences were, however, protein coding according to predictive algorithms and codon usage. The 3'/C-terminal ends of the Uganda and Boleni ORFs exhibited 75% similarity and identity, respectively, to the previously identified Tpr1 and Tpr2 repetitive elements of T. parva Muguga. Tpr1-homologous sequences were detected in two additional species of Theileria. Eight different Tpr1-homologous transcripts were present in piroplasm mRNA from a single T. parva Muguga-infected animal. The Tpr1 and Tpr2 amino acid sequences contained six predicted membrane-associated segments. The ratio of synonymous to nonsynonymous substitutions indicates that Tpr1 evolves like protein-encoding DNA. The previously determined nucleotide sequence of the gene encoding the p67 antigen is completely identical in T. parva Muguga, Boleni, and Uganda, including the third base in codons. The data suggest that concerted evolution can lead to the radical divergence of coding sequences and that this can be a mechanism for the generation of novel genes. PMID:9032293

  11. Divergence of function in sequence-related groups of Escherichia coli proteins.

    PubMed

    Nahum, L A; Riley, M

    2001-08-01

    The most prominent mechanism of molecular evolution is believed to have been duplication and divergence of genes. Proteins that belong to sequence-related groups in any one organism are candidates to have emerged from such a process and to share a common ancestor. Groups of proteins in Escherichia coli having sequence similarity are mostly composed of proteins with closely related function, but some groups comprise proteins with unrelated functions. In order to understand how function can change while sequences remain similar, we have examined some of these groups in detail. The enzymes analyzed in this work include representatives of amidotransferases, phosphotransferases, decarboxylases, and others. Most sequence-related groups contain enzymes that are in the same classes of Enzyme Commission (EC) numbers. We have concentrated on groups that are heterogeneous in that respect, and also on groups containing more than one enzyme of any pathway. We find that although the EC number may differ, the reaction chemistry of these sequence-related proteins is the same or very similar. Some of these families illustrate how diversification has taken place in evolution, using common features of either reaction chemistry or ligand specificity, or both, to create catalysts for different kinds of biochemical reactions. This information has relevance to the area of functional genomics in which the activities of gene products of unknown reading frames are attributed by analogy to the functions of sequence-related proteins of known function.

  12. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

    PubMed Central

    Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

    2015-01-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

  13. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

    PubMed

    Gordon, Kacy L; Arthur, Robert K; Ruvinsky, Ilya

    2015-05-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.

  14. Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms.

    PubMed

    Nei, M; Xu, P; Glazko, G

    2001-02-27

    When many protein sequences are available for estimating the time of divergence between two species, it is customary to estimate the time for each protein separately and then use the average for all proteins as the final estimate. However, it can be shown that this estimate generally has an upward bias, and that an unbiased estimate is obtained by using distances based on concatenated sequences. We have shown that two concatenation-based distances, i.e., average gamma distance weighted with sequence length (d(2)) and multiprotein gamma distance (d(3)), generally give more satisfactory results than other concatenation-based distances. Using these two distance measures for 104 protein sequences, we estimated the time of divergence between mice and rats to be approximately 33 million years ago. Similarly, the time of divergence between humans and rodents was estimated to be approximately 96 million years ago. We also investigated the dependency of time estimates on statistical methods and various assumptions made by using sequence data from eubacteria, protists, plants, fungi, and animals. Our best estimates of the times of divergence between eubacteria and eukaryotes, between protists and other eukaryotes, and between plants, fungi, and animals were 3, 1.7, and 1.3 billion years ago, respectively. However, estimates of ancient divergence times are subject to a substantial amount of error caused by uncertainty of the molecular clock, horizontal gene transfer, errors in sequence alignments, etc.
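
    The contrast drawn above between per-protein and concatenation-based distances can be sketched in a few lines. This is a minimal illustration, not the authors' procedure. It uses the standard gamma-corrected amino acid distance d = a[(1 - p)^(-1/a) - 1], where p is the proportion of differing sites. The gamma shape parameter a = 2.0, the function names, and the example site counts are assumptions made for illustration. The pooled function treats the concatenated alignment as one sequence, which is only one plausible reading of a multiprotein distance.

```python
def gamma_distance(p, a=2.0):
    """Gamma-corrected distance for a proportion p of differing sites.

    a is the gamma shape parameter; 2.0 is an assumed, illustrative value.
    """
    if not 0.0 <= p < 1.0:
        raise ValueError("p must satisfy 0 <= p < 1")
    return a * ((1.0 - p) ** (-1.0 / a) - 1.0)


def length_weighted_distance(proteins, a=2.0):
    """Average of per-protein gamma distances, weighted by sequence length.

    proteins is a list of (n_sites, n_differences) tuples.
    """
    total_sites = sum(n for n, _ in proteins)
    weighted_sum = sum(n * gamma_distance(k / n, a) for n, k in proteins)
    return weighted_sum / total_sites


def pooled_distance(proteins, a=2.0):
    """Gamma distance computed once on the pooled (concatenated) sites."""
    total_sites = sum(n for n, _ in proteins)
    total_diffs = sum(k for _, k in proteins)
    return gamma_distance(total_diffs / total_sites, a)


# Hypothetical alignment summary: (aligned sites, differing sites) per protein.
example = [(100, 10), (300, 60)]
print(length_weighted_distance(example))
print(pooled_distance(example))
```

    Because the correction is convex in p, the length-weighted average of per-protein distances is never smaller than the distance computed from the pooled proportion. The two values coincide only when every protein has the same p.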

  15. Oil palm genome sequence reveals divergence of interfertile species in Old and New worlds.

    PubMed

    Singh, Rajinder; Ong-Abdullah, Meilina; Low, Eng-Ti Leslie; Manaf, Mohamad Arif Abdul; Rosli, Rozana; Nookiah, Rajanaidu; Ooi, Leslie Cheng-Li; Ooi, Siew-Eng; Chan, Kuang-Lim; Halim, Mohd Amin; Azizi, Norazah; Nagappan, Jayanthi; Bacher, Blaire; Lakey, Nathan; Smith, Steven W; He, Dong; Hogan, Michael; Budiman, Muhammad A; Lee, Ernest K; DeSalle, Rob; Kudrna, David; Goicoechea, Jose Luis; Wing, Rod A; Wilson, Richard K; Fulton, Robert S; Ordway, Jared M; Martienssen, Robert A; Sambanthamurthi, Ravigadevi

    2013-08-15

    Oil palm is the most productive oil-bearing crop. Although it is planted on only 5% of the total world vegetable oil acreage, palm oil accounts for 33% of vegetable oil and 45% of edible oil worldwide, but increased cultivation competes with dwindling rainforest reserves. We report the 1.8-gigabase (Gb) genome sequence of the African oil palm Elaeis guineensis, the predominant source of worldwide oil production. A total of 1.535 Gb of assembled sequence and transcriptome data from 30 tissue types were used to predict at least 34,802 genes, including oil biosynthesis genes and homologues of WRINKLED1 (WRI1), and other transcriptional regulators, which are highly expressed in the kernel. We also report the draft sequence of the South American oil palm Elaeis oleifera, which has the same number of chromosomes (2n = 32) and produces fertile interspecific hybrids with E. guineensis but seems to have diverged in the New World. Segmental duplications of chromosome arms define the palaeotetraploid origin of palm trees. The oil palm sequence enables the discovery of genes for important traits as well as somaclonal epigenetic alterations that restrict the use of clones in commercial plantings, and should therefore help to achieve sustainability for biofuels and edible oils, reducing the rainforest footprint of this tropical plantation crop.

  16. Genetic divergence between subpopulations of the eastern Pacific goose barnacle Pollicipes elegans: mitochondrial cytochrome c subunit 1 nucleotide sequences.

    PubMed

    Van Syoc, R J

    1994-12-01

    Nucleotide sequence data derived from polymerase chain reaction products from the cytochrome oxidase subunit 1 gene of mitochondrial DNA provide evidence for interrupted gene flow and subsequent genetic divergence between geographically separate subpopulations of the edible goose barnacle, Pollicipes elegans, with a 4400-km latitudinal distribution in the eastern Pacific Ocean. The amphitropical subpopulations of Pollicipes elegans have a net nucleotide sequence divergence of about 1.2%. A range of mutation rates are applied to calculate estimates for the timing of this divergence. The earliest estimated time of divergence agrees with a Pliocene time of general warming in the eastern Pacific. The latest estimated times coincide with the Pleistocene epoch and periods of cooling and warming that could have allowed for a series of expansions and contractions of P. elegans populations in the eastern tropical Pacific. These expansions and contractions may, therefore, represent alternating periods of genetic exchange and isolation of the two populations.
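
    The step of applying a range of mutation rates to a net divergence can be sketched as follows. This is a minimal illustration under a strict molecular clock, where the time since separation is T = d / r, with d the net sequence divergence and r the pairwise divergence rate (the combined rate for both lineages). Only the 1.2% divergence comes from the abstract. The rates below are hypothetical placeholders, not the values used in the study.

```python
def divergence_time_myr(net_divergence, pairwise_rate_per_myr):
    """Time since separation in million years under a strict clock.

    net_divergence: fraction of sites differing between populations.
    pairwise_rate_per_myr: divergence accumulated between the two
        lineages per million years (fraction of sites).
    """
    if pairwise_rate_per_myr <= 0:
        raise ValueError("rate must be positive")
    return net_divergence / pairwise_rate_per_myr


NET_DIVERGENCE = 0.012  # about 1.2%, as reported in the abstract

# Hypothetical pairwise rates (fraction of sites per million years).
ASSUMED_RATES = (0.005, 0.010, 0.020)

for rate in ASSUMED_RATES:
    t = divergence_time_myr(NET_DIVERGENCE, rate)
    print(f"rate {rate:.3f}/Myr -> about {t:.2f} Myr")
```

    Slower assumed rates push the estimate further back in time, which is why a range of rates yields a range of candidate dates rather than a single one.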

  17. The Genome Sequence of Lone Star Virus, a Highly Divergent Bunyavirus Found in the Amblyomma americanum Tick

    PubMed Central

    Swei, Andrea; Russell, Brandy J.; Naccache, Samia N.; Kabre, Beniwende; Veeraraghavan, Narayanan; Pilgard, Mark A.; Johnson, Barbara J. B.; Chiu, Charles Y.

    2013-01-01

    Viruses in the family Bunyaviridae infect a wide range of plant, insect, and animal hosts. Tick-borne bunyaviruses in the Phlebovirus genus, including Severe Fever with Thrombocytopenia Syndrome virus (SFTSV) in China, Heartland virus (HRTV) in the United States, and Bhanja virus in Eurasia and Africa have been associated with acute febrile illness in humans. Here we sought to characterize the growth characteristics and genome of Lone Star virus (LSV), an unclassified bunyavirus originally isolated from the lone star tick Amblyomma americanum. LSV was able to infect both human (HeLa) and monkey (Vero) cells. Cytopathic effects were seen within 72 h in both cell lines; vacuolization was observed in infected Vero, but not HeLa, cells. Viral culture supernatants were examined by unbiased deep sequencing and analysis using an in-house developed rapid computational pipeline for viral discovery, which definitively identified LSV as a phlebovirus. De novo assembly of the full genome revealed that LSV is highly divergent, sharing <61% overall amino acid identity with any other bunyavirus. Despite this sequence diversity, LSV was found by phylogenetic analysis to be part of a well-supported clade that includes members of the Bhanja group viruses, which are most closely related to SFTSV/HRTV. The genome sequencing of LSV is a critical first step in developing diagnostic tools to determine the risk of arbovirus transmission by A. americanum, a tick of growing importance given its expanding geographic range and competence as a disease vector. This study also underscores the power of deep sequencing analysis in rapidly identifying and sequencing the genomes of viruses of potential clinical and public health significance. PMID:23637969

  18. Bovine Parathyroid Hormone: Amino Acid Sequence

    PubMed Central

    Brewer, H. Bryan; Ronan, Rosemary

    1970-01-01

    Bovine parathyroid hormone has been isolated in homogeneous form, and its complete amino acid sequence determined. The bovine hormone is a single chain, 84 amino acids long. It contains amino-terminal alanine, and carboxyl-terminal glutamine. The bovine parathyroid hormone is approximately three times the length of the newly discovered hormone, thyrocalcitonin, whose action is reciprocal to parathyroid hormone. PMID:5275384

  19. Acid stress mediated adaptive divergence in ion channel function during embryogenesis in Rana arvalis

    PubMed Central

    Shu, Longfei; Laurila, Anssi; Räsänen, Katja

    2015-01-01

    Ion channels and pumps are responsible for ion flux in cells, and are key mechanisms mediating cellular function. Many environmental stressors, such as salinity and acidification, are known to severely disrupt ionic balance of organisms thereby challenging fitness of natural populations. Although ion channels can have several vital functions during early life-stages (e.g. embryogenesis), it is currently not known i) how developing embryos maintain proper intracellular conditions when exposed to environmental stress and ii) to what extent environmental stress can drive intra-specific divergence in ion channels. Here we studied the moor frog, Rana arvalis, from three divergent populations to investigate the role of different ion channels and pumps for embryonic survival under acid stress (pH 4 vs 7.5) and whether populations adapted to contrasting acidities differ in the relative role of different ion channel/pumps. We found that ion channels that mediate Ca2+ influx are essential for embryonic survival under acidic pH, and, intriguingly, that populations differ in calcium channel function. Our results suggest that adaptive divergence in embryonic acid stress tolerance of amphibians may in part be mediated by Ca2+ balance. We suggest that ion flux may mediate adaptive divergence of natural populations at early life-stages in the face of environmental stress. PMID:26381453

  20. Assessing divergence time of Spirulida and Sepiida (Cephalopoda) based on hemocyanin sequences.

    PubMed

    Warnke, Kerstin Martina; Meyer, Achim; Ebner, Bettina; Lieb, Bernhard

    2011-02-01

    The phylogenetic position of the mesopelagic decabrachian cephalopod Spirula is still a matter of debate. Since hemocyanin has successfully been used to calibrate a molecular clock for many molluscan species, a molecular clock was calculated based on this gene with special attention to the cephalopod genera Spirula and Sepia. The obtained partial sequence, comprising ca. one third (3567 bp) of the complete gene, is similar to that of Sepia officinalis. The molecular clock was calibrated using the splits of Gastropoda-Cephalopoda (ca. 550 ± 50 mya) and Heterobranchia-Vetigastropoda (ca. 380 ± 10 mya). The resulting hemocyanin-based molecular clock is stable, and the estimated divergence time of Spirulida and Sepiida, some 150 ± 30 million years ago, can be deemed reliable.

  1. AVP2, a sequence-divergent, K(+)-insensitive H(+)-translocating inorganic pyrophosphatase from Arabidopsis.

    PubMed

    Drozdowicz, Y M; Kissinger, J C; Rea, P A

    2000-05-01

    Plant vacuolar H(+)-translocating inorganic pyrophosphatases (V-PPases; EC 3.6.1.1) have been considered to constitute a family of functionally and structurally monotonous intrinsic membrane proteins. Typified by AVP1 (V. Sarafian, Y. Kim, R.J. Poole, P.A. Rea [1992] Proc Natl Acad Sci USA 89: 1775-1779) from Arabidopsis, all characterized plant V-PPases share greater than 84% sequence identity and catalyze K(+)-stimulated H(+) translocation. Here we describe the molecular and biochemical characterization of AVP2 (accession no. AF182813), a sequence-divergent (36% identical) K(+)-insensitive, Ca(2+)-hypersensitive V-PPase active in both inorganic pyrophosphate hydrolysis and H(+) translocation. The differences between AVP2 and AVP1 provide the first indication that plant V-PPases from the same organism fall into two distinct categories. Phylogenetic analyses of these and other V-PPase sequences extend this principle by showing that AVP2, rather than being an isoform of AVP1, is but one representative of a novel category of AVP2-like (type II) V-PPases that coexist with AVP1-like (type I) V-PPases not only in plants, but also in apicomplexan protists such as the malarial parasite Plasmodium falciparum.

  2. Next-generation sequencing reveals recent horizontal transfer of a DNA transposon between divergent mosquitoes.

    PubMed

    Diao, Yupu; Qi, Yumin; Ma, Yajun; Xia, Ai; Sharakhov, Igor; Chen, Xiaoguang; Biedler, Jim; Ling, Erjun; Tu, Zhijian Jake

    2011-01-01

    Horizontal transfer of genetic material between complex organisms often involves transposable elements (TEs). For example, a DNA transposon mariner has been shown to undergo horizontal transfer between different orders of insects and between different phyla of animals. Here we report the discovery and characterization of an ITmD37D transposon, MJ1, in Anopheles sinensis. We show that some MJ1 elements in Aedes aegypti and An. sinensis contain intact open reading frames and share nearly 99% nucleotide identity over the entire transposon, which is unexpectedly high given that these two genera diverged 145-200 million years ago. Chromosomal hybridization and TE-display showed that MJ1 copy number is low in An. sinensis. Among 24 mosquito species surveyed, MJ1 is only found in Ae. aegypti and the hyrcanus group of anopheline mosquitoes to which An. sinensis belongs. Phylogenetic analysis is consistent with horizontal transfer and provides the basis for inference of its timing and direction. Although reports of horizontal transfer of DNA transposons between higher eukaryotes are accumulating, our analysis is one of a small number of cases in which horizontal transfer of nearly identical TEs among highly divergent species has been thoroughly investigated and strongly supported. Horizontal transfer involving mosquitoes is of particular interest because there are ongoing investigations of the possibility of spreading pathogen-resistant genes into mosquito populations to control malaria and other infectious diseases. The initial indication of horizontal transfer of MJ1 came from comparisons between a 0.4x coverage An. sinensis 454 sequence database and available TEs in mosquito genomes. Therefore we have shown that it is feasible to use low coverage sequencing to systematically uncover horizontal transfer events. Expanding such efforts across a wide range of species will generate novel insights into the relative frequency of horizontal transfer of different TEs and

  3. Sequence Divergence and Conservation in Genomes of Helicobacter cetorum Strains from a Dolphin and a Whale

    PubMed Central

    Kersulyte, Dangeruta; Rossi, Mirko; Berg, Douglas E.

    2013-01-01

    Background and Objectives: Strains of Helicobacter cetorum have been cultured from several marine mammals and have been found to be closely related in 16S rDNA sequence to the human gastric pathogen H. pylori, but their genomes were not characterized further. Methods: The genomes of H. cetorum strains from a dolphin and a whale were sequenced completely using 454 technology and PCR and capillary sequencing. Results: These genomes are 1.8 and 1.95 Mb in size, some 7–26% larger than H. pylori genomes, and differ markedly from one another in gene content, and sequences and arrangements of shared genes. However, each strain is more related overall to H. pylori and its descendant H. acinonychis than to other known species. These H. cetorum strains lack cag pathogenicity islands, but contain novel alleles of the virulence-associated vacuolating cytotoxin (vacA) gene. Of particular note are (i) an extra triplet of vacA genes with ≤50% protein-level identity to each other in the 5′ two-thirds of the gene needed for host factor interaction; (ii) divergent sets of outer membrane protein genes; (iii) several metabolic genes distinct from those of H. pylori; (iv) genes for an iron-cofactored urease related to those of Helicobacter species from terrestrial carnivores, in addition to genes for a nickel-cofactored urease; and (v) members of the slr multigene family, some of which modulate host responses to infection and improve Helicobacter growth with mammalian cells. Conclusions: Our genome sequence data provide a glimpse into the novelty and great genetic diversity of marine helicobacters. These data should aid further analyses of microbial genome diversity and evolution and infection and disease mechanisms in vast and often fragile ocean ecosystems. PMID:24358262

  4. Two groups of rhinoviruses revealed by a panel of antiviral compounds present sequence divergence and differential pathogenicity.

    PubMed Central

    Andries, K; Dewindt, B; Snoeks, J; Wouters, L; Moereels, H; Lewi, P J; Janssen, P A

    1990-01-01

    A variety of chemically different compounds inhibit the replication of several serotypes of rhinoviruses (common-cold viruses). We noticed that one of these antiviral compounds, WIN 51711, had an antiviral spectrum clearly distinctive from a consensus spectrum or other capsid-binding compounds, although all of them were shown to share the same binding site. A systematic evaluation of all known rhinovirus capsid-binding compounds against all serotyped rhinoviruses was therefore initiated. Multivariate analysis of the results revealed the existence of two groups of rhinoviruses, which we will call antiviral groups A and B. The differential sensitivity of members of these groups to antiviral compounds suggests the existence of a dimorphic binding site. The antiviral groups turned out to be a reflection of a divergence of rhinovirus serotypes on a much broader level. Similarities in antiviral spectra were highly correlated with sequence similarities, not only of amino acids lining the antiviral compound-binding-site, but also of amino acids of the whole VP1 protein. Furthermore, analysis of epidemiological data indicated that group B rhinoviruses produced more than twice as many clinical infections per serotype than group A rhinoviruses did. Rhinoviruses belonging to the minor receptor group were without exception all computed to lie in the same region of antiviral group B. PMID:2154596

  5. The Complete Chloroplast Genome Sequences of Three Veroniceae Species (Plantaginaceae): Comparative Analysis and Highly Divergent Regions

    PubMed Central

    Choi, Kyoung Su; Chung, Myong Gi; Park, SeonJoo

    2016-01-01

    Previous studies of Veronica and related genera were weakly supported by molecular and paraphyletic taxa. Here, we report the complete chloroplast genome sequence of Veronica nakaiana and the related species Veronica persica and Veronicastrum sibiricum. The chloroplast genome length of V. nakaiana, V. persica, and V. sibiricum ranged from 150,198 bp to 152,930 bp. A total of 112 genes comprising 79 protein coding genes, 29 tRNA genes, and 4 rRNA genes were observed in the three chloroplast genomes. The total number of SSRs was 48, 51, and 53 in V. nakaiana, V. persica, and V. sibiricum, respectively. Two SSRs (10 bp of AT and 12 bp of AATA) were observed in the same regions (rpoC2 and ndhD) in the three chloroplast genomes. A comparison of coding genes and non-coding regions between V. nakaiana and V. persica revealed divergent sites, with the greatest variation occurring in the petD-rpoA region. The complete chloroplast genome sequence information regarding the three Veroniceae will be helpful for elucidating Veroniceae phylogenetic relationships. PMID:27047524

  6. Antibiotic-mediated recombination: ciprofloxacin stimulates SOS-independent recombination of divergent sequences in Escherichia coli.

    PubMed

    López, Elena; Elez, Marina; Matic, Ivan; Blázquez, Jesús

    2007-04-01

    The widespread use and abuse of antibiotics as therapeutic agents has produced a major challenge for bacteria, leading to the selection and spread of antibiotic resistant variants. However, antibiotics do not seem to be mere selectors of these variants. Here we show that the fluoroquinolone antibiotic ciprofloxacin, an inhibitor of type II DNA topoisomerases, stimulates intrachromosomal recombination of DNA sequences. The stimulation of recombination between divergent sequences occurs via either the RecBCD or RecFOR pathways and is, surprisingly, independent of SOS induction. Additionally, this stimulation also occurs in a hyperrecombinogenic mismatch repair mutS mutant. It is worth noting that ciprofloxacin also stimulates the conjugational recombination of an antibiotic resistance gene. Finally, we demonstrate that Escherichia coli is able to recover from treatments with recombination-stimulating concentrations of the antibiotic. Thus, fluoroquinolones can increase genetic variation by the stimulation of the recombinogenic capability of treated bacteria (via an SOS-independent mechanism) and consequently may favour the acquisition, evolution and spread of antibiotic resistance determinants. PMID:17376074

  7. Direct sequencing of mitochondrial DNA detects highly divergent haplotypes in blue marlin (Makaira nigricans).

    PubMed

    Finnerty, J R; Block, B A

    1992-06-01

    We were able to differentiate between species of billfish (Istiophoridae family) and to detect considerable intraspecific variation in the blue marlin (Makaira nigricans) by directly sequencing a polymerase chain reaction (PCR)-amplified, 612-bp fragment of the mitochondrial cytochrome b gene. Thirteen variable nucleotide sites separated blue marlin (n = 26) into 7 genotypes. On average, these genotypes differed by 5.7 base substitutions. A smaller sample of swordfish from an equally broad geographic distribution displayed relatively little intraspecific variation, with an average of 1.3 substitutions separating different genotypes. A cladistic analysis of blue marlin cytochrome b variants indicates two major divergent evolutionary lines within the species. The frequencies of these two major evolutionary lines differ significantly between Atlantic and Pacific ocean basins. This finding is important given that the Atlantic stocks of blue marlin are considered endangered. Migration from the Pacific can help replenish the numbers of blue marlin in the Atlantic, but the loss of certain mitochondrial DNA haplotypes in the Atlantic due to overfishing probably could not be remedied by an influx of Pacific fish because of their absence in the Pacific population. Fishery management strategies should attempt to preserve the genetic diversity within the species. The detection of DNA sequence polymorphism indicates the utility of PCR technology in pelagic fishery genetics.
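
    The figure of 5.7 base substitutions is an average over pairs of genotypes. A minimal sketch of that kind of calculation follows, assuming aligned haplotype strings of equal length; the toy sequences are invented for illustration and are not the marlin data.

```python
from itertools import combinations

def pairwise_differences(a, b):
    """Count positions at which two aligned sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))

def mean_pairwise_differences(haplotypes):
    """Average number of differences over all unordered pairs."""
    pairs = list(combinations(haplotypes, 2))
    if not pairs:
        raise ValueError("need at least two haplotypes")
    return sum(pairwise_differences(a, b) for a, b in pairs) / len(pairs)

# Invented toy haplotypes: the three pairs differ at 1, 2, and 1 sites.
toy = ["ACGT", "ACGA", "TCGA"]
print(mean_pairwise_differences(toy))
```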

  8. Impaired nuclear import of mammalian Dlx4 proteins as a consequence of rapid sequence divergence

    SciTech Connect

    Coubrough, Melissa L.; Bendall, Andrew J. (E-mail: abendall@uoguelph.ca)

    2006-11-15

    Dlx genes encode a developmentally important family of transcription factors with a variety of functions and sites of action during vertebrate embryogenesis. The murine Dlx4 gene is an enigmatic member of the family; little is known about the normal developmental function(s) of Dlx4. Here, we show that Dlx4 is expressed in the murine placenta and in a trophoblast cell line where the protein localizes to both the nucleus and cytoplasm. Despite the presence of several leucine/valine-rich motifs that match known nuclear export sequences, cytoplasmic Dlx4 is not due to CRM-1-mediated nuclear export. Rather, nuclear import of Dlx4 is compromised by specific residues that flank the nuclear localization signal. One of these residues represents a novel conserved feature of the Dlx4 protein in placental mammals, and the second represents novel variation within mouse Dlx4 isoforms. Comparison of orthologous protein sequences reveals a particularly high rate of non-synonymous change in the coding regions of mammalian Dlx4 genes. Since impaired nuclear localization is unlikely to enhance the function of a nuclear transcription factor, these data point to reduced selection pressure as the basis for the rapid divergence of the Dlx4 gene within the mammalian clade.

  9. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  10. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  11. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  12. RAD sequencing enables unprecedented phylogenetic resolution and objective species delimitation in recalcitrant divergent taxa.

    PubMed

    Herrera, Santiago; Shank, Timothy M

    2016-07-01

    Species delimitation is problematic in many cases due to the difficulty of evaluating predictions from species hypotheses. In many cases delimitations rely on subjective interpretations of morphological and/or DNA data. Species with inadequate genetic resources needed to answer questions regarding evolutionary relatedness and genetic uniqueness are particularly problematic. In this study, we demonstrate the utility of restriction site associated DNA sequencing (RAD-seq) to objectively resolve unambiguous phylogenetic relationships in a recalcitrant group of deep-sea corals with divergences >80 million years. We infer robust species boundaries in the genus Paragorgia by testing alternative delimitation hypotheses using a Bayes Factors delimitation method. We present substantial evidence rejecting the current morphological species delimitation model for the genus and infer the presence of cryptic species associated with environmental variables. We argue that the suitability limits of RAD-seq for phylogenetic inferences cannot be assessed in terms of absolute time, but are contingent on taxon-specific factors. We show that classical taxonomy can greatly benefit from integrative approaches that provide objective tests to species delimitation hypotheses. Our results lead the way for addressing further questions in marine biogeography, community ecology, population dynamics, conservation, and evolution. PMID:26993764

  13. Optimization of short amino acid sequences classifier

    NASA Astrophysics Data System (ADS)

    Barcz, Aleksy; Szymański, Zbigniew

    This article describes processing methods used for short amino acid sequences classification. The data processed are 9-symbol string representations of amino acid sequences, divided into 49 data sets - each one containing samples labeled as reacting or not with a given enzyme. The goal of the classification is to determine for a single enzyme, whether an amino acid sequence would react with it or not. Each data set is processed separately. Feature selection is performed to reduce the number of dimensions for each data set. The method used for feature selection consists of two phases. During the first phase, significant positions are selected using Classification and Regression Trees. Afterwards, symbols appearing at the selected positions are substituted with numeric values of amino acid properties taken from the AAindex database. In the second phase the new set of features is reduced using a correlation-based ranking formula and Gram-Schmidt orthogonalization. Finally, the preprocessed data is used for training LS-SVM classifiers. SPDE, an evolutionary algorithm, is used to obtain optimal hyperparameters for the LS-SVM classifier, such as error penalty parameter C and kernel-specific hyperparameters. A simple score penalty is used to adapt the SPDE algorithm to the task of selecting classifiers with the best performance measure values.
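
    The substitution step, in which symbols at selected positions are replaced by numeric property values, can be sketched as below. This is an illustration only: the Kyte-Doolittle hydropathy scale stands in for whichever AAindex properties the authors used, and the selected positions are hypothetical rather than the output of a Classification and Regression Tree.

```python
# Kyte-Doolittle hydropathy values, used here only as a stand-in property.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def encode_peptide(peptide, positions, scale=KYTE_DOOLITTLE):
    """Map the residues at the chosen positions to numeric feature values."""
    if len(peptide) != 9:
        raise ValueError("expected a 9-symbol amino acid string")
    return [scale[peptide[i]] for i in positions]

# Hypothetical selected positions (0-based); a real pipeline would learn them.
selected = [0, 4, 8]
print(encode_peptide("ACDEFGHIK", selected))
```

    Using several property scales per position would multiply the number of features, which is presumably why a second reduction phase is needed.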

  14. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support. A fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  15. On combining protein sequences and nucleic acid sequences in phylogenetic analysis: the homeobox protein case.

    PubMed

    Agosti, D; Jacobs, D; DeSalle, R

    1996-01-01

    Amino acid encoding genes contain character state information that may be useful for phylogenetic analysis on at least two levels. The nucleotide sequence and the translated amino acid sequences have both been employed separately as character states for cladistic studies of various taxa, including studies of the genealogy of genes in multigene families. In essence, amino acid sequences and nucleic acid sequences are two different ways of character coding the information in a gene. Silent positions in the nucleotide sequence (first or third positions in codons that can accrue change without changing the identity of the amino acid that the triplet codes for) may accrue change relatively rapidly and become saturated, losing the pattern of historical divergence. On the other hand, non-silent nucleotide alterations and their accompanying amino acid changes may evolve too slowly to reveal relationships among closely related taxa. In general, the dynamics of sequence change in silent and non-silent positions in protein coding genes result in homoplasy and lack of resolution, respectively. We suggest that the combination of nucleic acid and the translated amino acid coded character states into the same data matrix for phylogenetic analysis addresses some of the problems caused by the rapid change of silent nucleotide positions and overall slow rate of change of non-silent nucleotide positions and slowly changing amino acid positions. One major theoretical problem with this approach is the apparent non-independence of the two sources of characters. However, there are at least three possible outcomes when comparing protein coding nucleic acid sequences with their translated amino acids in a phylogenetic context on a codon by codon basis. First, the two character sets for a codon may be entirely congruent with respect to the information they convey about the relationships of a certain set of taxa. Second, one character set may display no information concerning a phylogenetic ...
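
    The idea of placing nucleotide and translated amino acid characters in one data matrix can be sketched as follows. This is a minimal illustration assuming in-frame, aligned coding sequences and the standard genetic code; the function names and the toy taxa are assumptions, not part of the original study.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate an in-frame coding sequence; '---' maps to '-', unknowns to '?'."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join("-" if c == "---" else CODON_TABLE.get(c, "?") for c in codons)

def combined_matrix(aligned_cds):
    """Per taxon, nucleotide characters followed by amino acid characters."""
    return {taxon: list(seq.upper()) + list(translate(seq))
            for taxon, seq in aligned_cds.items()}

# Toy alignment: the two taxa differ only at a silent third position.
matrix = combined_matrix({"taxon1": "ATGGCA", "taxon2": "ATGGCC"})
for taxon, row in matrix.items():
    print(taxon, "".join(row))
```

    In the toy alignment the silent change appears as a difference in one nucleotide column, while the amino acid columns are identical, which is the kind of codon-by-codon comparison the abstract describes.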

  16. Functional Divergence of APETALA1 and FRUITFULL is due to Changes in both Regulation and Coding Sequence

    PubMed Central

    McCarthy, Elizabeth W.; Mohamed, Abeer; Litt, Amy

    2015-01-01

    Gene duplications are prevalent in plants, and functional divergence subsequent to duplication may be linked with the occurrence of novel phenotypes in plant evolution. Here, we examine the functional divergence of Arabidopsis thaliana APETALA1 (AP1) and FRUITFULL (FUL), which arose via a duplication correlated with the origin of the core eudicots. Both AP1 and FUL play a role in floral meristem identity, but AP1 is required for the formation of sepals and petals whereas FUL is involved in cauline leaf and fruit development. AP1 and FUL are expressed in mutually exclusive domains but also differ in sequence, with unique conserved motifs in the C-terminal domains of the proteins that suggest functional differentiation. To determine whether the functional divergence of AP1 and FUL is due to changes in regulation or changes in coding sequence, we performed promoter swap experiments, in which FUL was expressed in the AP1 domain in the ap1 mutant and vice versa. Our results show that FUL can partially substitute for AP1, and AP1 can partially substitute for FUL; thus, the functional divergence between AP1 and FUL is due to changes in both regulation and coding sequence. We also mutated AP1 and FUL conserved motifs to determine if they are required for protein function and tested the ability of these mutated proteins to interact in yeast with known partners. We found that these motifs appear to play at best a minor role in protein function and dimerization capability, despite being strongly conserved. Our results suggest that the functional differentiation of these two paralogous key transcriptional regulators involves both differences in regulation and in sequence; however, sequence changes in the form of unique conserved motifs do not explain the differences observed. PMID:26697035

  18. Analysis of porcine adipose tissue transcriptome reveals differences in de novo fatty acid synthesis in pigs with divergent muscle fatty acid composition

    PubMed Central

    2013-01-01

    Background: In pigs, adipose tissue is one of the principal organs involved in the regulation of lipid metabolism. It is particularly involved in the overall fatty acid synthesis with consequences in other lipid-target organs such as muscles and the liver. With this in mind, we have used massive, parallel high-throughput sequencing technologies to characterize the porcine adipose tissue transcriptome architecture in six Iberian x Landrace crossbred pigs showing extreme phenotypes for intramuscular fatty acid composition (three per group). Results: High-throughput RNA sequencing was used to generate a whole characterization of adipose tissue (backfat) transcriptome. A total of 4,130 putative unannotated protein-coding sequences were identified in the 20% of reads which mapped in intergenic regions. Furthermore, 36% of the unmapped reads were represented by interspersed repeats, SINEs being the most abundant elements. Differential expression analyses identified 396 candidate genes among divergent animals for intramuscular fatty acid composition. Sixty-two percent of these genes (247/396) presented higher expression in the group of pigs with higher content of intramuscular SFA and MUFA, while the remaining 149 showed higher expression in the group with higher content of PUFA. Pathway analysis related these genes to biological functions and canonical pathways controlling lipid and fatty acid metabolisms. In concordance with the phenotypic classification of animals, the major metabolic pathway differentially modulated between groups was de novo lipogenesis, the group with more PUFA being the one that showed lower expression of lipogenic genes. Conclusions: These results will help in the identification of genetic variants at loci that affect fatty acid composition traits. The implications of these results range from the improvement of porcine meat quality traits to the application of the pig as an animal model of human metabolic diseases. PMID:24289474

  19. The Pedigree Rate of Sequence Divergence in the Human Mitochondrial Genome: There Is a Difference Between Phylogenetic and Pedigree Rates

    PubMed Central

    Howell, Neil; Smejkal, Christy Bogolin; Mackey, D. A.; Chinnery, P. F.; Turnbull, D. M.; Herrnstadt, Corinna

    2003-01-01

    We have extended our previous analysis of the pedigree rate of control-region divergence in the human mitochondrial genome. One new germline mutation in the mitochondrial DNA (mtDNA) control region was detected among 185 transmission events (generations) from five Leber hereditary optic neuropathy (LHON) pedigrees. Pooling the LHON pedigree analyses yields a control-region divergence rate of 1.0 mutation/bp/10^6 years (Myr). When the results from eight published studies that used a similar approach were pooled with the LHON pedigree studies, totaling >2,600 transmission events, a pedigree divergence rate of 0.95 mutations/bp/Myr for the control region was obtained with a 99.5% confidence interval of 0.53–1.57. Taken together, the cumulative results support the original conclusion that the pedigree divergence rate for the control region is ∼10-fold higher than that obtained with phylogenetic analyses. There is no evidence that any one factor explains this discrepancy, and the possible roles of mutational hotspots (rate heterogeneity), selection, and random genetic drift and the limitations of phylogenetic approaches to deal with high levels of homoplasy are discussed. In addition, we have extended our pedigree analysis of divergence in the mtDNA coding region. Finally, divergence of complete mtDNA sequences was analyzed in two tissues, white blood cells and skeletal muscle, from each of 17 individuals. In three of these individuals, there were four instances in which an mtDNA mutation was found in one tissue but not in the other. These results are discussed in terms of the occurrence of somatic mtDNA mutations. PMID:12571803

  20. Mitochondrial and nuclear DNA sequences reveal recent divergence in morphologically indistinguishable petrels.

    PubMed

    Welch, Andreanna J; Yoshida, Allison A; Fleischer, Robert C

    2011-04-01

    Often during the process of divergence, genetic markers will only gradually obtain the signal of isolation. Studies of recently diverged taxa utilizing both mitochondrial and nuclear data sets may therefore yield gene trees with differing levels of phylogenetic signal as a result of differences in coalescence times. However, several factors can lead to this same pattern, and it is important to distinguish between them to gain a better understanding of the process of divergence and the factors driving it. Here, we employ three nuclear intron loci in addition to the mitochondrial Cytochrome b gene to investigate the magnitude and timing of divergence between two endangered and nearly indistinguishable petrel taxa: the Galapagos (GAPE) and Hawaiian (HAPE) petrels (Pterodroma phaeopygia and P. sandwichensis). Phylogenetic analyses indicated reciprocal monophyly between these two taxa for the mitochondrial data set, but trees derived from the nuclear introns were unresolved. Coalescent analyses revealed effectively no migration between GAPE and HAPE over the last 100,000 generations and that they diverged relatively recently, approximately 550,000 years ago, coincident with a time of intense ecological change in both the Galapagos and Hawaiian archipelagoes. This indicates that recent divergence and incomplete lineage sorting are causing the difference in the strength of the phylogenetic signal of each data set, instead of insufficient variability or ongoing male-biased dispersal. Further coalescent analyses show that gene flow is low even between islands within each archipelago suggesting that divergence may be continuing at a local scale. Accurately identifying recently isolated taxa is becoming increasingly important as many clearly recognizable species are already threatened by extinction. PMID:21324012

  2. Divergent functional profiles of acidic and basic phospholipases A2 in the venom of the snake Porthidium lansbergii lansbergii.

    PubMed

    Jiménez-Charris, Eliécer; Montealegre-Sánchez, Leonel; Solano-Redondo, Luis; Castro-Herrera, Fernando; Fierro-Pérez, Leonardo; Lomonte, Bruno

    2016-09-01

    The Lansberg's hognose pitviper, Porthidium lansbergii lansbergii, inhabits northern Colombia. A recent proteomic characterization of its venom (J. Proteomics [2015] 114, 287-299) revealed the presence of phospholipases A2 (PLA2) accounting for 16.2% of its proteins. The two most abundant PLA2s were biochemically and functionally characterized. Pllans-I is a basic, dimeric enzyme with a monomer mass of 14,136 Da, while Pllans-II is an acidic, monomeric enzyme of 13,901 Da. Both have Asp49 in their partial amino acid sequences and, accordingly, are catalytically active upon natural or synthetic substrates. Nevertheless, these two enzymes differ markedly in their bioactivities. Pllans-I induces myonecrosis, edema, and is lethal by intracerebro-ventricular injection in mice, as well as cytolytic and anticoagulant in vitro. In contrast, Pllans-II is devoid of these effects, except for the induction of a moderate edema. In spite of lacking myotoxicity, Pllans-II enhances the muscle damaging action of Pllans-I in vivo. Altogether, results further illustrate the divergent functional profiles of basic and acidic PLA2s in viperid venoms, and suggest that Pllans-I plays a myotoxic role in envenomings by P. l. lansbergii, whereas Pllans-II, apparently devoid of toxicity, enhances muscle damage caused by Pllans-I. PMID:27381371

  3. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  4. Simple sequence repeat markers in genetic divergence and marker-assisted selection of rice cultivars: a review.

    PubMed

    Kaur, Shubhneet; Panesar, Parmjit S; Bera, Manab B; Kaur, Varinder

    2015-01-01

    Sequencing of the rice genome has facilitated the understanding of rice evolution and has been utilized extensively for mining of DNA markers to facilitate marker-assisted breeding. Simple sequence repeat (SSR) markers that are tandemly repeated nucleotide sequence motifs flanked by unique sequences are presently the marker of choice in rice improvement due to their abundance, co-dominant inheritance, high levels of allelic diversity, and simple reproducible assay. The current level of genome coverage by SSR markers in rice is sufficient to employ them for genotype identification and marker-assisted selection in breeding for mapping of genes and quantitative trait loci analysis. This review provides comprehensive information on the mapping and applications of SSR markers in investigation of rice cultivars to study their genetic divergence and marker-assisted selection of important agronomic traits.

  5. Heterogeneity of amino acid sequence in hippopotamus cytochrome c.

    PubMed

    Thompson, R B; Borden, D; Tarr, G E; Margoliash, E

    1978-12-25

    The amino acid sequences of chymotryptic and tryptic peptides of Hippopotamus amphibius cytochrome c were determined by a recent modification of the manual Edman sequential degradation procedure. They were ordered by comparison with the structure of the hog protein. The hippopotamus protein differs in three positions: serine, alanine, and glutamine replace alanine, glutamic acid, and lysine in positions 43, 92, and 100, respectively. Since the artiodactyl suborders diverged in the mid-Eocene some 50 million years ago, the fact that representatives of some of them show no differences in their cytochromes c (cow, sheep, and hog), while another exhibits as many as three such differences, verifies that even in relatively closely related lines of descent the rate at which cytochrome c changes in the course of evolution is not constant. Furthermore, 10.6% of the hippopotamus cytochrome c preparation was shown to contain isoleucine instead of valine at position 3, indicating that one of the four animals from which the protein was obtained was heterozygous in the cytochrome c gene. Such heterogeneity is a necessary condition of evolutionary variation and has not been previously observed in the cytochrome c of a wild mammalian population.

  6. Studies on monotreme proteins. VII. Amino acid sequence of myoglobin from the platypus, Ornithorhynchus anatinus.

    PubMed

    Fisher, W K; Thompson, E O

    1976-03-01

    Myoglobin isolated from skeletal muscle of the platypus contains 153 amino acid residues. The complete amino acid sequence has been determined following cleavage with cyanogen bromide and further digestion of the four fragments with trypsin, chymotrypsin, pepsin and thermolysin. Sequences of the purified peptides were determined by the dansyl-Edman procedure. The amino acid sequence showed 25 differences from human myoglobin and 24 from kangaroo myoglobin. Amino acid sequences in myoglobins are more conserved than sequences in the alpha- and beta-globin chains, and platypus myoglobin shows a similar number of variations in sequence to kangaroo myoglobin when compared with myoglobin of other species. The date of divergence of the platypus from other mammals was estimated at 102 +/- 31 million years, based on the number of amino acid differences between species and allowing for mutations during the evolutionary period. This estimate differs widely from the estimate given by similar treatment of the alpha- and beta-chain sequences and a constant rate of mutation of globin chains is not supported. PMID:962722
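
    The phrase "allowing for mutations during the evolutionary period" refers to correcting the observed differences for repeated changes at the same site. One standard correction of this kind is the Poisson distance, sketched below; whether the authors used this exact formula is an assumption, and the function name is illustrative.

```python
import math

def poisson_corrected_distance(n_differences, n_sites):
    """Estimated substitutions per site, d = -ln(1 - p), from observed p."""
    p = n_differences / n_sites
    if not 0.0 <= p < 1.0:
        raise ValueError("proportion of differing sites must be in [0, 1)")
    return -math.log(1.0 - p)

# Platypus vs. human myoglobin: 25 differences over 153 residues (from the abstract).
observed = 25 / 153
corrected = poisson_corrected_distance(25, 153)
print(round(observed, 3), round(corrected, 3))
```

    For these figures the corrected distance is about 0.178 substitutions per site against an observed proportion of about 0.163. Converting such a distance into a date additionally requires a calibrated rate, and the abstract itself notes that a constant rate is not supported for globin chains.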

  7. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.

  8. An evolutionary constraint: strongly disfavored class of change in DNA sequence during divergence of cis-regulatory modules.

    PubMed

    Cameron, R Andrew; Chow, Suk Hen; Berney, Kevin; Chiu, Tsz-Yeung; Yuan, Qiu-Autumn; Krämer, Alexander; Helguero, Argelia; Ransick, Andrew; Yun, Mirong; Davidson, Eric H

    2005-08-16

    The DNA of functional cis-regulatory modules displays extensive sequence conservation in comparisons of genomes from modestly distant species. Patches of sequence that are several hundred base pairs in length within these modules are often seen to be 80-95% identical, although the flanking sequence cannot even be aligned. However, it is unlikely that base pairs located between the transcription factor target sites of cis-regulatory modules have sequence-dependent function, and the mechanism that constrains evolutionary change within cis-regulatory modules is incompletely understood. We chose five functionally characterized cis-regulatory modules from the Strongylocentrotus purpuratus (sea urchin) genome and obtained orthologous regulatory and flanking sequences from a bacterial artificial chromosome genome library of a congener, Strongylocentrotus franciscanus. As expected, single-nucleotide substitutions and small indels occur freely at many positions within the regulatory modules of these two species, as they do outside the regulatory modules. However, large indels (>20 bp) are statistically almost absent within the regulatory modules, although they are common in flanking intergenic or intronic sequence. The result helps to explain the patterns of evolutionary sequence divergence characteristic of cis-regulatory DNA.

  9. Characterization and phylogenetic analysis of α-gliadin gene sequences reveals significant genomic divergence in Triticeae species.

    PubMed

    Li, Guang-Rong; Lang, Tao; Yang, En-Nian; Liu, Cheng; Yang, Zu-Jun

    2014-12-01

    Although the unique properties of the wheat α-gliadin gene family are well characterized, little is known about the evolution and genomic divergence of the α-gliadin gene family within the Triticeae. We isolated a total of 203 α-gliadin gene sequences from 11 representative diploid and polyploid Triticeae species, and found 108 sequences to be putatively functional. Our results indicate that α-gliadin genes may have originated from wild Secale species, where the sequences contain the shortest repetitive domains and display minimum variation. A miniature inverted-repeat transposable element insertion is reported for the first time in an α-gliadin gene sequence of Thinopyrum intermedium in this study, indicating that the transposable element might have contributed to the diversification of the α-gliadin gene family among Triticeae genomes. The phylogenetic analyses revealed that the α-gliadin gene sequences of Dasypyrum, Australopyrum, Lophopyrum, Eremopyrum and Pseudoroegneria species have amplified several times. A search for four typical toxic epitopes for celiac disease within the Triticeae α-gliadin gene sequences showed that the α-gliadins of wild Secale, Australopyrum and Agropyron genomes lack all four epitopes, while other Triticeae species have accumulated these epitopes, suggesting that the evolution of these toxic epitope sequences occurred during the course of speciation, domestication or polyploidization of Triticeae. PMID:25572231

  10. Codon and Amino Acid Usage Are Shaped by Selection Across Divergent Model Organisms of the Pancrustacea.

    PubMed

    Whittle, Carrie A; Extavour, Cassandra G

    2015-11-01

    In protein-coding genes, synonymous codon usage and amino acid composition correlate to expression in some eukaryotes, and may result from translational selection. Here, we studied large-scale RNA-seq data from three divergent arthropod models, including cricket (Gryllus bimaculatus), milkweed bug (Oncopeltus fasciatus), and the amphipod crustacean Parhyale hawaiensis, and tested for optimization of codon and amino acid usage relative to expression level. We report strong signals of AT3 optimal codons (those favored in highly expressed genes) in G. bimaculatus and O. fasciatus, whereas weaker signs of GC3 optimal codons were found in P. hawaiensis, suggesting selection on codon usage in all three organisms. Further, in G. bimaculatus and O. fasciatus, high expression was associated with lowered frequency of amino acids with large size/complexity (S/C) scores in favor of those with intermediate S/C values; thus, selection may favor smaller amino acids while retaining those of moderate size for protein stability or conformation. In P. hawaiensis, highly transcribed genes had elevated frequency of amino acids with large and small S/C scores, suggesting a complex dynamic in this crustacean. In all species, the highly transcribed genes appeared to favor short proteins, high optimal codon usage, specific amino acids, and were preferentially involved in cell-cycling and protein synthesis. Together, based on examination of 1,680,067, 1,667,783, and 1,326,896 codon sites in G. bimaculatus, O. fasciatus, and P. hawaiensis, respectively, we conclude that translational selection shapes codon and amino acid usage in these three Pancrustacean arthropods. PMID:26384771
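
    For readers unfamiliar with the AT3/GC3 terminology, the sketch below shows one way to compute third-codon-position composition for a single coding sequence. It assumes that "GC3" and "AT3" refer to codons ending in G/C or A/T respectively, that the input is already in frame, and that stop codons need no special handling; the function name and the example sequence are hypothetical and are not taken from the study.

```python
from collections import Counter

def third_position_composition(cds: str) -> dict:
    """Return GC3 and AT3 fractions for an in-frame coding sequence."""
    seq = cds.upper().replace("U", "T")   # accept RNA or DNA alphabets
    usable = len(seq) - len(seq) % 3      # ignore a trailing partial codon
    thirds = Counter(seq[i + 2] for i in range(0, usable, 3))
    gc = thirds["G"] + thirds["C"]
    at = thirds["A"] + thirds["T"]
    total = gc + at                       # ambiguous bases (e.g. N) are skipped
    if total == 0:
        raise ValueError("no unambiguous third-position bases found")
    return {"GC3": gc / total, "AT3": at / total}

# Hypothetical five-codon example: ATG GCC AAA TTT GGA
print(third_position_composition("ATGGCCAAATTTGGA"))
```

    Identifying optimal codons as described in the abstract would further require comparing codon frequencies between highly and lowly expressed genes, which this sketch does not attempt.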

  11. Genetic Divergence between Freshwater and Marine Morphs of Alewife (Alosa pseudoharengus): A ‘Next-Generation’ Sequencing Analysis

    PubMed Central

    Czesny, Sergiusz; Epifanio, John; Michalak, Pawel

    2012-01-01

    Alewife (Alosa pseudoharengus), a small clupeid fish native to the Atlantic Ocean, recently (∼150 years ago) invaded the North American Great Lakes; despite the challenges of a freshwater environment, its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, and fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. The question arises of the extent to which these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and the Atlantic Ocean. Population-specific single nucleotide polymorphisms were rare but, interestingly, occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes. PMID:22438868

  12. Genetic divergence between freshwater and marine morphs of alewife (Alosa pseudoharengus): a 'next-generation' sequencing analysis.

    PubMed

    Czesny, Sergiusz; Epifanio, John; Michalak, Pawel

    2012-01-01

    Alewife (Alosa pseudoharengus), a small clupeid fish native to the Atlantic Ocean, recently (∼150 years ago) invaded the North American Great Lakes; despite the challenges of a freshwater environment, its populations exploded and disrupted local food web structures. This range expansion has been accompanied by dramatic changes at all levels of organization. Growth rates, size at maturation, and fecundity are only a few of the most distinct morphological and life history traits that contrast the two alewife morphs. The question arises of the extent to which these rapidly evolving differences between marine and freshwater varieties result from regulatory (including phenotypic plasticity) or structural mutations. To gain insights into expression changes and sequence divergence between marine and freshwater alewives, we sequenced transcriptomes of individuals from Lake Michigan and the Atlantic Ocean. Population-specific single nucleotide polymorphisms were rare but, interestingly, occurred in sequences of genes that also tended to show large differences in expression. Our results show that the striking phenotypic divergence between anadromous and lake alewives can be attributed to massive regulatory modifications rather than coding changes.

  13. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  14. More reliable estimates of divergence times in Pan using complete mtDNA sequences and accounting for population structure

    PubMed Central

    Stone, Anne C.; Battistuzzi, Fabia U.; Kubatko, Laura S.; Perry, George H.; Trudeau, Evan; Lin, Hsiuman; Kumar, Sudhir

    2010-01-01

    Here, we report the sequencing and analysis of eight complete mitochondrial genomes of chimpanzees (Pan troglodytes) from each of the three established subspecies (P. t. troglodytes, P. t. schweinfurthii and P. t. verus) and the proposed fourth subspecies (P. t. ellioti). Our population genetic analyses are consistent with neutral patterns of evolution that have been shaped by demography. The high levels of mtDNA diversity in western chimpanzees are unlike those seen at nuclear loci, which may reflect a demographic history of greater female to male effective population sizes possibly owing to the characteristics of the founding population. By using relaxed-clock methods, we have inferred a timetree of chimpanzee species and subspecies. The absolute divergence times vary based on the methods and calibration used, but relative divergence times show extensive uniformity. Overall, mtDNA produces consistently older times than those known from nuclear markers, a discrepancy that is reduced significantly by explicitly accounting for chimpanzee population structures in time estimation. Assuming the human–chimpanzee split to be between 7 and 5 Ma, chimpanzee time estimates are 2.1–1.5, 1.1–0.76 and 0.25–0.18 Ma for the chimpanzee/bonobo, western/(eastern + central) and eastern/central chimpanzee divergences, respectively. PMID:20855302
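
    The statement that relative divergence times are nearly uniform while absolute times depend on calibration can be checked with simple arithmetic on the figures quoted above. The sketch below divides each reported node age by the calibration age; pairing the older bound of each range with the 7 Ma calibration and the younger bound with 5 Ma is an assumption, since the abstract does not state the pairing explicitly, and the function name is hypothetical.

```python
# Calibration ages for the human-chimpanzee split (Ma), from the abstract.
CALIBRATIONS = (7.0, 5.0)

# Node age ranges (Ma) from the abstract; bound-to-calibration pairing is assumed.
NODE_AGES = {
    "chimpanzee/bonobo": (2.1, 1.5),
    "western/(eastern + central)": (1.1, 0.76),
    "eastern/central": (0.25, 0.18),
}

def relative_ages(node_ages, calibrations):
    """Express each node age as a fraction of its calibration age."""
    return tuple(age / cal for age, cal in zip(node_ages, calibrations))

for name, ages in NODE_AGES.items():
    older, younger = relative_ages(ages, CALIBRATIONS)
    print(f"{name}: {older:.3f} vs {younger:.3f} of the calibration age")
```

    Under this pairing, the two fractions for each node agree closely, which is consistent with the abstract's claim about relative divergence times.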

  15. Conservation and Expression Patterns Divergence of Ascorbic Acid d-mannose/l-galactose Pathway Genes in Brassica rapa

    PubMed Central

    Duan, Weike; Ren, Jun; Li, Yan; Liu, Tongkun; Song, Xiaoming; Chen, Zhongwen; Huang, Zhinan; Hou, Xilin; Li, Ying

    2016-01-01

    Ascorbic acid (AsA) participates in diverse biological processes, is regulated by multiple factors and is a potent antioxidant and cellular reductant. The D-Mannose/L-Galactose pathway is a major plant AsA biosynthetic pathway that is highly connected within biosynthetic networks, and generally conserved across plants. Previous work has shown that, although most genes of this pathway are expressed under standard growth conditions in Brassica rapa, some paralogs of these genes are not. We hypothesize that regulatory evolution in duplicate AsA pathway genes has occurred as an adaptation to environmental stressors, and that gene retention has been influenced by polyploidization events in Brassicas. To test these hypotheses, we explored the conservation of these genes in Brassicas and the divergence of their expression patterns in B. rapa. Similar retention and a high degree of gene sequence similarity were identified in B. rapa (A genome), B. oleracea (C genome) and B. napus (AC genome). However, the number of genes that encode the same type of enzyme varied among the three plant species. With the exception of GMP, which has nine genes, there were one to four genes that encoded each of the other enzymes. Moreover, we found that divergence in expression patterns is widespread among these genes. (i) VTC2 and VTC5 are paralogous genes, but only VTC5 is influenced by FLC. (ii) Under light treatment, PMI1 co-regulates the AsA pool size with other D-Man/L-Gal pathway genes, whereas PMI2 is regulated only by darkness. (iii) Under NaCl, Cu2+, MeJA and wounding stresses, most of the paralogs exhibit different expression patterns. Additionally, GME and GPP are the key regulatory enzymes that limit AsA biosynthesis in response to these treatments. In conclusion, our data suggest that the conserved and divergent expression patterns of D-Man/L-Gal pathway genes not only avoid AsA biosynthesis network instability but also allow B. rapa to better adapt to complex environments. PMID:27313597

  17. Conservation and Expression Patterns Divergence of Ascorbic Acid d-mannose/l-galactose Pathway Genes in Brassica rapa.

    PubMed

    Duan, Weike; Ren, Jun; Li, Yan; Liu, Tongkun; Song, Xiaoming; Chen, Zhongwen; Huang, Zhinan; Hou, Xilin; Li, Ying

    2016-01-01

    Ascorbic acid (AsA) participates in diverse biological processes, is regulated by multiple factors and is a potent antioxidant and cellular reductant. The D-Mannose/L-Galactose pathway is a major plant AsA biosynthetic pathway that is highly connected within biosynthetic networks, and generally conserved across plants. Previous work has shown that, although most genes of this pathway are expressed under standard growth conditions in Brassica rapa, some paralogs of these genes are not. We hypothesize that regulatory evolution in duplicate AsA pathway genes has occurred as an adaptation to environmental stressors, and that gene retention has been influenced by polyploidization events in Brassicas. To test these hypotheses, we explored the conservation of these genes in Brassicas and the divergence of their expression patterns in B. rapa. Similar retention and a high degree of gene sequence similarity were identified in B. rapa (A genome), B. oleracea (C genome) and B. napus (AC genome). However, the number of genes that encode the same type of enzyme varied among the three plant species. With the exception of GMP, which has nine genes, there were one to four genes that encoded each of the other enzymes. Moreover, we found that divergence in expression patterns is widespread among these genes. (i) VTC2 and VTC5 are paralogous genes, but only VTC5 is influenced by FLC. (ii) Under light treatment, PMI1 co-regulates the AsA pool size with other D-Man/L-Gal pathway genes, whereas PMI2 is regulated only by darkness. (iii) Under NaCl, Cu(2+), MeJA and wounding stresses, most of the paralogs exhibit different expression patterns. Additionally, GME and GPP are the key regulatory enzymes that limit AsA biosynthesis in response to these treatments. In conclusion, our data suggest that the conserved and divergent expression patterns of D-Man/L-Gal pathway genes not only avoid AsA biosynthesis network instability but also allow B. rapa to better adapt to complex environments. PMID:27313597

  18. Patterns of sequence divergence and evolution of the S1 orthologous regions between Asian and African cultivated rice species.

    PubMed

    Guyot, Romain; Garavito, Andrea; Gavory, Frédérick; Samain, Sylvie; Tohme, Joe; Ghesquière, Alain; Lorieux, Mathias

    2011-01-01

    A strong postzygotic reproductive barrier separates the recently diverged Asian and African cultivated rice species, Oryza sativa and O. glaberrima. Recently a model of genetic incompatibilities between three adjacent loci: S(1)A, S(1) and S(1)B (called together the S(1) regions) interacting epistatically, was postulated to cause the allelic elimination of female gametes in interspecific hybrids. Two candidate factors for the S(1) locus (including a putative F-box gene) were proposed, but candidates for S(1)A and S(1)B remained undetermined. Here, to better understand the basis of the evolution of regions involved in reproductive isolation, we studied the genic and structural changes accumulated in the S(1) regions between orthologous sequences. First, we established an 813 kb genomic sequence in O. glaberrima, covering completely the S(1)A, S(1) and the majority of the S(1)B regions, and compared it with the orthologous regions of O. sativa. An overall strong structural conservation was observed, with the exception of three isolated regions of disturbed collinearity: (1) a local invasion of transposable elements around a putative F-box gene within S(1), (2) the multiple duplication and subsequent divergence of the same F-box gene within S(1)A, (3) an interspecific chromosomal inversion in S(1)B, which restricts recombination in our O. sativa×O. glaberrima crosses. Besides these few structural variations, a uniform conservative pattern of coding sequence divergence was found all along the S(1) regions. Hence, the S(1) regions have undergone no drastic variation in their recent divergence and evolution between O. sativa and O. glaberrima, suggesting that a small accumulation of genic changes, following a Bateson-Dobzhansky-Muller (BDM) model, might be involved in the establishment of the sterility barrier. In this context, genetic incompatibilities involving the duplicated F-box genes as putative candidates, and a possible strengthening step involving the

  19. Patterns of Sequence Divergence and Evolution of the S1 Orthologous Regions between Asian and African Cultivated Rice Species

    PubMed Central

    Guyot, Romain; Garavito, Andrea; Gavory, Frédérick; Samain, Sylvie; Tohme, Joe; Ghesquière, Alain; Lorieux, Mathias

    2011-01-01

    A strong postzygotic reproductive barrier separates the recently diverged Asian and African cultivated rice species, Oryza sativa and O. glaberrima. Recently a model of genetic incompatibilities between three adjacent loci: S1A, S1 and S1B (called together the S1 regions) interacting epistatically, was postulated to cause the allelic elimination of female gametes in interspecific hybrids. Two candidate factors for the S1 locus (including a putative F-box gene) were proposed, but candidates for S1A and S1B remained undetermined. Here, to better understand the basis of the evolution of regions involved in reproductive isolation, we studied the genic and structural changes accumulated in the S1 regions between orthologous sequences. First, we established an 813 kb genomic sequence in O. glaberrima, covering completely the S1A, S1 and the majority of the S1B regions, and compared it with the orthologous regions of O. sativa. An overall strong structural conservation was observed, with the exception of three isolated regions of disturbed collinearity: (1) a local invasion of transposable elements around a putative F-box gene within S1, (2) the multiple duplication and subsequent divergence of the same F-box gene within S1A, (3) an interspecific chromosomal inversion in S1B, which restricts recombination in our O. sativa×O. glaberrima crosses. Besides these few structural variations, a uniform conservative pattern of coding sequence divergence was found all along the S1 regions. Hence, the S1 regions have undergone no drastic variation in their recent divergence and evolution between O. sativa and O. glaberrima, suggesting that a small accumulation of genic changes, following a Bateson-Dobzhansky-Muller (BDM) model, might be involved in the establishment of the sterility barrier. In this context, genetic incompatibilities involving the duplicated F-box genes as putative candidates, and a possible strengthening step involving the chromosomal inversion might participate to

  20. The Complete Sequence of the Acacia ligulata Chloroplast Genome Reveals a Highly Divergent clpP1 Gene.

    PubMed

    Williams, Anna V; Boykin, Laura M; Howell, Katharine A; Nevill, Paul G; Small, Ian

    2015-01-01

    Legumes are a highly diverse angiosperm family that includes many agriculturally important species. To date, 21 complete chloroplast genomes have been sequenced from legume crops confined to the Papilionoideae subfamily. Here we report the first chloroplast genome from the Mimosoideae, Acacia ligulata, and compare it to the previously sequenced legume genomes. The A. ligulata chloroplast genome is 174,233 bp in size, comprising inverted repeats of 38,225 bp and single-copy regions of 92,798 bp and 4,985 bp [corrected]. Acacia ligulata lacks the inversion present in many of the Papilionoideae, but is not otherwise significantly different in terms of gene and repeat content. The key feature is its highly divergent clpP1 gene, normally considered essential in chloroplast genomes. In A. ligulata, although transcribed and spliced, it probably encodes a catalytically inactive protein. This study provides a significant resource for further genetic research into Acacia and the Mimosoideae. The divergent clpP1 gene suggests that Acacia will provide an interesting source of information on the evolution and functional diversity of the chloroplast Clp protease complex.

  1. The Complete Sequence of the Acacia ligulata Chloroplast Genome Reveals a Highly Divergent clpP1 Gene

    PubMed Central

    Williams, Anna V.; Boykin, Laura M.; Howell, Katharine A.; Nevill, Paul G.; Small, Ian

    2015-01-01

    Legumes are a highly diverse angiosperm family that includes many agriculturally important species. To date, 21 complete chloroplast genomes have been sequenced from legume crops confined to the Papilionoideae subfamily. Here we report the first chloroplast genome from the Mimosoideae, Acacia ligulata, and compare it to the previously sequenced legume genomes. The A. ligulata chloroplast genome is 158,724 bp in size, comprising inverted repeats of 25,925 bp and single-copy regions of 88,576 bp and 18,298 bp. Acacia ligulata lacks the inversion present in many of the Papilionoideae, but is not otherwise significantly different in terms of gene and repeat content. The key feature is its highly divergent clpP1 gene, normally considered essential in chloroplast genomes. In A. ligulata, although transcribed and spliced, it probably encodes a catalytically inactive protein. This study provides a significant resource for further genetic research into Acacia and the Mimosoideae. The divergent clpP1 gene suggests that Acacia will provide an interesting source of information on the evolution and functional diversity of the chloroplast Clp protease complex. PMID:25955637
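
    The PubMed and PubMed Central records for this paper quote different genome sizes, the PubMed figures being marked "[corrected]". A quick arithmetic check, sketched below, suggests that each set of figures is internally consistent with the usual quadripartite layout of a chloroplast genome (one large single-copy region, one small single-copy region and two copies of the inverted repeat). Treating the quoted inverted-repeat length as the length of one copy, and the larger single-copy figure as the large single-copy region, are assumptions; the function and dictionary names are hypothetical.

```python
def quadripartite_total(lsc: int, ssc: int, ir: int) -> int:
    """Genome length as LSC + SSC + two inverted-repeat copies (assumed layout)."""
    return lsc + ssc + 2 * ir

# Figures in bp, copied from the two records of this paper in the listing.
REPORTED = {
    "PubMed record (marked [corrected])": {
        "total": 174_233, "lsc": 92_798, "ssc": 4_985, "ir": 38_225,
    },
    "PubMed Central record": {
        "total": 158_724, "lsc": 88_576, "ssc": 18_298, "ir": 25_925,
    },
}

for label, v in REPORTED.items():
    computed = quadripartite_total(v["lsc"], v["ssc"], v["ir"])
    status = "matches" if computed == v["total"] else "differs from"
    print(f"{label}: computed {computed} bp {status} reported {v['total']} bp")
```

    The check only confirms that each record adds up; it says nothing about which set of figures describes the genome correctly, beyond the "[corrected]" label in the PubMed record.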

  2. A Delicate Balance Between Repair and Replication Factors Regulates Recombination Between Divergent DNA Sequences in Saccharomyces cerevisiae.

    PubMed

    Chakraborty, Ujani; George, Carolyn M; Lyndaker, Amy M; Alani, Eric

    2016-02-01

    Single-strand annealing (SSA) is an important homologous recombination mechanism that repairs DNA double strand breaks (DSBs) occurring between closely spaced repeat sequences. During SSA, the DSB is acted upon by exonucleases to reveal complementary sequences that anneal and are then repaired through tail clipping, DNA synthesis, and ligation steps. In baker's yeast, the Msh DNA mismatch recognition complex and the Sgs1 helicase act to suppress SSA between divergent sequences by binding to mismatches present in heteroduplex DNA intermediates and triggering a DNA unwinding mechanism known as heteroduplex rejection. Using baker's yeast as a model, we have identified new factors and regulatory steps in heteroduplex rejection during SSA. First we showed that Top3-Rmi1, a topoisomerase complex that interacts with Sgs1, is required for heteroduplex rejection. Second, we found that the replication processivity clamp proliferating cell nuclear antigen (PCNA) is dispensable for heteroduplex rejection, but is important for repairing mismatches formed during SSA. Third, we showed that modest overexpression of Msh6 results in a significant increase in heteroduplex rejection; this increase is due to a compromise in Msh2-Msh3 function required for the clipping of 3' tails. Thus 3' tail clipping during SSA is a critical regulatory step in the repair vs. rejection decision; rejection is favored before the 3' tails are clipped. Unexpectedly, Msh6 overexpression, through interactions with PCNA, disrupted heteroduplex rejection between divergent sequences in another recombination substrate. These observations illustrate the delicate balance that exists between repair and replication factors to optimize genome stability. PMID:26680658

  3. Pharmacological manipulation of arachidonic acid-epoxygenase results in divergent effects on renal damage.

    PubMed

    Li, Jing; Stier, Charles T; Chander, Praveen N; Manthati, Vijay L; Falck, John R; Carroll, Mairéad A

    2014-01-01

    Kidney damage is markedly accelerated by high-salt (HS) intake in stroke-prone spontaneously hypertensive rats (SHRSP). Epoxyeicosatrienoic acids (EETs) are epoxygenase products of arachidonic acid which possess vasodepressor, natriuretic, and anti-inflammatory activities. We examined whether up-regulation (clofibrate) or inhibition [N-methylsulfonyl-6-(2-propargyloxyphenyl)hexanamide (MS-PPOH)] of epoxygenase would alter systolic blood pressure (SBP) and/or renal pathology in SHRSP on HS intake (1% NaCl drinking solution). Three weeks of treatment with clofibrate induced renal cortical protein expression of CYP2C23 and increased urinary excretion of EETs compared with vehicle-treated SHRSP. SBP and urinary protein excretion (UPE) were significantly lowered with clofibrate treatment. Kidneys from vehicle-treated SHRSP, which were on HS intake for 3 weeks, demonstrated focal lesions of vascular fibrinoid degeneration, which were markedly attenuated with clofibrate treatment. In contrast, 2 weeks of treatment with the selective epoxygenase inhibitor, MS-PPOH, increased UPE without significantly altering either urinary EET levels or SBP. Kidneys from vehicle-treated SHRSP, which were on HS intake for 11 days, demonstrated occasional mild damage whereas kidneys from MS-PPOH-treated rats exhibited widespread malignant nephrosclerosis. These results suggest that pharmacological manipulation of epoxygenase results in divergent effects on renal damage and that interventions to increase EET levels may provide therapeutic strategies for treating salt-sensitive hypertension and renal damage.

  4. Predicting intrinsic disorder from amino acid sequence.

    PubMed

    Obradovic, Zoran; Peng, Kang; Vucetic, Slobodan; Radivojac, Predrag; Brown, Celeste J; Dunker, A Keith

    2003-01-01

    Blind predictions of intrinsic order and disorder were made on 42 proteins subsequently revealed to contain 9,044 ordered residues, 284 disordered residues in 26 segments of length 30 residues or less, and 281 disordered residues in 2 disordered segments of length greater than 30 residues. The accuracies of the six predictors used in this experiment ranged from 77% to 91% for the ordered regions and from 56% to 78% for the disordered segments. The average of the order and disorder predictions ranged from 73% to 77%. The prediction of disorder in the shorter segments was poor, from 25% to 66% correct, while the prediction of disorder in the longer segments was better, from 75% to 95% correct. Four of the predictors were composed of ensembles of neural networks. This enabled them to deal more efficiently with the large asymmetry in the training data through diversified sampling from the significantly larger ordered set and achieve better accuracy on ordered and long disordered regions. The exclusive use of long disordered regions for predictor training likely contributed to the disparity of the predictions on long versus short disordered regions, while averaging the output values over 61-residue windows to eliminate short predictions of order or disorder probably contributed to the even greater disparity for three of the predictors. This experiment supports the predictability of intrinsic disorder from amino acid sequence. PMID:14579347

  5. Complete Genome Sequences of Three Historically Important, Spatiotemporally Distinct, and Genetically Divergent Strains of Zika Virus: MR-766, P6-740, and PRVABC-59.

    PubMed

    Yun, Sang-Im; Song, Byung-Hak; Frank, Jordan C; Julander, Justin G; Polejaeva, Irina A; Davies, Christopher J; White, Kenneth L; Lee, Young-Min

    2016-01-01

    Here, we report the 10,807-nucleotide-long consensus RNA genome sequences of three spatiotemporally distinct and genetically divergent Zika virus strains, with the functionality of their genomic sequences substantiated by reverse genetics: MR-766 (African lineage, Uganda, 1947), P6-740 (Asian lineage, Malaysia, 1966), and PRVABC-59 (Asian lineage-derived American strain, Puerto Rico, 2015). PMID:27540058

  6. Complete Genome Sequences of Three Historically Important, Spatiotemporally Distinct, and Genetically Divergent Strains of Zika Virus: MR-766, P6-740, and PRVABC-59

    PubMed Central

    Yun, Sang-Im; Song, Byung-Hak; Frank, Jordan C.; Julander, Justin G.; Polejaeva, Irina A.; Davies, Christopher J.; White, Kenneth L.; Lee, Young-Min

    2016-01-01

    Here, we report the 10,807-nucleotide-long consensus RNA genome sequences of three spatiotemporally distinct and genetically divergent Zika virus strains, with the functionality of their genomic sequences substantiated by reverse genetics: MR-766 (African lineage, Uganda, 1947), P6-740 (Asian lineage, Malaysia, 1966), and PRVABC-59 (Asian lineage-derived American strain, Puerto Rico, 2015). PMID:27540058

  8. Sample sequencing of vascular plants demonstrates widespread conservation and divergence of microRNAs.

    PubMed

    Chávez Montes, Ricardo A; de Fátima Rosas-Cárdenas, Flor; De Paoli, Emanuele; Accerbi, Monica; Rymarquis, Linda A; Mahalingam, Gayathri; Marsch-Martínez, Nayelli; Meyers, Blake C; Green, Pamela J; de Folter, Stefan

    2014-04-23

    Small RNAs are pivotal regulators of gene expression that guide transcriptional and post-transcriptional silencing mechanisms in eukaryotes, including plants. Here we report a comprehensive atlas of sRNA and miRNA from 3 species of algae and 31 representative species across vascular plants, including non-model plants. We sequence and quantify sRNAs from 99 different tissues or treatments across species, resulting in a data set of over 132 million distinct sequences. Using miRBase mature sequences as a reference, we identify the miRNA sequences present in these libraries. We apply diverse profiling methods to examine critical sRNA and miRNA features, such as size distribution, tissue-specific regulation and sequence conservation between species, as well as to predict putative new miRNA sequences. We also develop database resources, computational analysis tools and a dedicated website, http://smallrna.udel.edu/. This study provides new insights on plant sRNAs and miRNAs, and a foundation for future studies.

  9. Sample sequencing of vascular plants demonstrates widespread conservation and divergence of microRNAs.

    PubMed

    Chávez Montes, Ricardo A; de Fátima Rosas-Cárdenas, Flor; De Paoli, Emanuele; Accerbi, Monica; Rymarquis, Linda A; Mahalingam, Gayathri; Marsch-Martínez, Nayelli; Meyers, Blake C; Green, Pamela J; de Folter, Stefan

    2014-01-01

    Small RNAs are pivotal regulators of gene expression that guide transcriptional and post-transcriptional silencing mechanisms in eukaryotes, including plants. Here we report a comprehensive atlas of sRNA and miRNA from 3 species of algae and 31 representative species across vascular plants, including non-model plants. We sequence and quantify sRNAs from 99 different tissues or treatments across species, resulting in a data set of over 132 million distinct sequences. Using miRBase mature sequences as a reference, we identify the miRNA sequences present in these libraries. We apply diverse profiling methods to examine critical sRNA and miRNA features, such as size distribution, tissue-specific regulation and sequence conservation between species, as well as to predict putative new miRNA sequences. We also develop database resources, computational analysis tools and a dedicated website, http://smallrna.udel.edu/. This study provides new insights on plant sRNAs and miRNAs, and a foundation for future studies. PMID:24759728

  10. AVP2, a sequence-divergent, K⁺-insensitive H⁺-translocating inorganic pyrophosphatase from Arabidopsis

    SciTech Connect

    Drozdowicz, Y.M.; Kissinger, J.C.; Rea, P.A.

    2000-05-01

    Plant vacuolar H⁺-translocating inorganic pyrophosphatases (V-PPases) have been considered to constitute a family of functionally and structurally monotonous intrinsic membrane proteins. Typified by AVP1 from Arabidopsis, all characterized plant V-PPases share greater than 84% sequence identity and catalyze K⁺-stimulated H⁺ translocation. Here the authors describe the molecular and biochemical characterization of AVP2, a sequence-divergent, K⁺-insensitive, Ca²⁺-hypersensitive V-PPase active in both inorganic pyrophosphate hydrolysis and H⁺ translocation. The differences between AVP2 and AVP1 provide the first indication that plant V-PPase sequences from the same organism fall into two distinct categories. Phylogenetic analyses of these and other V-PPase sequences extend this principle by showing that AVP2, rather than being an isoform of AVP1, is but one representative of a novel category of AVP2-like (type 2) V-PPases that coexist with AVP1-like (type 1) V-PPases not only in plants, but also in apicomplexan protists such as the malarial parasite Plasmodium falciparum.

  11. Correlating traits of gene retention, sequence divergence, duplicability and essentiality in vertebrates, arthropods, and fungi.

    PubMed

    Waterhouse, Robert M; Zdobnov, Evgeny M; Kriventseva, Evgenia V

    2011-01-01

    Delineating ancestral gene relations among a large set of sequenced eukaryotic genomes allowed us to rigorously examine links between evolutionary and functional traits. We classified 86% of over 1.36 million protein-coding genes from 40 vertebrates, 23 arthropods, and 32 fungi into orthologous groups and linked over 90% of them to Gene Ontology or InterPro annotations. Quantifying properties of ortholog phyletic retention, copy-number variation, and sequence conservation, we examined correlations with gene essentiality and functional traits. More than half of vertebrate, arthropod, and fungal orthologs are universally present across each lineage. These universal orthologs are preferentially distributed in groups with almost all single-copy or all multicopy genes, and sequence evolution of the predominantly single-copy orthologous groups is markedly more constrained. Essential genes from representative model organisms, Mus musculus, Drosophila melanogaster, and Saccharomyces cerevisiae, are significantly enriched in universal orthologs within each lineage, and essential-gene-containing groups consistently exhibit greater sequence conservation than those without. This study of eukaryotic gene repertoire evolution identifies shared fundamental principles and highlights lineage-specific features; it also confirms that essential genes are highly retained and conclusively supports the "knockout-rate prediction" of stronger constraints on essential gene sequence evolution. However, the distinction between sequence conservation of single- versus multicopy orthologs is quantitatively more prominent than between orthologous groups with and without essential genes. The previously underappreciated difference in the tolerance of gene duplications and contrasting evolutionary modes of "single-copy control" versus "multicopy license" may reflect a major evolutionary mechanism that allows extended exploration of gene sequence space.

  12. An rRNA variable region has an evolutionarily conserved essential role despite sequence divergence.

    PubMed Central

    Sweeney, R; Chen, L; Yao, M C

    1994-01-01

    Regions extremely variable in size and sequence occur at conserved locations in eukaryotic rRNAs. The functional importance of one such region was determined by gene reconstruction and replacement in Tetrahymena thermophila. Deletion of the D8 region of the large-subunit rRNA inactivates T. thermophila rRNA genes (rDNA): transformants containing only this type of rDNA are unable to grow. Replacement with an unrelated sequence of similar size or a variable region from a different position in the rRNA also inactivated the rDNA. Mutant rRNAs resulting from such constructs were present only in precursor forms, suggesting that these rRNAs are deficient in either processing or stabilization of the mature form. Replacement with D8 regions from three other organisms restored function, even though the sequences are very different. Thus, these D8 regions share an essential functional feature that is not reflected in their primary sequences. Similar tertiary structures may be the quality these sequences share that allows them to function interchangeably. PMID:8196658

  13. Divergent spatial regulation of duplicated fatty acid-binding protein (fabp) genes in rainbow trout (Oncorhynchus mykiss).

    PubMed

    Bayır, Mehtap; Bayır, Abdulkadir; Wright, Jonathan M

    2015-06-01

    The increased use of plant oil as a dietary supplement with the resultant high dietary lipid loads challenges the lipid transport, metabolism and storage mechanisms in economically important aquaculture species, such as rainbow trout. Fatty acid-binding proteins (Fabp), ubiquitous in tissues highly active in fatty acid metabolism, participate in lipid uptake and transport, and overall lipid homeostasis. In the present study, searches of nucleotide sequence databases identified mRNA transcripts coded by 14 different fatty acid-binding protein (fabp) genes in rainbow trout (Oncorhynchus mykiss), which include the complete minimal suite of seven distinct fabp genes (fabp1, 2, 3, 6, 7, 10 and 11) discovered thus far in teleost fishes. Phylogenetic analyses suggest that many of these extant fabp genes in rainbow trout exist as duplicates, which putatively arose owing to the teleost-specific whole genome duplication (WGD); three pairs of duplicated fabp genes (fabp2a.1/fabp2a.2, fabp7b.1/fabp7b.2 and fabp10a.1/fabp10a.2) most likely were generated by the salmonid-specific WGD subsequent to the teleost-specific WGD; and fabp3 and fabp6 exist as single copy genes in the rainbow trout genome. Assay of the steady-state levels of fabp gene transcripts by RT-qPCR revealed: (1) steady-state transcript levels differ substantially between fabp genes and, in some instances, by as much as 30×10^4-fold; (2) some fabp transcripts are widely distributed in many tissues, whereas others are restricted to one or a few tissues; and (3) divergence of regulatory mechanisms that control spatial transcription of duplicated fabp genes in rainbow trout appears related to length of time since their duplication. The suite of fabp genes described here provides the foundation to investigate the role(s) of fatty acid-binding proteins in the uptake, mobilization and storage of fatty acids in cultured fish fed diets differing in lipid content, especially the use of plant oil as a dietary supplement.

  14. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  15. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  16. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the...

  17. Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing.

    PubMed

    Yang, Hua; Wei, Chao-Ling; Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun

    2016-01-01

    Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inference and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putative artificial selection in tea plants.

  18. Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing

    PubMed Central

    Yang, Hua; Wei, Chao-Ling; Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun

    2016-01-01

    Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inference and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putative artificial selection in tea plants. PMID:26962860

  19. Multi-locus DNA sequencing of Toxoplasma gondii isolated from Brazilian pigs identifies genetically divergent strains

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Five Toxoplasma gondii isolates (TgPgBr1-5) were isolated from hearts and brains of pigs freshly purchased at the market of Campos dos Goytacazes, Northern Rio de Janeiro State, Brazil. Four of the five isolates were highly pathogenic in mice. Four genotypes were identified. Multi-locus DNA sequenci...

  20. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  1. An update of DIVERGE software for functional divergence analysis of protein family.

    PubMed

    Gu, Xun; Zou, Yangyun; Su, Zhixi; Huang, Wei; Zhou, Zhan; Arendsee, Zebulun; Zeng, Yanwu

    2013-07-01

    DIVERGE is a software system for phylogeny-based analyses of protein family evolution and functional divergence. It provides a suite of statistical tools for selection and prioritization of the amino acid sites that are responsible for the functional divergence of a gene family. The synergistic efforts of DIVERGE and other methods have convincingly demonstrated that the pattern of rate change at a particular amino acid site may contain insightful information about the underlying functional divergence following gene duplication. These predicted sites may be used as candidates for further experiments. We are now releasing an updated version of DIVERGE with the following improvements: 1) a feasible approach to examining functional divergence in nearly complete sequences by including deletions and insertions (indels); 2) the calculation of the false discovery rate of functionally diverging sites; 3) estimation of the effective number of functional divergence-related sites that is reliable and insensitive to cutoffs; 4) a statistical test for asymmetric functional divergence; and 5) a new method to infer functional divergence specific to a given duplicate cluster. In addition, we have made efforts to improve software design and produce a well-written software manual for the general user.

  2. Multiple nucleic acid cleavage modes in divergent type III CRISPR systems

    PubMed Central

    Zhang, Jing; Graham, Shirley; Tello, Agnes; Liu, Huanting; White, Malcolm F.

    2016-01-01

    CRISPR-Cas is an RNA-guided adaptive immune system that protects bacteria and archaea from invading nucleic acids. Type III systems (Cmr, Csm) have been shown to cleave RNA targets in vitro and some are capable of transcription-dependent DNA targeting. The crenarchaeon Sulfolobus solfataricus has two divergent subtypes of the type III system (Sso-IIID and a Cmr7-containing variant of Sso-IIIB). Here, we report that both the Sso-IIID and Sso-IIIB complexes cleave cognate RNA targets with a ruler mechanism and 6 or 12 nt spacing that relates to the organization of the Cas7 backbone. This backbone-mediated cleavage activity thus appears universal for the type III systems. The Sso-IIIB complex is also known to possess a distinct ‘UA’ cleavage mode. The predominant activity observed in vitro depends on the relative molar concentration of protein and target RNA. The Sso-IIID complex can cleave plasmid DNA targets in vitro, generating linear DNA products with an activity that is dependent on both the cyclase and HD nuclease domains of the Cas10 subunit, suggesting a role for both nuclease active sites in the degradation of double-stranded DNA targets. PMID:26801642

  3. Abscisic Acid Mediates a Divergence in the Drought Response of Two Conifers

    PubMed Central

    Brodribb, Timothy J.; McAdam, Scott A.M.

    2013-01-01

    During water stress, stomatal closure occurs as water tension and levels of abscisic acid (ABA) increase in the leaf, but the interaction between these two drivers of stomatal aperture is poorly understood. We investigate the dynamics of water potential, ABA, and stomatal conductance during the imposition of water stress on two drought-tolerant conifer species with contrasting stomatal behavior. Rapid rehydration of excised shoots was used as a means of differentiating the direct influences of ABA and water potential on stomatal closure. Pinus radiata (Pinaceae) was found to exhibit ABA-driven stomatal closure during water stress, resulting in strongly isohydric regulation of water loss. By contrast, stomatal closure in Callitris rhomboidea (Cupressaceae) was initiated by elevated foliar ABA, but sustained water stress saw a marked decline in ABA levels and a shift to water potential-driven stomatal closure. The transition from ABA to water potential as the primary driver of stomatal aperture allowed C. rhomboidea to rapidly recover gas exchange after water-stressed plants were rewatered, and was associated with a strongly anisohydric regulation of water loss. These two contrasting mechanisms of stomatal regulation function in combination with xylem vulnerability to produce highly divergent strategies of water management. Species-specific ABA dynamics are proposed as a central component of drought survival and ecology. PMID:23709665

  4. Shallow and deep earthquake sequences captured in the North Tanzanian Divergence, East Africa: Inferences on seismogenic processes and rheology

    NASA Astrophysics Data System (ADS)

    Albaric, J.; Perrot, J.; Déverchère, J.; Deschamps, A.; Ferdinand, R. W.; Le Gall, B.

    2009-12-01

    Using a temporary local seismic network of 35 stations deployed in North Tanzania (SEISMOTANZ'07 experiment) for 6 months in 2007, we captured two earthquake sequences (Gelai and Manyara) occurring respectively in the southern end of the Kenya rift and in the North Tanzanian Divergence (NTD). Neither sequence depicts typical swarm or mainshock-aftershock patterns. Although only ~150 km apart, their triggering mechanisms appear to be different. They highlight a major change in the magmatic/tectonic nature of the rift where the eastern branch of the East African Rift enters the Tanzanian craton. Both have a similar shape and long axis, emphasizing the preferred locus of active strain release along NE-SW discontinuities which probably root at depth into steep Proterozoic shear zones. At Gelai, the deformation is dominated by aseismic processes involving slow slip on a normal fault and dyke intrusion within the upper crust, and an interaction with the eruption of the nearby Oldoinyo Lengai volcano. At Manyara, the sequence reveals a long-lasting seismic activity deeply rooted (~20-35 km depth), possibly indicative of stress loading transmitted laterally. Focal solutions demonstrate a mixture of normal and strike-slip faulting on sub-vertical inherited structures striking N60°E. The yield stress envelope modelled from the depth frequency distribution of earthquakes in Manyara is consistent with the presence of a mafic lower crust and further supports the strength increase of the rifted crust from south Kenya to the NTD.

  5. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  6. Intramuscular fat and fatty acid composition of longissimus muscle from divergent pure breeds of cattle.

    PubMed

    Dinh, T T N; Blanton, J R; Riley, D G; Chase, C C; Coleman, S W; Phillips, W A; Brooks, J C; Miller, M F; Thompson, L D

    2010-02-01

    The objective of this study was to compare the fatty acid (FA) composition of intramuscular fat from the LM of 3 divergent breeds of cattle: Angus (AN, n = 9), Brahman (BR, n = 7), and Romosinuano (RM, n = 11). Cattle were blocked by breed and finished 129 d before slaughter in one year and 157 d in the next year. Longissimus muscle samples were collected from each carcass between the 10th and 13th ribs, trimmed of external fat, frozen in liquid nitrogen, homogenized, and used for fat extraction, using a modified Folch procedure. Extracted fat was analyzed for FA by using a GLC system with an HP-88 capillary column. Fatty acid composition was expressed using both a normalized percentage (%) and gravimetric calculation (mg/g of fresh muscle tissue) in relation to degree of saturation, which was determined using a saturation index (ratio of total SFA to total unsaturated FA). Crude fat determination revealed that LM from AN purebred cattle had the greatest amount of intramuscular fat (7.08%; P = 0.001). Although intramuscular fat of LM from RM contained a reduced percentage of total SFA (P = 0.002) compared with AN, it had the greatest percentage of total PUFA (P < 0.001 and P = 0.020). The percentages of total MUFA were similar among the 3 breeds (P = 0.675). The gravimetric calculation, a measure of actual FA concentration, showed significantly greater concentrations of SFA (26.67 mg/g), MUFA (26.50 mg/g), and PUFA (2.37 mg/g) in LM from AN cattle, as compared with LM from BR and RM cattle (P < 0.001). Interestingly, BR purebreds had the least PUFA concentration (1.49 mg/g; P acid composition was characterized by palmitic and oleic acids being the most
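
    The saturation index defined in this abstract (ratio of total SFA to total unsaturated FA) can be written out as a short calculation. The sketch below is illustrative only: the function name is hypothetical, total unsaturated FA is assumed to be MUFA plus PUFA, and the example inputs are the gravimetric concentrations reported above for Angus, not index values taken from the paper.

```python
def saturation_index(sfa, mufa, pufa):
    """Ratio of total saturated FA to total unsaturated FA.

    Assumption: total unsaturated FA = MUFA + PUFA, all in the same units.
    """
    unsaturated = mufa + pufa
    if unsaturated <= 0:
        raise ValueError("total unsaturated FA must be positive")
    return sfa / unsaturated


# Angus concentrations from the abstract, in mg/g of fresh muscle tissue.
angus_index = saturation_index(sfa=26.67, mufa=26.50, pufa=2.37)
print(f"Angus saturation index (gravimetric basis): {angus_index:.3f}")
```

    The same function applies to the normalized percentages, since the ratio is unit-free as long as all three inputs share one basis.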

  8. Divergence of human α-chain constant region gene sequences: A novel recombinant α2 gene

    SciTech Connect

    Chintalacharuvu, K. R.; Morrison, S.L.; Raines, M.

    1994-06-01

    IgA is the major Ig synthesized in humans and provides the first line of defense at the mucosal surfaces. The constant region of IgA heavy chain is encoded by the α gene on chromosome 14. Previous studies have indicated the presence of two α genes, α1 and α2, the latter existing in two allotypic forms, α2 m(1) and α2 m(2). Here the authors report the cloning and complete nucleotide sequence determination of a novel human α gene. Nucleotide sequence comparison with the published α sequences suggests that the gene arose as a consequence of recombination or gene conversion between the two α2 alleles. The authors have expressed the gene as a chimeric protein in myeloma cells, indicating that it encodes a functional protein. The novel IgA resembles IgA2 m(2) in that disulfide bonds link H and L chains. This novel recombinant gene provides insights into the mechanisms of generation of different constant regions and suggests that within human populations, multiple alleles of α may be present, providing IgAs of different structures.

  9. Mitochondrial DNA sequence divergence in the melanogaster and oriental species subgroups of Drosophila.

    PubMed

    Nigro, L; Solignac, M; Sharp, P M

    1991-08-01

    The nucleotide sequence of a segment of the mitochondrial DNA from three Drosophila species (D. erecta, D. eugracilis, and D. takahashii), belonging to different subgroups of the melanogaster group, has been determined. The segment encompasses three complete tRNA genes (tRNAtrp, tRNAcys, and tRNAtyr) and portions of two protein-coding genes: subunit 2 of the NADH dehydrogenase (ND2) and subunit 1 of the cytochrome oxidase (COI). Comparisons also involve homologous sequences already known for four other Drosophila species of the melanogaster group. Length differences were confined to the intergenic region, where a long stretch of AT repeats was observed in one of the species analyzed. The three tRNA genes exhibit very different evolutionary rates; the most slowly evolving one, tRNAtyr, is adjacent to the 5' end of COI, and tRNAs in similar positions have previously been shown to evolve slowly because they are probably involved in transcript processing. Although the rate of synonymous substitutions was very similar between the ND2 and COI genes, there were strong discrepancies between them in the number of nonsynonymous substitutions. Differences have also been found in the G + C content of the genes, which are likely to be linked to different selective pressures. There is a reduction in G + C content in the region where selective constraints are reduced. This suggests the existence of different levels of constraints along the sequenced segment. An overall analysis of the types of substitutions showed a decrease in A + T content during the course of evolution of the species.

  10. Contrasted seismogenic and rheological behaviours from shallow and deep earthquake sequences in the North Tanzanian Divergence, East Africa

    NASA Astrophysics Data System (ADS)

    Albaric, J.; Perrot, J.; Déverchère, J.; Deschamps, A.; Le Gall, B.; Ferdinand, R. W.; Petit, C.; Tiberi, C.; Sue, C.; Songo, M.

    2010-12-01

    We report preliminary results of a seismological experiment, SEISMO-TANZ'07, which consisted of the deployment of a local network (35 stations) in the East African Rift System (EARS), North Tanzania, for 6 months in 2007. We compare two earthquake sequences (Gelai and Manyara) occurring, respectively, in the southern end of the Kenya rift and in the North Tanzanian Divergence (NTD). Although the sequences are only ~150 km apart, their triggering mechanisms are different. Neither sequence shows a typical swarm or mainshock-aftershock pattern. They highlight the change in the magmatic/tectonic nature of the rift where the eastern branch of the EARS enters the Tanzanian craton. The similar shape and long axis of the elongate sequences emphasize the preferred locus of active strain release along NE-SW discontinuities, which probably root at depth into steep Proterozoic shear zones. At Gelai, the deformation is dominated by aseismic processes involving slow slip on a normal fault and dyke intrusion within the upper crust (Calais et al., 2008). The spatial and temporal earthquake distribution indicates a possible correlation between the Gelai crisis and the eruption of the nearby Oldoinyo Lengai volcano. At Manyara, the sequence is more unusual, revealing long-lasting, deeply rooted seismic activity (~20-35 km depth) possibly related to stress loading transmitted laterally. The yield strength envelope modelled from the depth frequency distribution of earthquakes in the NTD is consistent with the presence of a mafic lower crust and further supports the strength increase of the rifted crust from south Kenya to the NTD.

  11. Meiotic recombination at the Lmp2 hotspot tolerates minor sequence divergence between homologous chromosomes

    SciTech Connect

    Yoshino, Masayasu; Sagai, Tomoko; Shiroishi, Toshihiko

    1996-06-01

    Recombination is widely considered to depend linearly on the length of the homologous sequences. An 11% mismatch decreases the rate of phage-plasmid recombination 240-fold. Two single nucleotide mismatches, which reduce the longest uninterrupted stretch of similarity from 232 base pairs (bp) to 134 bp, reduce gene conversion in mouse L cells 20-fold. The efficiency of gene targeting through homologous recombination in mouse embryonic stem cells can be increased by using an isogenic, rather than a non-isogenic, DNA construct. In this study we asked whether a high degree of sequence identity between homologous mouse chromosomes enhances meiotic recombination at a hotspot. Sites of meiotic recombination in the mouse major histocompatibility complex (MHC) class II region are not randomly distributed but are almost all clustered within short segments known as recombinational hotspots. The wm7 MHC haplotype, derived from Japanese wild mice Mus musculus molossinus, enhances meiotic recombination at a hotspot near the Lmp2 gene. Heterozygotes between the wm7 haplotype and the b or k haplotypes have yielded a high frequency of recombination (2.1%) in a 1.3 kilobase (kb) segment of this hotspot. 20 refs., 2 figs.

  12. Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63.

    PubMed

    Zhang, Jianwei; Chen, Ling-Ling; Xing, Feng; Kudrna, David A; Yao, Wen; Copetti, Dario; Mu, Ting; Li, Weiming; Song, Jia-Ming; Xie, Weibo; Lee, Seunghee; Talag, Jayson; Shao, Lin; An, Yue; Zhang, Chun-Liu; Ouyang, Yidan; Sun, Shuai; Jiao, Wen-Biao; Lv, Fang; Du, Bogu; Luo, Meizhong; Maldonado, Carlos Ernesto; Goicoechea, Jose Luis; Xiong, Lizhong; Wu, Changyin; Xing, Yongzhong; Zhou, Dao-Xiu; Yu, Sibin; Zhao, Yu; Wang, Gongwei; Yu, Yeisoo; Luo, Yijie; Zhou, Zhi-Wei; Hurtado, Beatriz Elena Padilla; Danowitz, Ann; Wing, Rod A; Zhang, Qifa

    2016-08-30

    Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059-2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement. PMID:27535938

  14. Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63

    PubMed Central

    Zhang, Jianwei; Chen, Ling-Ling; Xing, Feng; Kudrna, David A.; Yao, Wen; Copetti, Dario; Mu, Ting; Li, Weiming; Song, Jia-Ming; Lee, Seunghee; Talag, Jayson; Shao, Lin; An, Yue; Zhang, Chun-Liu; Ouyang, Yidan; Sun, Shuai; Jiao, Wen-Biao; Lv, Fang; Du, Bogu; Luo, Meizhong; Maldonado, Carlos Ernesto; Goicoechea, Jose Luis; Xiong, Lizhong; Wu, Changyin; Xing, Yongzhong; Zhou, Dao-Xiu; Yu, Sibin; Zhao, Yu; Wang, Gongwei; Yu, Yeisoo; Luo, Yijie; Zhou, Zhi-Wei; Hurtado, Beatriz Elena Padilla; Danowitz, Ann; Wing, Rod A.; Zhang, Qifa

    2016-01-01

    Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059–2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement. PMID:27535938

  15. Purification, sequencing and evaluation of a divergent phytase from Penicillium oxalicum KCTC6440.

    PubMed

    Kim, Bong-Hyun; Lee, Ji Yeon; Lee, Peter C W

    2015-01-01

    A phytase (PhyA) was purified to homogeneity from Penicillium oxalicum KCTC6440, a fungal strain producing high levels of the enzyme. The molecular mass of the purified PhyA was 65 kDa and optimal activity occurred at 55°C. The enzyme was stable in a pH range of 4.5-6.5, with optimal performance at pH 5.5. The Km value for the substrate sodium phytate was 0.48 mM with a Vmax of 672 U/mg. The enzyme was inhibited by Ca(2+), Cu(2+), and Zn(2+), and slightly enhanced by EDTA. The PhyA efficiently released phosphate from feedstuffs such as soybean, rice bran and corn meal. The PhyA gene was cloned in two steps, by degenerate PCR and inverse PCR, and found to comprise 1501 bp and encode 461 amino acid residues. The enzyme differs by only 13 amino acids from the known PhyA of other Penicillium sp. but has distinct enzyme characteristics. Computational analysis showed that PhyA possessed more positively charged residues in the active sites compared to other PhyA molecules, which may explain the broader pH spectrum. PMID:26377131
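    The kinetic constants quoted in this record (Km = 0.48 mM, Vmax = 672 U/mg) can be turned into predicted rates with a minimal sketch, assuming the enzyme follows simple Michaelis-Menten kinetics. The abstract reports only the two constants; the function name and the substrate concentrations below are illustrative assumptions.

```python
KM_MM = 0.48        # Km for sodium phytate, in mM (from the abstract)
VMAX_U_MG = 672.0   # Vmax, in U/mg (from the abstract)


def michaelis_menten_rate(substrate_mm: float,
                          km: float = KM_MM,
                          vmax: float = VMAX_U_MG) -> float:
    """Initial rate v = Vmax * [S] / (Km + [S]), in the units of Vmax."""
    if substrate_mm < 0:
        raise ValueError("substrate concentration cannot be negative")
    return vmax * substrate_mm / (km + substrate_mm)


# Hypothetical substrate concentrations (mM), chosen only for illustration.
for s in (0.1, 0.48, 2.0, 10.0):
    print(f"[S] = {s:5.2f} mM -> v = {michaelis_menten_rate(s):6.1f} U/mg")
```

    At [S] = Km the predicted rate is half of Vmax, which is a quick sanity check on the constants.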

  16. From Artificial Amino Acids to Sequence-Defined Targeted Oligoaminoamides.

    PubMed

    Morys, Stephan; Wagner, Ernst; Lächelt, Ulrich

    2016-01-01

    Artificial oligoamino acids with appropriate protecting groups can be used for the sequential assembly of oligoaminoamides on solid-phase. With the help of these oligoamino acids multifunctional nucleic acid (NA) carriers can be designed and produced in highly defined topologies. Here we describe the synthesis of the artificial oligoamino acid Fmoc-Stp(Boc3)-OH, the subsequent assembly into sequence-defined oligomers and the formulation of tumor-targeted plasmid DNA (pDNA) polyplexes. PMID:27436323

  17. Population histories of right whales (Cetacea: Eubalaena) inferred from mitochondrial sequence diversities and divergences of their whale lice (Amphipoda: Cyamus).

    PubMed

    Kaliszewska, Zofia A; Seger, Jon; Rowntree, Victoria J; Barco, Susan G; Benegas, Rafael; Best, Peter B; Brown, Moira W; Brownell, Robert L; Carribero, Alejandro; Harcourt, Robert; Knowlton, Amy R; Marshall-Tilas, Kim; Patenaude, Nathalie J; Rivarola, Mariana; Schaeff, Catherine M; Sironi, Mariano; Smith, Wendy A; Yamada, Tadasu K

    2005-10-01

    Right whales carry large populations of three 'whale lice' (Cyamus ovalis, Cyamus gracilis, Cyamus erraticus) that have no other hosts. We used sequence variation in the mitochondrial COI gene to ask (i) whether cyamid population structures might reveal associations among right whale individuals and subpopulations, (ii) whether the divergences of the three nominally conspecific cyamid species on North Atlantic, North Pacific, and southern right whales (Eubalaena glacialis, Eubalaena japonica, Eubalaena australis) might indicate their times of separation, and (iii) whether the shapes of cyamid gene trees might contain information about changes in the population sizes of right whales. We found high levels of nucleotide diversity but almost no population structure within oceans, indicating large effective population sizes and high rates of transfer between whales and subpopulations. North Atlantic and Southern Ocean populations of all three species are reciprocally monophyletic, and North Pacific C. erraticus is well separated from North Atlantic and southern C. erraticus. Mitochondrial clock calibrations suggest that these divergences occurred around 6 million years ago (Ma), and that the Eubalaena mitochondrial clock is very slow. North Pacific C. ovalis forms a clade inside the southern C. ovalis gene tree, implying that at least one right whale has crossed the equator in the Pacific Ocean within the last 1-2 million years (Myr). Low-frequency polymorphisms are more common than expected under neutrality for populations of constant size, but there is no obvious signal of rapid, interspecifically congruent expansion of the kind that would be expected if North Atlantic or southern right whales had experienced a prolonged population bottleneck within the last 0.5 Myr.

  18. RNA Deep Sequencing Reveals Novel Candidate Genes and Polymorphisms in Boar Testis and Liver Tissues with Divergent Androstenone Levels

    PubMed Central

    Gunawan, Asep; Sahadevan, Sudeep; Neuhoff, Christiane; Große-Brinkhaus, Christine; Gad, Ahmed; Frieden, Luc; Tesfaye, Dawit; Tholen, Ernst; Looft, Christian; Uddin, Muhammad Jasim; Schellander, Karl; Cinar, Mehmet Ulas

    2013-01-01

    Boar taint is an unpleasant smell and taste of pork meat derived from some entire male pigs. The main causes of boar taint are the two compounds androstenone (5α-androst-16-en-3-one) and skatole (3-methylindole). It is crucial to understand the genetic mechanism of boar taint to select pigs for lower androstenone levels and thus reduce boar taint. The aim of the present study was to investigate transcriptome differences in boar testis and liver tissues with divergent androstenone levels using RNA deep sequencing (RNA-Seq). The total number of reads produced for each testis and liver sample ranged from 13,221,550 to 33,206,723 and 12,755,487 to 46,050,468, respectively. In testis samples 46 genes were differentially regulated whereas 25 genes showed differential expression in the liver. The fold change values ranged from −4.68 to 2.90 in testis samples and −2.86 to 3.89 in liver samples. Differentially regulated genes in high androstenone testis and liver samples were enriched in metabolic processes such as lipid metabolism, small molecule biochemistry and molecular transport. This study provides evidence for transcriptome profiles and gene polymorphisms of boars with divergent androstenone levels using RNA-Seq technology. Digital gene expression analysis identified candidate genes in the flavin monooxygenase, cytochrome P450 and hydroxysteroid dehydrogenase families. Moreover, polymorphism and association analysis revealed that mutations in the IRG6, MX1, IFIT2, CYP7A1, FMO5 and KRT18 genes could be potential candidate markers for androstenone levels in boars. Further studies are required to confirm the role of the candidate genes before they are used in genomic selection against boar taint in pig breeding programs. PMID:23696805

  19. Decrease of Population Divergence in Eurasian Perch (Perca fluviatilis) in Browning Waters: Role of Fatty Acids and Foraging Efficiency

    PubMed Central

    Scharnweber, Kristin; Strandberg, Ursula; Karlsson, Konrad; Eklöv, Peter

    2016-01-01

    Due to altered biogeochemical processes related to climate change, highly colored dissolved organic carbon (DOC) from terrestrial sources will lead to a water “brownification” in many freshwater systems of the Northern Hemisphere. This will create deteriorated visual conditions that have been found to affect habitat-specific morphological variations in Eurasian perch (Perca fluviatilis) in a previous study. So far, potential drivers and ultimate causes of these findings have not been identified. We conducted a field study to investigate the connection between morphological divergence and polyunsaturated fatty acid (PUFA) composition of perch from six lakes across a gradient of DOC concentration. We expected a decrease in the prevalence of PUFAs, which are important for perch growth and divergence with increasing DOC concentrations, due to the restructuring effects of DOC on aquatic food webs. In general, rate of morphological divergence in perch decreased with increasing DOC concentrations. Proportions of specific PUFAs (22:6n-3, 18:3n-3, 20:5n-3, and 20:4n-6) identified to primarily contribute to overall differences between perch caught in clear and brown-water lakes tended to be connected to overall decline of morphological divergence. However, no overall significant relationship was found, indicating no severe limitation of essential fatty acids for perch inhabiting brown water lakes. We further broaden our approach by conducting a laboratory experiment on foraging efficiency of perch. Therefore, we induced pelagic and littoral phenotypes by differences in habitat-structure and feeding mode and recorded attack rate in a feeding experiment. Generally, fish were less efficient in foraging on littoral prey (Ephemeroptera) when visual conditions were degraded by brown water color. We concluded that browning water may have a strong effect on the forager’s ability to find particular food resources, resulting in the reduced development of evolutionary traits

  20. Decrease of Population Divergence in Eurasian Perch (Perca fluviatilis) in Browning Waters: Role of Fatty Acids and Foraging Efficiency.

    PubMed

    Scharnweber, Kristin; Strandberg, Ursula; Karlsson, Konrad; Eklöv, Peter

    2016-01-01

    Due to altered biogeochemical processes related to climate change, highly colored dissolved organic carbon (DOC) from terrestrial sources will lead to a water "brownification" in many freshwater systems of the Northern Hemisphere. This will create deteriorated visual conditions that have been found to affect habitat-specific morphological variations in Eurasian perch (Perca fluviatilis) in a previous study. So far, potential drivers and ultimate causes of these findings have not been identified. We conducted a field study to investigate the connection between morphological divergence and polyunsaturated fatty acid (PUFA) composition of perch from six lakes across a gradient of DOC concentration. We expected a decrease in the prevalence of PUFAs, which are important for perch growth and divergence with increasing DOC concentrations, due to the restructuring effects of DOC on aquatic food webs. In general, rate of morphological divergence in perch decreased with increasing DOC concentrations. Proportions of specific PUFAs (22:6n-3, 18:3n-3, 20:5n-3, and 20:4n-6) identified to primarily contribute to overall differences between perch caught in clear and brown-water lakes tended to be connected to overall decline of morphological divergence. However, no overall significant relationship was found, indicating no severe limitation of essential fatty acids for perch inhabiting brown water lakes. We further broaden our approach by conducting a laboratory experiment on foraging efficiency of perch. Therefore, we induced pelagic and littoral phenotypes by differences in habitat-structure and feeding mode and recorded attack rate in a feeding experiment. Generally, fish were less efficient in foraging on littoral prey (Ephemeroptera) when visual conditions were degraded by brown water color. We concluded that browning water may have a strong effect on the forager's ability to find particular food resources, resulting in the reduced development of evolutionary traits, such as

  2. Sequence divergence of microsatellites for phylogeographic assessment of Moroccan Medicago species.

    PubMed

    Zitouna, N; Marghali, S; Gharbi, M; Haddioui, A; Trifi-Farah, N

    2014-01-01

    Six Medicago species were investigated to characterize and valorize plant genetic resources of pastoral interest in Morocco. Samples were obtained from the core collection of the South Australian Research and Development Institute (SARDI). The transferability of simple sequence repeat (SSR) markers of Medicago truncatula was successful with 97.6% efficiency across the five species. A total of 283 alleles and 243 genotypes were generated using seven SSR markers, confirming the high level of polymorphism that is characteristic of the Medicago genus, despite a heterozygosity deficit (HO = 0.378; HE = 0.705). In addition, a high level of gene flow was revealed among the species analyzed with significant intra-specific variation. The unweighted pair group method with arithmetic mean dendrogram generated by the dissimilarity matrix revealed that M. polymorpha and M. orbicularis are closely related, and that M. truncatula is likely the ancestral species. The Pearson correlation index revealed no significant correlations between the geographic distribution of the Moroccan species and genetic similarities, indicating local adaptation of these species to different ecological environments independent of their topographical proximities. The substantial genetic variation observed was likely due to the predominance of selfing species, the relative proximity of prospected sites, human impacts, and the nature of the SARDI core collections, which are selected for their high genetic diversity. The results of this first report on Moroccan Medicago species will be of great interest for establishing strategies aiming at reasonable management and selection programs for local and Mediterranean germplasm in the face of increasing environmental change.
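    The heterozygosity deficit quoted in this record (HO = 0.378; HE = 0.705) can be expressed as a single number with the standard fixation index F = 1 - HO/HE. The abstract does not report F itself, so the sketch below is an illustrative calculation under that textbook definition, and the function name is hypothetical.

```python
def fixation_index(h_obs: float, h_exp: float) -> float:
    """Fixation index F = 1 - HO/HE; positive values indicate a heterozygote deficit."""
    if h_exp <= 0:
        raise ValueError("expected heterozygosity must be positive")
    return 1.0 - h_obs / h_exp


# Observed and expected heterozygosity as quoted in the abstract.
f_value = fixation_index(h_obs=0.378, h_exp=0.705)
print(f"Illustrative fixation index F = {f_value:.3f}")
```

    A clearly positive F is what one would expect for predominantly selfing species, which matches the abstract's own explanation of the deficit.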

  3. Amino acid sequences of lower vertebrate parvalbumins and their evolution: parvalbumins of boa, turtle, and salamander.

    PubMed

    Maeda, N; Zhu, D X; Fitch, W M

    1984-11-01

    One major parvalbumin each was isolated from the skeletal muscle of two reptiles, a boa snake, Boa constrictor, and a map turtle, Graptemys geographica, while two parvalbumins were isolated from an amphibian, the salamander Amphiuma means. The amino acid sequences of all four parvalbumins were determined from the sequences of their tryptic peptides, which were ordered partially by homology to other parvalbumins. Phylogenetic study of these and 16 other parvalbumin sequences revealed that the turtle parvalbumin belongs to the beta lineage, while the salamander sequences belong, one each, to the alpha and beta lineages defined by Goodman and Pechère (1977). Boa parvalbumin, however, while belonging to the beta lineage, clusters within the fish in all reasonably parsimonious trees. The most parsimonious trees show many parallel or back mutations in the evolution of many parvalbumin residues, although the residues responsible for Ca2+ binding are very well conserved. These most parsimonious trees show an actinopterygian rather than a crossopterygian origin of the tetrapods in both the alpha and beta groups. One of two electric eel parvalbumins is evolving more than 10 times faster than its paralogous partner, suggesting it may be on its way to becoming a pseudogene. It is concluded that varying rates of amino acid replacement, much homoplasy, considerable gene duplication, plus complicated lineages make the set of parvalbumin sequences unsuitable for systematic study of the origin of the tetrapods and other higher-taxa divergence, although it may be suitable within a genus or family.

  4. Segments of amino acid sequence similarity in beta-amylases.

    PubMed

    Friedberg, F; Rhodes, C

    1988-01-01

    In alpha-amylases from animals, plants and bacteria and in beta-amylases from plants and bacteria, a number of segments exhibit amino acid sequence similarity specific to the alpha or to the beta type, respectively. In the case of the beta-amylases the similar sequence regions are extensive and they are disrupted only by short interspersed dissimilar regions. Close to the C terminus, however, no such sequence similarity exists. PMID:2464171

  5. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon.

    PubMed Central

    Yu, J H; Eng, J; Yalow, R S

    1990-01-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled pork insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report we describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. We demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in our immunoassay system is only a few percent of that of human insulin. Squirrel monkey glucagon is identical with the usual glucagon found in Old World mammals, which predicts that the glucagons of other New World monkeys would not differ from the usual Old World mammalian glucagon. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species. PMID:2263627

  6. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua; Eng, J.; Yalow, R.S. (City Univ. of New York, NY)

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled pork insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  7. A highly divergent 33 kDa Cryptosporidium parvum antigen

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Previous studies comparing the genome sequences of Cryptosporidium parvum with C. hominis identified a number of highly divergent genes that might reflect positive selection for host specificity. In the present study, a C. parvum sequence, namely cgd8-5370, whose amino acid sequence differs appreci...

  8. Comparative analysis between homoeologous genome segments of Brassica napus and its progenitor species reveals extensive sequence-level divergence.

    PubMed

    Cheung, Foo; Trick, Martin; Drou, Nizar; Lim, Yong Pyo; Park, Jee-Young; Kwon, Soo-Jin; Kim, Jin-A; Scott, Rod; Pires, J Chris; Paterson, Andrew H; Town, Chris; Bancroft, Ian

    2009-07-01

    Homoeologous regions of Brassica genomes were analyzed at the sequence level. These represent segments of the Brassica A genome as found in Brassica rapa and Brassica napus and the corresponding segments of the Brassica C genome as found in Brassica oleracea and B. napus. Analysis of synonymous base substitution rates within modeled genes revealed a relatively broad range of times (0.12 to 1.37 million years ago) since the divergence of orthologous genome segments as represented in B. napus and the diploid species. Similar, and consistent, ranges were also identified for single nucleotide polymorphism and insertion-deletion variation. Genes conserved across the Brassica genomes and the homoeologous segments of the genome of Arabidopsis thaliana showed almost perfect collinearity. Numerous examples of apparent transduplication of gene fragments, as previously reported in B. oleracea, were observed in B. rapa and B. napus, indicating that this phenomenon is widespread in Brassica species. In the majority of the regions studied, the C genome segments were expanded in size relative to their A genome counterparts. The considerable variation that we observed, even between the different versions of the same Brassica genome, for gene fragments and annotated putative genes suggests that the concept of the pan-genome might be particularly appropriate when considering Brassica genomes.
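
    As a rough illustration of how a synonymous substitution distance can be turned into a divergence time, the sketch below applies the standard molecular-clock relation T = Ks / (2r). The function name and the rate value are assumptions made for this example; the abstract does not state which rate or estimator the authors used, so the numbers only show the arithmetic.

```python
def divergence_time_mya(ks: float, rate_per_site_per_year: float) -> float:
    """Return divergence time in millions of years for a synonymous distance Ks.

    Uses T = Ks / (2 * r): both lineages accumulate substitutions
    independently after the split, hence the factor of 2.
    """
    if ks < 0:
        raise ValueError("Ks must be non-negative")
    if rate_per_site_per_year <= 0:
        raise ValueError("rate must be positive")
    years = ks / (2.0 * rate_per_site_per_year)
    return years / 1e6


# ASSUMED_RATE is a placeholder chosen for illustration only
# (substitutions per synonymous site per year); it is not taken from the abstract.
ASSUMED_RATE = 1.5e-8

if __name__ == "__main__":
    for ks in (0.0036, 0.0411):
        t = divergence_time_mya(ks, ASSUMED_RATE)
        print(f"Ks = {ks:.4f} -> about {t:.2f} million years")
```

    Under the assumed rate, Ks values of roughly 0.004 and 0.04 would bracket the 0.12 to 1.37 million year range quoted above; a different rate would shift these values proportionally.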

  9. Evolution of Lycopodiaceae (Lycopsida): estimating divergence times from rbcL gene sequences by use of nonparametric rate smoothing.

    PubMed

    Wikström, N; Kenrick, P

    2001-05-01

    By use of nonparametric rate smoothing and nucleotide sequences of the rbcL gene, divergence times in Lycopodiaceae are estimated. The results show that much extant species diversity in Lycopodiaceae stems from relatively recent cladogenic events. These results corroborate previous ideas based on paleobotanical and biogeographical data. Previous molecular phylogenetic analyses recognized a split into neotropical and paleotropical clades in Huperzia, which contains 85-90% of all living species. Connecting this biogeographical pattern with continent movements, the diversification of this epiphytic group was suggested to coincide with that of angiosperms in the mid to Late Cretaceous. Results presented here are consistent with this idea, and the diversification of the two clades is resolved as Late Cretaceous (78 and 95 Myr). In the related genera Lycopodium and Lycopodiella, the patterns are somewhat different. Here species diversity is scattered among different subgeneric groups. Most of the high-diversity subgeneric groups seem to have diversified very recently (Late Tertiary), whereas the cladogenic events leading to these groups are much older (Early to Late Cretaceous). Our analysis shows that, although much living species diversity stems from relatively recent cladogenesis, the origins of the family (Early Carboniferous) and generic crown groups (Early Permian to Early Jurassic) are much more ancient events. PMID:11341801

  10. Multifarious selection through environmental change: acidity and predator-mediated adaptive divergence in the moor frog (Rana arvalis).

    PubMed

    Egea-Serrano, Andrés; Hangartner, Sandra; Laurila, Anssi; Räsänen, Katja

    2014-04-01

    Environmental change can simultaneously cause abiotic stress and alter biological communities, yet adaptation of natural populations to co-changing environmental factors is poorly understood. We studied adaptation to acid and predator stress in six moor frog (Rana arvalis) populations along an acidification gradient, where abundance of invertebrate predators increases with increasing acidity of R. arvalis breeding ponds. First, we quantified divergence among the populations in anti-predator traits (behaviour and morphology) at different rearing conditions in the laboratory (factorial combinations of acid or neutral pH and the presence or the absence of a caged predator). Second, we evaluated relative fitness (survival) of the populations by exposing tadpoles from the different rearing conditions to predation by free-ranging dragonfly larvae. We found that morphological defences (relative tail depth) as well as survival of tadpoles under predation increased with increasing pond acidity (under most experimental conditions). Tail depth and larval size mediated survival differences among populations, but the contribution of trait divergence to survival was strongly dependent on prior rearing conditions. Our results indicate that R. arvalis populations are adapted to the elevated predator pressure in acidified ponds and emphasize the importance of multifarious selection via both direct (here: pH) and indirect (here: predators) environmental changes. PMID:24552840

  11. Multifarious selection through environmental change: acidity and predator-mediated adaptive divergence in the moor frog (Rana arvalis).

    PubMed

    Egea-Serrano, Andrés; Hangartner, Sandra; Laurila, Anssi; Räsänen, Katja

    2014-04-01

    Environmental change can simultaneously cause abiotic stress and alter biological communities, yet adaptation of natural populations to co-changing environmental factors is poorly understood. We studied adaptation to acid and predator stress in six moor frog (Rana arvalis) populations along an acidification gradient, where abundance of invertebrate predators increases with increasing acidity of R. arvalis breeding ponds. First, we quantified divergence among the populations in anti-predator traits (behaviour and morphology) at different rearing conditions in the laboratory (factorial combinations of acid or neutral pH and the presence or the absence of a caged predator). Second, we evaluated relative fitness (survival) of the populations by exposing tadpoles from the different rearing conditions to predation by free-ranging dragonfly larvae. We found that morphological defences (relative tail depth) as well as survival of tadpoles under predation increased with increasing pond acidity (under most experimental conditions). Tail depth and larval size mediated survival differences among populations, but the contribution of trait divergence to survival was strongly dependent on prior rearing conditions. Our results indicate that R. arvalis populations are adapted to the elevated predator pressure in acidified ponds and emphasize the importance of multifarious selection via both direct (here: pH) and indirect (here: predators) environmental changes.

  12. Metazoan remaining genes for essential amino acid biosynthesis: sequence conservation and evolutionary analyses.

    PubMed

    Costa, Igor R; Thompson, Julie D; Ortega, José Miguel; Prosdocimi, Francisco

    2014-12-24

    Essential amino acids (EAA) consist of a group of nine amino acids that animals are unable to synthesize via de novo pathways. Recently, it has been found that most metazoans lack the same set of enzymes responsible for the de novo EAA biosynthesis. Here we investigate the sequence conservation and evolution of all the metazoan remaining genes for EAA pathways. Initially, the set of all 49 enzymes responsible for the EAA de novo biosynthesis in yeast was retrieved. These enzymes were used as BLAST queries to search for similar sequences in a database containing 10 complete metazoan genomes. Eight enzymes typically attributed to EAA pathways were found to be ubiquitous in metazoan genomes, suggesting a conserved functional role. In this study, we address the question of how these genes evolved after losing their pathway partners. To do this, we compared metazoan genes with their fungal and plant orthologs. Using phylogenetic analysis with maximum likelihood, we found that acetolactate synthase (ALS) and betaine-homocysteine S-methyltransferase (BHMT) diverged from the expected Tree of Life (ToL) relationships. High sequence conservation in the paraphyletic group Plant-Fungi was identified for these two genes using a newly developed Python algorithm. Selective pressure analysis of ALS and BHMT protein sequences showed higher non-synonymous mutation ratios in comparisons between metazoans/fungi and metazoans/plants, supporting the hypothesis that these two genes have undergone non-ToL evolution in animals.

  13. Mitochondrial genomes and divergence times of crocodile newts: inter-islands distribution of Echinotriton andersoni and the origin of a unique repetitive sequence found in Tylototriton mt genomes.

    PubMed

    Kurabayashi, Atsushi; Nishitani, Takuma; Katsuren, Seiki; Oumi, Shohei; Sumida, Masayuki

    2012-01-01

    Crocodile newts, which constitute the genera Echinotriton and Tylototriton, are known as living fossils, and these genera comprise many endangered species. To identify mitochondrial (mt) genes suitable for future population genetic analyses for endangered taxa, we determined the complete nucleotide sequences of the mt genomes of the Japanese crocodile newt Echinotriton andersoni and Himalayan crocodile newt Tylototriton verrucosus. Although the control region (CR) is known as the most variable mtDNA region in many animal taxa, the CRs of crocodile newts are highly conservative. Rather, the genes of NADH dehydrogenase subunits and ATPase subunit 6 were found to have high sequence divergences and to be usable for population genetics studies. To estimate the inter-population divergence ages of E. andersoni endemic to the Ryukyu Islands, we performed molecular dating analysis using whole and partial mt genomic data. The estimated divergence ages of the inter-island individuals are older than the paleogeographic segmentation ages of the islands, suggesting that the lineage splits of E. andersoni populations were not caused by vicariant events. Our phylogenetic analysis with partial mt sequence data also suggests the existence of at least two more undescribed species in the genus Tylototriton. We also found unusual repeat sequences containing the 3' region of cytochrome apoenzyme b gene, whole tRNA-Thr gene, and a noncoding region (the T-P noncoding region characteristic in caudate mtDNAs) from T. verrucosus mtDNA. Similar repeat sequences were found in two other Tylototriton species. The Tylototriton taxa with the repeats form a monophyletic group, indicating a single origin of the repeat sequences. The intra- and inter-specific comparisons of the repeat sequences suggest the occurrence of homologous recombination-based concerted evolution among the repeat sequences.

  14. Whole plastome sequencing reveals deep plastid divergence and cytonuclear discordance between closely related balsam poplars, Populus balsamifera and P. trichocarpa (Salicaceae).

    PubMed

    Huang, Daisie I; Hefer, Charles A; Kolosova, Natalia; Douglas, Carl J; Cronk, Quentin C B

    2014-11-01

    As molecular phylogenetic analyses incorporate ever-greater numbers of loci, cases of cytonuclear discordance - the phenomenon in which nuclear gene trees deviate significantly from organellar gene trees - are being reported more frequently. Plant examples of topological discordance, caused by recent hybridization between extant species, are well known. However, examples of branch-length discordance are less reported in plants relative to animals. We use a combination of de novo assembly and reference-based mapping using short-read shotgun sequences to construct a robust phylogeny of the plastome for multiple individuals of all the common Populus species in North America. We demonstrate a case of strikingly high plastome divergence, in contrast to little nuclear genome divergence, in two closely related balsam poplars, Populus balsamifera and Populus trichocarpa (Populus balsamifera ssp. trichocarpa). Previous studies with nuclear loci indicate that the two species (or subspecies) diverged since the late Pleistocene, whereas their plastomes indicate deep divergence, dating to at least the Pliocene (6-7 Myr ago). Our finding is in marked contrast to the estimated Pleistocene divergence of the nuclear genomes, previously calculated at 75 000 yr ago, suggesting plastid capture from a 'ghost lineage' of a now-extinct North American poplar. PMID:25078531

  15. Whole plastome sequencing reveals deep plastid divergence and cytonuclear discordance between closely related balsam poplars, Populus balsamifera and P. trichocarpa (Salicaceae).

    PubMed

    Huang, Daisie I; Hefer, Charles A; Kolosova, Natalia; Douglas, Carl J; Cronk, Quentin C B

    2014-11-01

    As molecular phylogenetic analyses incorporate ever-greater numbers of loci, cases of cytonuclear discordance - the phenomenon in which nuclear gene trees deviate significantly from organellar gene trees - are being reported more frequently. Plant examples of topological discordance, caused by recent hybridization between extant species, are well known. However, examples of branch-length discordance are less reported in plants relative to animals. We use a combination of de novo assembly and reference-based mapping using short-read shotgun sequences to construct a robust phylogeny of the plastome for multiple individuals of all the common Populus species in North America. We demonstrate a case of strikingly high plastome divergence, in contrast to little nuclear genome divergence, in two closely related balsam poplars, Populus balsamifera and Populus trichocarpa (Populus balsamifera ssp. trichocarpa). Previous studies with nuclear loci indicate that the two species (or subspecies) diverged since the late Pleistocene, whereas their plastomes indicate deep divergence, dating to at least the Pliocene (6-7 Myr ago). Our finding is in marked contrast to the estimated Pleistocene divergence of the nuclear genomes, previously calculated at 75 000 yr ago, suggesting plastid capture from a 'ghost lineage' of a now-extinct North American poplar.

  16. Amino acid sequences of proteins from Leptospira serovar pomona.

    PubMed

    Alves, S F; Lefebvre, R B; Probert, W

    2000-01-01

    This report describes partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the 32 and 45 kDa proteins. In situ digestion of the 40 kDa protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen amino acid residues was obtained from the 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a sequence of twenty amino acids, was obtained from the 40 kDa protein.

  17. The amino acid sequence of Staphylococcus aureus penicillinase.

    PubMed Central

    Ambler, R P

    1975-01-01

    The amino acid sequence of the penicillinase (penicillin amido-beta-lactamhydrolase, EC 3.5.2.6) from Staphylococcus aureus strain PC1 was determined. The protein consists of a single polypeptide chain of 257 residues, and the sequence was determined by characterization of tryptic, chymotryptic, peptic and CNBr peptides, with some additional evidence from thermolysin and S. aureus proteinase peptides. A mistake in the preliminary report of the sequence is corrected; residues 113-116 are now thought to be -Lys-Lys-Val-Lys- rather than -Lys-Val-Lys-Lys-. Detailed evidence for the amino acid sequence has been deposited as Supplementary Publication SUP 50056 (91 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1975) 145, 5. PMID:1218078

  18. Infrageneric phylogeny and temporal divergence of Sorghum (Andropogoneae, Poaceae) based on low-copy nuclear and plastid sequences.

    PubMed

    Liu, Qing; Liu, Huan; Wen, Jun; Peterson, Paul M

    2014-01-01

    The infrageneric phylogeny and temporal divergence of Sorghum were explored in the present study. Sequence data of two low-copy nuclear (LCN) genes, phosphoenolpyruvate carboxylase 4 (Pepc4) and granule-bound starch synthase I (GBSSI), from 79 accessions of Sorghum plus Cleistachne sorghoides together with those from outgroups were used for maximum likelihood (ML) and Bayesian inference (BI) analyses. Bayesian dating based on three plastid DNA markers (ndhA intron, rpl32-trnL, and rps16 intron) was used to estimate the ages of major diversification events in Sorghum. The monophyly of Sorghum plus Cleistachne sorghoides (with the latter nested within Sorghum) was strongly supported by the Pepc4 data using BI analysis, and the monophyly of Sorghum was strongly supported by GBSSI data using both ML and BI analyses. Sorghum was divided into three clades in the Pepc4, GBSSI, and plastid phylograms: the subg. Sorghum lineage; the subg. Parasorghum and Stiposorghum lineage; and the subg. Chaetosorghum and Heterosorghum lineage. Two LCN homoeologous loci of Cleistachne sorghoides were first discovered in the same accession. Sorghum arundinaceum, S. bicolor, S. x drummondii, S. propinquum, and S. virgatum were closely related to S. x almum in the Pepc4, GBSSI, and plastid phylograms, suggesting that they may be potential genome donors to S. almum. Multiple LCN and plastid allelic variants have been identified in S. halepense of subg. Sorghum. The crown ages of Sorghum plus Cleistachne sorghoides and subg. Sorghum are estimated to be 12.7 million years ago (Mya) and 8.6 Mya, respectively. Molecular results support the recognition of three distinct subgenera in Sorghum: subg. Chaetosorghum with two sections, each with a single species, subg. Parasorghum with 17 species, and subg. Sorghum with nine species and we also provide a new nomenclatural combination, Sorghum sorghoides.

  19. Infrageneric Phylogeny and Temporal Divergence of Sorghum (Andropogoneae, Poaceae) Based on Low-Copy Nuclear and Plastid Sequences

    PubMed Central

    Liu, Qing; Liu, Huan; Wen, Jun; Peterson, Paul M.

    2014-01-01

    The infrageneric phylogeny and temporal divergence of Sorghum were explored in the present study. Sequence data of two low-copy nuclear (LCN) genes, phosphoenolpyruvate carboxylase 4 (Pepc4) and granule-bound starch synthase I (GBSSI), from 79 accessions of Sorghum plus Cleistachne sorghoides together with those from outgroups were used for maximum likelihood (ML) and Bayesian inference (BI) analyses. Bayesian dating based on three plastid DNA markers (ndhA intron, rpl32-trnL, and rps16 intron) was used to estimate the ages of major diversification events in Sorghum. The monophyly of Sorghum plus Cleistachne sorghoides (with the latter nested within Sorghum) was strongly supported by the Pepc4 data using BI analysis, and the monophyly of Sorghum was strongly supported by GBSSI data using both ML and BI analyses. Sorghum was divided into three clades in the Pepc4, GBSSI, and plastid phylograms: the subg. Sorghum lineage; the subg. Parasorghum and Stiposorghum lineage; and the subg. Chaetosorghum and Heterosorghum lineage. Two LCN homoeologous loci of Cleistachne sorghoides were first discovered in the same accession. Sorghum arundinaceum, S. bicolor, S. x drummondii, S. propinquum, and S. virgatum were closely related to S. x almum in the Pepc4, GBSSI, and plastid phylograms, suggesting that they may be potential genome donors to S. almum. Multiple LCN and plastid allelic variants have been identified in S. halepense of subg. Sorghum. The crown ages of Sorghum plus Cleistachne sorghoides and subg. Sorghum are estimated to be 12.7 million years ago (Mya) and 8.6 Mya, respectively. Molecular results support the recognition of three distinct subgenera in Sorghum: subg. Chaetosorghum with two sections, each with a single species, subg. Parasorghum with 17 species, and subg. Sorghum with nine species and we also provide a new nomenclatural combination, Sorghum sorghoides. PMID:25122516

  20. A molecular phylogeny of dinoflagellate protists (pyrrhophyta) inferred from the sequence of 24S rRNA divergent domains D1 and D8.

    PubMed

    Lenaers, G; Scholin, C; Bhaud, Y; Saint-Hilaire, D; Herzog, M

    1991-01-01

    The sequence of two divergent domains (D1 and D8) from dinoflagellate 24S large subunit rRNA was determined by primer extension using total RNA as template. Nucleotide sequence alignments over 401 bases have been analyzed in order to investigate phylogenetic relationships within this highly divergent and taxonomically controversial group of protists of the division Pyrrhophyta. Data are provided confirming that dinoflagellates represent a monophyletic group. For 11 out of the 13 investigated laboratory grown species, an additional domain (D2) could not be completely sequenced by reverse transcription because of a hidden break located near its 3'-terminus. Two sets of sequence alignments were used to infer dinoflagellate phylogeny. The first [199 nucleotides (nt)] included conservative sequences flanking the D1 and D8 divergent domains. It was used to reconstruct a broad evolutionary tree for the dinoflagellates, which was rooted using Tetrahymena thermophila as the outgroup. To confirm the tree topology, and mainly the branchings leading to closely related species, a second alignment (401 nt) was considered, which included the D1 and D8 variable sequences in addition to the more conserved flanking regions. Species that showed sequence similarities with other species lower than 60% on average (Knuc values higher than 0.550) were removed from this analysis. A coherent and convincing evolutionary pattern was obtained for the dinoflagellates, also confirmed by the position of the hidden break within the D2 domain, which appears to be group specific. The reconstructed phylogeny indicates that the early emergence of Oxyrrhis marina preceded that of most Peridiniales, a large order of thecate species, whereas the unarmored Gymnodiniales appeared more recently, along with members of the Prorocentrales characterized by two thecal plates. In addition, the emergence of heterotrophic species preceded that of photosynthetic species. These results provide new perspectives on

  1. Identification of the novel candidate genes and variants in boar liver tissues with divergent skatole levels using RNA deep sequencing.

    PubMed

    Gunawan, Asep; Sahadevan, Sudeep; Cinar, Mehmet Ulas; Neuhoff, Christiane; Große-Brinkhaus, Christine; Frieden, Luc; Tesfaye, Dawit; Tholen, Ernst; Looft, Christian; Wondim, Dessie Salilew; Hölker, Michael; Schellander, Karl; Uddin, Muhammad Jasim

    2013-01-01

    Boar taint is the unpleasant odour of meat derived from non-castrated male pigs, caused by the accumulation of androstenone and skatole in fat. Skatole is a tryptophan metabolite produced by intestinal bacteria in gut and catabolised in liver. Since boar taint affects consumer's preference, the aim of this study was to perform transcriptome profiling in liver of boars with divergent skatole levels in backfat by using RNA-Seq. The total number of reads produced for each liver sample ranged from 11.8 to 39.0 million. Approximately 448 genes were differentially regulated (p-adjusted <0.05). Among them, 383 genes were up-regulated in higher skatole group and 65 were down-regulated (p<0.01, FC>1.5). Differentially regulated genes in the high skatole liver samples were enriched in metabolic processes such as small molecule biochemistry, protein synthesis, lipid and amino acid metabolism. Pathway analysis identified the remodeling of epithelial adherens junction and TCA cycle as the most dominant pathways which may play important roles in skatole metabolism. Differential gene expression analysis identified candidate genes in ATP synthesis, cytochrome P450, keratin, phosphoglucomutase, isocitrate dehydrogenase and solute carrier family. Additionally, polymorphism and association analysis revealed that mutations in ATP5B, KRT8, PGM1, SLC22A7 and IDH1 genes could be potential markers for skatole levels in boars. Furthermore, expression analysis of exon usage of three genes (ATP5B, KRT8 and PGM1) revealed significant differential expression of exons of these genes in different skatole levels. These polymorphisms and exon expression differences may have impacts on the gene activity ultimately leading to skatole variation and could be used as genetic marker for boar taint related traits. However, further validation is required to confirm the effect of these genetic markers in other pig populations in order to be used in genomic selection against boar taint in pig breeding

  2. Identification of the Novel Candidate Genes and Variants in Boar Liver Tissues with Divergent Skatole Levels Using RNA Deep Sequencing

    PubMed Central

    Gunawan, Asep; Sahadevan, Sudeep; Cinar, Mehmet Ulas; Neuhoff, Christiane; Große-Brinkhaus, Christine; Frieden, Luc; Tesfaye, Dawit; Tholen, Ernst; Looft, Christian; Wondim, Dessie Salilew; Hölker, Michael; Schellander, Karl; Uddin, Muhammad Jasim

    2013-01-01

    Boar taint is the unpleasant odour of meat derived from non-castrated male pigs, caused by the accumulation of androstenone and skatole in fat. Skatole is a tryptophan metabolite produced by intestinal bacteria in gut and catabolised in liver. Since boar taint affects consumer’s preference, the aim of this study was to perform transcriptome profiling in liver of boars with divergent skatole levels in backfat by using RNA-Seq. The total number of reads produced for each liver sample ranged from 11.8 to 39.0 million. Approximately 448 genes were differentially regulated (p-adjusted <0.05). Among them, 383 genes were up-regulated in higher skatole group and 65 were down-regulated (p<0.01, FC>1.5). Differentially regulated genes in the high skatole liver samples were enriched in metabolic processes such as small molecule biochemistry, protein synthesis, lipid and amino acid metabolism. Pathway analysis identified the remodeling of epithelial adherens junction and TCA cycle as the most dominant pathways which may play important roles in skatole metabolism. Differential gene expression analysis identified candidate genes in ATP synthesis, cytochrome P450, keratin, phosphoglucomutase, isocitrate dehydrogenase and solute carrier family. Additionally, polymorphism and association analysis revealed that mutations in ATP5B, KRT8, PGM1, SLC22A7 and IDH1 genes could be potential markers for skatole levels in boars. Furthermore, expression analysis of exon usage of three genes (ATP5B, KRT8 and PGM1) revealed significant differential expression of exons of these genes in different skatole levels. These polymorphisms and exon expression differences may have impacts on the gene activity ultimately leading to skatole variation and could be used as genetic marker for boar taint related traits. However, further validation is required to confirm the effect of these genetic markers in other pig populations in order to be used in genomic selection against boar taint in pig breeding

  3. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences

    PubMed Central

    Derr, Julien; Manapat, Michael L.; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A.; Chen, Irene A.

    2012-01-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life. PMID:22319215
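
    One simple way to put a number on "compositional bias" in a two-letter alphabet is the Shannon entropy of the letter frequencies. The sketch below is an illustration only: the function name and the choice of entropy as the measure are assumptions for this example, and the abstract does not say which diversity measure the authors used in their simulations.

```python
from collections import Counter
from math import log2


def composition_entropy_bits(sequence: str) -> float:
    """Shannon entropy (bits per position) of the letter composition.

    For a two-letter alphabet the value runs from 0.0 (one letter only,
    fully biased) to 1.0 (both letters equally frequent, unbiased).
    """
    if not sequence:
        raise ValueError("sequence must not be empty")
    counts = Counter(sequence)
    total = len(sequence)
    # Sum p * log2(1/p) over the letters that actually occur.
    return sum((n / total) * log2(total / n) for n in counts.values())


if __name__ == "__main__":
    for seq in ("AAAAAAAA", "AAABAAAB", "ABABABAB"):
        print(seq, round(composition_entropy_bits(seq), 4))
```

    A pool dominated by sequences such as the first one would score near zero, while a compositionally diverse pool would score closer to one bit per position.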

  4. The REH theory of protein and nucleic acid divergence - A retrospective update. [Random Evolutionary Hits]

    NASA Technical Reports Server (NTRS)

    Holmquist, R.

    1978-01-01

    The random evolutionary hits (REH) theory of evolutionary divergence, originally proposed in 1972, is restated with attention to certain aspects of the theory that have caused confusion. The theory assumes that natural selection and stochastic processes interact and that natural selection restricts those codon sites which may fix mutations. The predicted total number of fixed nucleotide replacements agrees with data for cytochrome c, alpha-hemoglobin, beta-hemoglobin, and myoglobin. The restatement analyzes the magnitude of possible sources of errors and simplifies calculational methodology by supplying polynomial expressions to replace tables and graphs.

  5. Highly divergent sequences of the pollen self-incompatibility (S) gene in class-I S haplotypes of Brassica campestris (syn. rapa) L.

    PubMed

    Watanabe, M; Ito, A; Takada, Y; Ninomiya, C; Kakizaki, T; Takahata, Y; Hatakeyama, K; Hinata, K; Suzuki, G; Takasaki, T; Satta, Y; Shiba, H; Takayama, S; Isogai, A

    2000-05-12

    Self-incompatibility (SI) enables flowering plants to discriminate between self- and non-self-pollen. In Brassica, SI is controlled by the highly polymorphic S locus. The recently identified male determinant, termed SP11 or SCR, is thought to be the ligand of S receptor kinase, the female determinant. To examine functional and evolutionary properties of SP11, we cloned 14 alleles from class-I S haplotypes of Brassica campestris and carried out sequence analyses. The sequences of mature SP11 proteins are highly divergent, except for the presence of conserved cysteines. The phylogenetic trees suggest possible co-evolution of the genes encoding the male and female determinants.

  6. Functional Divergence in the Genus Oenococcus as Predicted by Genome Sequencing of the Newly-Described Species, Oenococcus kitaharae

    PubMed Central

    Borneman, Anthony R.; McCarthy, Jane M.; Chambers, Paul J.; Bartowsky, Eveline J.

    2012-01-01

    Oenococcus kitaharae is only the second member of the genus Oenococcus to be identified and is the closest relative of the industrially important wine bacterium Oenococcus oeni. To provide insight into this new species, the genome of the type strain of O. kitaharae, DSM 17330, was sequenced. Comparison of the sequenced genomes of both species shows that the genome of O. kitaharae DSM 17330 contains many genes with predicted functions in cellular defence (bacteriocins, antimicrobials, restriction-modification systems and a CRISPR locus) which are lacking in O. oeni. The two genomes also appear to differentially encode several metabolic pathways associated with amino acid biosynthesis and carbohydrate utilization, which have direct phenotypic consequences. This would indicate that the two species have evolved different survival techniques to suit their particular environmental niches. O. oeni has adapted to survive in the harsh, but predictable, environment of wine that provides very few competitive species. However, O. kitaharae appears to have adapted to a growth environment in which biological competition provides a significant selective pressure by accumulating biological defence molecules, such as bacteriocins and restriction-modification systems, throughout its genome. PMID:22235313

  7. Development of an expert system for amino acid sequence identification.

    PubMed

    Hu, L; Saulinskas, E F; Johnson, P; Harrington, P B

    1996-08-01

    An expert system for amino acid sequence identification has been developed. The algorithm uses heuristic rules developed by human experts in protein sequencing. The system is applied to the chromatographic data of phenylthiohydantoin-amino acids acquired from an automated sequencer. The peak intensities in the current cycle are compared with those in the previous cycle, while the calibration and succeeding cycles are used as ancillary identification criteria when necessary. The retention time for each chromatographic peak in each cycle is corrected by the corresponding peak in the calibration cycle at the same run. The main improvement of our system compared with the onboard software used by the Applied Biosystems 477A Protein/Peptide Sequencer is that each peak in each cycle is assigned an identification name according to the corrected retention time to be used for the comparison with different cycles. The system was developed from analyses of ribonuclease A and evaluated by runs of four other protein samples that were not used in rule development. This paper demonstrates that rules developed by human experts can be automatically applied to sequence assignment. The expert system performed more accurately than the onboard software of the protein sequencer, in that the misidentification rates for the expert system were around 7%, whereas those for the onboard software were between 13 and 21%.

  8. Characterization of the sequence and expression pattern of LFY homologues from dogwood species (Cornus) with divergent inflorescence architectures

    PubMed Central

    Liu, Juan; Franks, Robert G.; Feng, Chun-Miao; Liu, Xiang; Fu, Cheng-Xin; Xiang, Qiu-Yun (Jenny)

    2013-01-01

    Background and Aims: LFY homologues encode transcription factors that regulate the transition from vegetative to reproductive growth in flowering plants and have been shown to control inflorescence patterning in model species. This study investigated the expression patterns of LFY homologues within the diverse inflorescence types (head-like, umbel-like and inflorescences with elongated internodes) in closely related lineages in the dogwood genus (Cornus s.l.). The study sought to determine whether LFY homologues in Cornus species are expressed during floral and inflorescence development and if the pattern of expression is consistent with a function in regulating floral development and inflorescence architectures in the genus. Methods: Total RNAs were extracted using the CTAB method and the first-strand cDNA was synthesized using the SuperScript III first-strand synthesis system kit (Invitrogen). Expression of CorLFY was investigated by RT–PCR and RNA in situ hybridization. Phylogenetic analyses were conducted using the maximum likelihood methods implemented in RAxML-HPC v7.2.8. Key Results: cDNA clones of LFY homologues (designated CorLFY) were isolated from six Cornus species bearing different types of inflorescence. CorLFY cDNAs were predicted to encode proteins of approximately 375 amino acids. The detection of CorLFY expression patterns using in situ RNA hybridization demonstrated the expression of CorLFY within the inflorescence meristems, inflorescence branch meristems, floral meristems and developing floral organ primordia. PCR analyses for cDNA libraries derived from reverse transcription of total RNAs showed that CorLFY was also expressed during the late-stage development of flowers and inflorescences, as well as in bracts and developing leaves. Consistent differences in the CorLFY expression patterns were not detected among the distinct inflorescence types. Conclusions: The results suggest a role for CorLFY genes during floral and inflorescence development

  9. Genome-wide association studies for fatty acid metabolic traits in five divergent pig populations.

    PubMed

    Zhang, Wanchang; Yang, Bin; Zhang, Junjie; Cui, Leilei; Ma, Junwu; Chen, Congying; Ai, Huashui; Xiao, Shijun; Ren, Jun; Huang, Lusheng

    2016-04-21

    Fatty acid composition profiles are important indicators of meat quality and flavor. Metabolic indices of fatty acids reflect meat nutrition and public acceptance more faithfully. To investigate the genetic mechanism of fatty acid metabolic indices in pork, we conducted genome-wide association studies (GWAS) for 33 fatty acid metabolic traits in five pig populations. We identified a total of 865 single nucleotide polymorphisms (SNPs), corresponding to 11 genome-wide significant loci on nine chromosomes and 12 suggestive loci on nine chromosomes. Our findings not only confirmed seven previously reported QTL with stronger association strength, but also revealed four novel population-specific loci, showing that investigations on intermediate phenotypes like the metabolic traits of fatty acids can increase the statistical power of GWAS for end-point phenotypes. We proposed a list of candidate genes at the identified loci, including three novel genes (FADS2, SREBF1 and PLA2G7). Further, we constructed the functional networks involving these candidate genes and deduced the potential fatty acid metabolic pathway. These findings advance our understanding of the genetic basis of fatty acid composition in pigs. The results from European hybrid commercial pigs can be immediately translated into breeding practice for beneficial fatty acid composition.

  10. Genome-wide association studies for fatty acid metabolic traits in five divergent pig populations

    PubMed Central

    Zhang, Wanchang; Yang, Bin; Zhang, Junjie; Cui, Leilei; Ma, Junwu; Chen, Congying; Ai, Huashui; Xiao, Shijun; Ren, Jun; Huang, Lusheng

    2016-01-01

    Fatty acid composition profiles are important indicators of meat quality and flavor. Metabolic indices of fatty acids reflect meat nutrition and public acceptance more faithfully. To investigate the genetic mechanism of fatty acid metabolic indices in pork, we conducted genome-wide association studies (GWAS) for 33 fatty acid metabolic traits in five pig populations. We identified a total of 865 single nucleotide polymorphisms (SNPs), corresponding to 11 genome-wide significant loci on nine chromosomes and 12 suggestive loci on nine chromosomes. Our findings not only confirmed seven previously reported QTL with stronger association strength, but also revealed four novel population-specific loci, showing that investigations on intermediate phenotypes like the metabolic traits of fatty acids can increase the statistical power of GWAS for end-point phenotypes. We proposed a list of candidate genes at the identified loci, including three novel genes (FADS2, SREBF1 and PLA2G7). Further, we constructed the functional networks involving these candidate genes and deduced the potential fatty acid metabolic pathway. These findings advance our understanding of the genetic basis of fatty acid composition in pigs. The results from European hybrid commercial pigs can be immediately translated into breeding practice for beneficial fatty acid composition. PMID:27097669

  11. Extreme sequence divergence between mitochondrial genomes of two subspecies of White-breasted Wood-wren (Henicorhina leucosticta, Cabanis, 1847) from western and central Panamá.

    PubMed

    Aguilar, Celestino; De Léon, Luis Fernando; Loaiza, José R; McMillan, W Owen; Miller, Matthew J

    2016-01-01

    Prior studies of mitochondrial variation in White-breasted Wood-Wrens (Henicorhina leucosticta) have suggested that populations in South America and Mesoamerica might represent multiple species. Here we report the complete mitochondrial genomes from two individuals of H. leucosticta, representing the Panamanian subspecies pittieri and alexandri. The two sequences were 16,721 and 16,726 base pairs in size, with both genomes comprising the usual 22 tRNA genes, 2 rRNA genes, 13 protein-coding genes, and one displacement loop region in the standard avian order. Uncorrected pairwise divergence between mitogenome features was high, with the highest divergence occurring in protein-coding genes (average = 8.2%), followed by the control region (6.7%). RNA features had lower pairwise divergences (average tRNA = 4.3%, average rRNA = 2.3%). The protein-coding ATPase 6 gene had a different stop codon in each of the two specimens. The high level of sequence variation between these subspecies suggests that Mesoamerican H. leucosticta might comprise multiple species. We urge a full phylogeographic survey of this widespread Neotropical forest bird. PMID:24938093
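
    The "uncorrected pairwise divergence" reported above is commonly computed as a p-distance: the fraction of aligned sites at which two sequences differ, with no substitution model applied. The following is a minimal sketch under that assumption, not the authors' pipeline; the function name, the decision to skip gaps and ambiguous bases, and the toy sequences are all hypothetical.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected pairwise divergence between two aligned sequences.

    Sites where either sequence has a gap ('-') or an ambiguous base ('N')
    are skipped (an assumption of this sketch; other conventions exist).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue  # site not comparable
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Hypothetical toy alignment: 8 sites, 1 difference.
toy_divergence = p_distance("ACGTACGT", "ACGTACGA")
print(f"{toy_divergence:.1%}")
```

    Applied per feature (each protein-coding gene, tRNA, rRNA, control region) and then averaged, a calculation of this kind would yield per-class percentages comparable in form to those quoted in the abstract.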

  12. Diverging effects of anthropogenic acidification and natural acidity on community structure in Swedish streams.

    PubMed

    Petrin, Zlatko; Laudon, Hjalmar; Malmqvist, Björn

    2008-05-15

    Anthropogenic acidification caused by aerial deposition of acidifying substances is known to have detrimental effects on freshwater biota, including reductions in species diversity and ecosystem functioning. However, such impairment is not found in systems acidified to a similar extent by natural processes. A proposed explanation for this difference is that freshwater organisms have had far more time to evolve and adapt to natural than anthropogenic acidification. Thus, where acidity is natural, adaptation may account for diverse and functional communities. Here, we investigated whether adaptations--that were previously implied to occur on small spatial scales--may explain the species richness patterns on a much larger geographical scale, apply to ecological functioning, and are relevant in Sweden, where natural acidity is geologically relatively recent. Therefore, we compared differences in species diversity and ecosystem process rates between 24 acidic and circumneutral streams in northern Sweden, where acidity is natural, and southern Sweden, where acidity is largely anthropogenic. In agreement with our predictions, the difference in macroinvertebrate species richness between acidic and circumneutral streams was threefold larger in the region where acidity was anthropogenic than where it was natural, albeit marginally non-significantly. In contrast, no such trend was found for the rates of decomposition by microbes and leaf-feeding macroinvertebrates, possibly due to functional redundancy. The structure of species assemblages differed between acidic and circumneutral sites and between the regions. Our results agree with the notion that freshwater biota are adapted to natural acidity, but competing explanations including other differences in water chemistry and differences in the biogeographical colonization histories may also account for part of the observed patterns. Since naturally acidic environments similar to those in northern Sweden are widespread, we

  14. The genome of RNA tumor viruses contains polyadenylic acid sequences.

    PubMed

    Green, M; Cartas, M

    1972-04-01

    The 70S genome of two RNA tumor viruses, murine sarcoma virus and avian myeloblastosis virus, binds to Millipore filters in buffer with high salt concentration and to glass fiber filters containing poly(U). These observations suggest that 70S RNA contains adenylic acid-rich sequences. When digested by pancreatic RNase, 70S RNA of murine sarcoma virus yielded poly(A) sequences that contain 91% adenylic acid. These poly(A) sequences sedimented as a relatively homogenous peak in sucrose gradients with a sedimentation coefficient of 4-5 S, but had a mobility during polyacrylamide gel electrophoresis that corresponds to molecules that sediment at 6-7 S. If we estimate a molecular weight for each sequence of 30,000-60,000 (100-200 nucleotides) and a molecular weight for viral 70S RNA of 3-12 million, each viral genome could contain 1-8 poly(A) sequences. Possible functions of poly(A) in the infecting viral RNA may include a role in the initiation of viral DNA or RNA synthesis, in protein maturation, or in the assembly of the viral genome.

  15. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  16. Nucleic acid sequence design via efficient ensemble defect optimization.

    PubMed

    Zadeh, Joseph N; Wolfe, Brian R; Pierce, Niles A

    2011-02-01

    We describe an algorithm for designing the sequence of one or more interacting nucleic acid strands intended to adopt a target secondary structure at equilibrium. Sequence design is formulated as an optimization problem with the goal of reducing the ensemble defect below a user-specified stop condition. For a candidate sequence and a given target secondary structure, the ensemble defect is the average number of incorrectly paired nucleotides at equilibrium evaluated over the ensemble of unpseudoknotted secondary structures. To reduce the computational cost of accepting or rejecting mutations to a random initial sequence, candidate mutations are evaluated on the leaf nodes of a tree-decomposition of the target structure. During leaf optimization, defect-weighted mutation sampling is used to select each candidate mutation position with probability proportional to its contribution to the ensemble defect of the leaf. As subsequences are merged moving up the tree, emergent structural defects resulting from crosstalk between sibling sequences are eliminated via reoptimization within the defective subtree starting from new random subsequences. Using a Θ(N³) dynamic program to evaluate the ensemble defect of a target structure with N nucleotides, this hierarchical approach implies an asymptotic optimality bound on design time: for sufficiently large N, the cost of sequence design is bounded below by 4/3 the cost of a single evaluation of the ensemble defect for the full sequence. Hence, the design algorithm has time complexity Ω(N³). For target structures containing N ∈ {100, 200, 400, 800, 1600, 3200} nucleotides and duplex stems ranging from 1 to 30 base pairs, RNA sequence designs at 37°C typically succeed in satisfying a stop condition with ensemble defect less than N/100. Empirically, the sequence design algorithm exhibits asymptotic optimality and the exponent in the time complexity bound is sharp.
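
    The ensemble defect and the defect-weighted sampling step described above can be illustrated with a short sketch. It assumes the equilibrium pair probabilities are already available from some thermodynamic model (they are not computed here) and are laid out as an N × (N+1) table whose last column holds each nucleotide's probability of being unpaired; that layout, the function names, and the toy numbers are assumptions of this sketch, and the tree decomposition and reoptimization steps of the published algorithm are not reproduced.

```python
import random


def nucleotide_defects(pair_probs, target_partner):
    """Per-nucleotide contribution to the ensemble defect.

    pair_probs[i][j]   -- equilibrium probability that nucleotide i pairs with j
    pair_probs[i][N]   -- equilibrium probability that nucleotide i is unpaired
    target_partner[i]  -- index of i's partner in the target, or None if unpaired
    """
    n = len(target_partner)
    defects = []
    for i, partner in enumerate(target_partner):
        column = n if partner is None else partner
        # Probability that nucleotide i is NOT in its target state.
        defects.append(1.0 - pair_probs[i][column])
    return defects


def ensemble_defect(pair_probs, target_partner):
    """Average number of incorrectly paired nucleotides at equilibrium."""
    return sum(nucleotide_defects(pair_probs, target_partner))


def sample_mutation_position(pair_probs, target_partner, rng=random):
    """Pick a position with probability proportional to its defect."""
    defects = nucleotide_defects(pair_probs, target_partner)
    if sum(defects) <= 0.0:
        return None  # nothing to fix
    return rng.choices(range(len(defects)), weights=defects, k=1)[0]


# Hypothetical 3-nucleotide target: 0 pairs with 2, nucleotide 1 is unpaired.
toy_target = [2, None, 0]
toy_probs = [
    [0.0, 0.0, 0.8, 0.2],
    [0.0, 0.0, 0.0, 1.0],
    [0.8, 0.0, 0.0, 0.2],
]
toy_defect = ensemble_defect(toy_probs, toy_target)
toy_position = sample_mutation_position(toy_probs, toy_target)
```

    In this toy case only the two paired nucleotides carry any defect, so the sampler never proposes a mutation at the unpaired position.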

  17. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  18. cis-Acting sequences required for expression of the divergently transcribed Drosophila melanogaster Sgs-7 and Sgs-8 glue protein genes

    SciTech Connect

    Hofmann, A.; Garfinkel, M.D.; Meyerowitz, E.M.

    1991-06-01

    The Sgs-7 and Sgs-8 glue genes at 68C are divergently transcribed and are separated by 475 bp. Fusion genes with Adh or lacZ coding sequences were constructed, and the expression of these genes, with different amounts of upstream sequences present, was tested by a transient expression procedure and by germ line transformation. A cis-acting element for both genes is located asymmetrically in the intergenic region between −211 and −43 bp relative to Sgs-7. It is required for correct expression of both genes. This element can confer the stage- and tissue-specific expression pattern of glue genes on a heterologous promoter. An 86-bp portion of the element, from −133 to −48 bp relative to Sgs-7, is shown to be capable of enhancing the expression of a truncated and therefore weakly expressed Sgs-3 fusion gene. Recently described common sequence motifs of glue gene regulatory elements.

  19. The amino acid sequence of Escherichia coli cyanase.

    PubMed

    Chin, C C; Anderson, P M; Wold, F

    1983-01-10

    The amino acid sequence of the enzyme cyanase (cyanate hydrolase) from Escherichia coli has been determined by automatic Edman degradation of the intact protein and of its component peptides. The primary peptides used in the sequencing were produced by cyanogen bromide cleavage at the methionine residues, yielding 4 peptides plus free homoserine from the NH2-terminal methionine, and by trypsin cleavage at the 7 arginine residues after acetylation of the lysines. Secondary peptides required for overlaps and COOH-terminal sequences were produced by chymotrypsin or clostripain cleavage of some of the larger peptides. The complete sequence of the cyanase subunit consists of 156 amino acid residues (Mr 16,350). Based on the observation that the cysteine-containing peptide is obtained as a disulfide-linked dimer, it is proposed that the covalent structure of cyanase is made up of two subunits linked by a disulfide bond between the single cysteine residue in each subunit. The native enzyme (Mr 150,000) then appears to be a complex of four or five such subunit dimers.
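
    The "four or five such subunit dimers" estimate follows from the masses quoted in the abstract. The arithmetic can be checked with a few lines; the variable names are illustrative, and only the three figures taken from the abstract (156 residues, Mr 16,350, Mr 150,000) are source data.

```python
# Figures quoted in the abstract.
SUBUNIT_MR = 16_350   # relative molecular mass of one cyanase subunit
RESIDUES = 156        # amino acid residues per subunit
NATIVE_MR = 150_000   # relative molecular mass of the native enzyme

# Mean mass per residue, as a sanity check on the subunit mass.
mean_residue_mass = SUBUNIT_MR / RESIDUES

# A disulfide-linked dimer is two subunits (mass lost on disulfide
# formation is negligible at this precision).
dimer_mr = 2 * SUBUNIT_MR

# Number of dimers needed to account for the native mass.
dimers_per_native = NATIVE_MR / dimer_mr

print(f"mean residue mass: {mean_residue_mass:.1f}")
print(f"dimer Mr: {dimer_mr}")
print(f"dimers per native enzyme: {dimers_per_native:.2f}")
```

    The ratio falls between 4 and 5, which is consistent with the abstract's hedged wording rather than a single definite stoichiometry.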

  20. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy have been used extensively in the physical surface sciences to study quantum tunneling, to measure the electronic local density of states of nanomaterials, and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecules of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique "electronic fingerprints" for all nucleotides on DNA and RNA using Q-Seq, along with their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and determined the HOMO, LUMO and energy gap for all of them. In addition, we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  1. Evolutionary Divergence of Plant Borate Exporters and Critical Amino Acid Residues for the Polar Localization and Boron-Dependent Vacuolar Sorting of AtBOR1.

    PubMed

    Wakuta, Shinji; Mineta, Katsuhiko; Amano, Taro; Toyoda, Atsushi; Fujiwara, Toru; Naito, Satoshi; Takano, Junpei

    2015-05-01

    Boron (B) is an essential micronutrient for plants but is toxic when accumulated in excess. The plant BOR family encodes plasma membrane-localized borate exporters (BORs) that control translocation and homeostasis of B under a wide range of conditions. In this study, we examined the evolutionary divergence of BORs among terrestrial plants and showed that the lycophyte Selaginella moellendorffii and angiosperms have evolved two types of BOR (clades I and II). Clade I includes AtBOR1 and homologs previously shown to be involved in efficient transport of B under conditions of limited B availability. AtBOR1 shows polar localization in the plasma membrane and high-B-induced vacuolar sorting, important features for efficient B transport under low-B conditions, and rapid down-regulation to avoid B toxicity. Clade II includes AtBOR4 and barley Bot1 involved in B exclusion for high-B tolerance. We showed, using yeast complementation and B transport assays, that three genes in S. moellendorffii, SmBOR1 in clade I and SmBOR3 and SmBOR4 in clade II, encode functional BORs. Furthermore, amino acid sequence alignments identified an acidic di-leucine motif unique in clade I BORs. Mutational analysis of AtBOR1 revealed that the acidic di-leucine motif is required for the polarity and high-B-induced vacuolar sorting of AtBOR1. Our data clearly indicated that the common ancestor of vascular plants had already acquired two types of BOR for low- and high-B tolerance, and that the BOR family evolved to establish B tolerance in each lineage by adapting to their environments. PMID:25619824

  2. Evolutionary Divergence of Plant Borate Exporters and Critical Amino Acid Residues for the Polar Localization and Boron-Dependent Vacuolar Sorting of AtBOR1.

    PubMed

    Wakuta, Shinji; Mineta, Katsuhiko; Amano, Taro; Toyoda, Atsushi; Fujiwara, Toru; Naito, Satoshi; Takano, Junpei

    2015-05-01

    Boron (B) is an essential micronutrient for plants but is toxic when accumulated in excess. The plant BOR family encodes plasma membrane-localized borate exporters (BORs) that control translocation and homeostasis of B under a wide range of conditions. In this study, we examined the evolutionary divergence of BORs among terrestrial plants and showed that the lycophyte Selaginella moellendorffii and angiosperms have evolved two types of BOR (clades I and II). Clade I includes AtBOR1 and homologs previously shown to be involved in efficient transport of B under conditions of limited B availability. AtBOR1 shows polar localization in the plasma membrane and high-B-induced vacuolar sorting, important features for efficient B transport under low-B conditions, and rapid down-regulation to avoid B toxicity. Clade II includes AtBOR4 and barley Bot1 involved in B exclusion for high-B tolerance. We showed, using yeast complementation and B transport assays, that three genes in S. moellendorffii, SmBOR1 in clade I and SmBOR3 and SmBOR4 in clade II, encode functional BORs. Furthermore, amino acid sequence alignments identified an acidic di-leucine motif unique in clade I BORs. Mutational analysis of AtBOR1 revealed that the acidic di-leucine motif is required for the polarity and high-B-induced vacuolar sorting of AtBOR1. Our data clearly indicated that the common ancestor of vascular plants had already acquired two types of BOR for low- and high-B tolerance, and that the BOR family evolved to establish B tolerance in each lineage by adapting to their environments.

  3. Middle-Upper Pleistocene climate changes shaped the divergence and demography of Cycas guizhouensis (Cycadaceae): Evidence from DNA sequences and microsatellite markers

    PubMed Central

    Feng, Xiuyan; Zheng, Ying; Gong, Xun

    2016-01-01

    Climatic oscillations in the Pleistocene have had profound effects on the demography and genetic diversity of many extant species. Cycas guizhouensis Lan & R.F. Zou is an endemic and endangered species in Southwest China that is primarily distributed along the valleys of the Nanpan River. In this study, we used four chloroplast DNAs (cpDNA), three nuclear genes (nDNA) and 13 microsatellite (SSR) loci to investigate the genetic structure, divergence time and demographic history of 11 populations of C. guizhouensis. High genetic diversity and high levels of genetic differentiation among the populations were observed. Two evolutionary units were revealed based on network and Structure analysis. The divergence time estimations suggested that haplotypes of C. guizhouensis diverged during the Middle-Upper Pleistocene. Additionally, the demographic histories deduced from different DNA sequences were discordant, but overall indicated that C. guizhouensis had experienced a recent population expansion during the post-glacial period. Microsatellite data revealed that there was a contraction in effective population size in the past. These genetic features allow conservation measures to be taken to ensure the protection of this endangered species from extinction. PMID:27270859

  4. Middle-Upper Pleistocene climate changes shaped the divergence and demography of Cycas guizhouensis (Cycadaceae): Evidence from DNA sequences and microsatellite markers.

    PubMed

    Feng, Xiuyan; Zheng, Ying; Gong, Xun

    2016-01-01

    Climatic oscillations in the Pleistocene have had profound effects on the demography and genetic diversity of many extant species. Cycas guizhouensis Lan & R.F. Zou is an endemic and endangered species in Southwest China that is primarily distributed along the valleys of the Nanpan River. In this study, we used four chloroplast DNAs (cpDNA), three nuclear genes (nDNA) and 13 microsatellite (SSR) loci to investigate the genetic structure, divergence time and demographic history of 11 populations of C. guizhouensis. High genetic diversity and high levels of genetic differentiation among the populations were observed. Two evolutionary units were revealed based on network and Structure analysis. The divergence time estimations suggested that haplotypes of C. guizhouensis diverged during the Middle-Upper Pleistocene. Additionally, the demographic histories deduced from different DNA sequences were discordant, but overall indicated that C. guizhouensis had experienced a recent population expansion during the post-glacial period. Microsatellite data revealed that there was a contraction in effective population size in the past. These genetic features allow conservation measures to be taken to ensure the protection of this endangered species from extinction. PMID:27270859

  5. Structurally divergent lysophosphatidic acid acyltransferases with high selectivity for saturated medium chain fatty acids from Cuphea seeds.

    PubMed

    Kim, Hae Jin; Silva, Jillian E; Iskandarov, Umidjon; Andersson, Mariette; Cahoon, Rebecca E; Mockaitis, Keithanne; Cahoon, Edgar B

    2015-12-01

    Lysophosphatidic acid acyltransferase (LPAT) catalyzes acylation of the sn-2 position on lysophosphatidic acid by an acyl CoA substrate to produce the phosphatidic acid precursor of polar glycerolipids and triacylglycerols (TAGs). In the case of TAGs, this reaction is typically catalyzed by an LPAT2 from microsomal LPAT class A that has high specificity for C18 fatty acids containing Δ9 unsaturation. Because of this specificity, the occurrence of saturated fatty acids in the TAG sn-2 position is infrequent in seed oils. To identify LPATs with variant substrate specificities, deep transcriptomic mining was performed on seeds of two Cuphea species producing TAGs that are highly enriched in saturated C8 and C10 fatty acids. From these analyses, cDNAs for seven previously unreported LPATs were identified, including cDNAs from Cuphea viscosissima (CvLPAT2) and Cuphea avigera var. pulcherrima (CpuLPAT2a) encoding microsomal, seed-specific class A LPAT2s and a cDNA from C. avigera var. pulcherrima (CpuLPATB) encoding a microsomal, seed-specific LPAT from the bacterial-type class B. The activities of these enzymes were characterized in Camelina sativa by seed-specific co-expression with cDNAs for various Cuphea FatB acyl-acyl carrier protein thioesterases (FatB) that produce a variety of saturated medium-chain fatty acids. CvLPAT2 and CpuLPAT2a expression resulted in accumulation of 10:0 fatty acids in the Camelina sativa TAG sn-2 position, indicating a 10:0 CoA specificity that has not been previously described for plant LPATs. CpuLPATB expression generated TAGs with 14:0 at the sn-2 position, but not 10:0. Identification of these LPATs provides tools for understanding the structural basis of LPAT substrate specificity and for generating altered oil functionalities.

  6. Profiling serum bile acid glucuronides in humans: gender divergences, genetic determinants and response to fenofibrate

    PubMed Central

    Trottier, Jocelyn; Perreault, Martin; Rudkowska, Iwona; Levy, Cynthia; Dallaire-Theroux, Amélie; Verreault, Mélanie; Caron, Patrick; Staels, Bart; Vohl, Marie-Claude; Straka, Robert J.; Barbier, Olivier

    2014-01-01

    Glucuronidation, catalyzed by UDP-glucuronosyltransferase (UGT) enzymes, detoxifies cholestatic bile acids (BAs). We aimed at i) characterizing the circulating BA-glucuronide (-G) pool composition in humans, ii) evaluating how sex and UGT polymorphisms influence this composition, and iii) analyzing the effects of the lipid-lowering drug fenofibrate on the circulating BA-G profile in 300 volunteers and 5 cholestatic patients. Eleven BA-Gs were determined in pre- and post-fenofibrate samples. Men exhibited higher BA-G concentrations, and various genotype/BA-G associations were discovered in relevant UGT genes. The chenodeoxycholic acid-3G (CDCA-3G) concentration was associated with the UGT2B7 802C>T polymorphism. Glucuronidation assays confirmed the predominant role of UGT2B7 and UGT1A4 in CDCA-3G formation. Fenofibrate exposure increased the serum levels of 5 BA-G species, including CDCA-3G, and up-regulated expression of UGT1A4, but not UGT2B7, in hepatic cells. This study demonstrates that fenofibrate stimulates BA glucuronidation in humans, and thus reduces bile acid toxicity in the liver. PMID:23756370

  7. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    SciTech Connect

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  8. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca²⁺- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B₄. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the ³²P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A₄ hydrolase or Ca²⁺-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approximately 2700 nucleotides in leukocytes, lung, and placenta.

  9. Characterization and amino acid sequence of a fatty acid-binding protein from human heart.

    PubMed Central

    Offner, G D; Brecher, P; Sawlivich, W B; Costello, C E; Troxler, R F

    1988-01-01

    The complete amino acid sequence of a fatty acid-binding protein from human heart was determined by automated Edman degradation of CNBr, BNPS-skatole [3'-bromo-3-methyl-2-(2-nitrobenzenesulphenyl)indolenine], hydroxylamine, Staphylococcus aureus V8 proteinase, tryptic and chymotryptic peptides, and by digestion of the protein with carboxypeptidase A. The sequence of the blocked N-terminal tryptic peptide from citraconylated protein was determined by collisionally induced decomposition mass spectrometry. The protein contains 132 amino acid residues, is enriched with respect to threonine and lysine, lacks cysteine, has an acetylated valine residue at the N-terminus, and has an Mr of 14768 and an isoelectric point of 5.25. This protein contains two short internal repeated sequences from residues 48-54 and from residues 114-119 located within regions of predicted beta-structure and decreasing hydrophobicity. These short repeats are contained within two longer repeated regions from residues 48-60 and residues 114-125, which display 62% sequence similarity. These regions could accommodate the charged and uncharged moieties of long-chain fatty acids and may represent fatty acid-binding domains consistent with the finding that human heart fatty acid-binding protein binds 2 mol of oleate or palmitate/mol of protein. Detailed evidence for the amino acid sequences of the peptides has been deposited as Supplementary Publication SUP 50143 (23 pages) at the British Library Lending Division, Boston Spa, Yorkshire LS23 7BQ, U.K., from whom copies may be obtained as indicated in Biochem. J. (1988) 249, 5. PMID:3421901

  10. A case of orthologous sequences of hemocyanin subunits for an evolutionary study of horseshoe crabs: amino acid sequence comparison of immunologically identical subunits of Carcinoscorpius rotundicauda and Tachypleus tridentatus.

    PubMed

    Sugita, H; Shishikura, F

    1995-10-01

    About 83% of the amino acid sequence of hemocyanin subunit HR6 from the Southeast Asian horseshoe crab, Carcinoscorpius rotundicauda, has been determined. There is a difference of about 43% between HR6 and complete sequences of chelicerate hemocyanin subunits from the American horseshoe crab, Limulus polyphemus, and a tarantula, Eurypelma californicum. However, the immunologically identical subunits HR6 and HT6 from Tachypleus tridentatus (Japanese horseshoe crab) show 2.7% sequence difference. Based on the amino acid sequences of HR6 and HT6, the divergence between C. rotundicauda and T. tridentatus occurred about 9.6 million years ago. In the case of horseshoe crab hemocyanin subunits, it seems that the orthologous homologues in many homologous subunits between species are immunologically detectable.
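
    The relation between the 2.7% sequence difference and the 9.6-million-year estimate can be illustrated with a strict molecular clock. The sketch below is only a back-calculation under stated assumptions: an uncorrected pairwise difference, a constant rate, and equal rates in both lineages. The abstract does not give the calibration the authors used, and the function names are hypothetical.

```python
def implied_rate_per_lineage(pairwise_difference, time_my):
    """Substitution rate per site per million years in each lineage.

    Assumes a strict clock: two lineages diverging for time_my million years
    accumulate a pairwise difference d = 2 * rate * time.
    """
    return pairwise_difference / (2.0 * time_my)


def divergence_time_my(pairwise_difference, rate_per_lineage):
    """Divergence time in million years under the same strict-clock assumption."""
    return pairwise_difference / (2.0 * rate_per_lineage)


# Figures quoted in the abstract: 2.7% difference, about 9.6 million years.
rate = implied_rate_per_lineage(0.027, 9.6)
print(f"implied rate: {rate:.5f} substitutions per site per My per lineage")
```

    Under these assumptions the two quoted figures correspond to roughly 0.0014 replacements per site per million years per lineage; a corrected distance or a different calibration would change that value.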

  12. Genetic divergence and evolutionary relationship in Fejervarya cancrivora from Indonesia and other Asian countries inferred from allozyme and MtDNA sequence analyses.

    PubMed

    Kurniawan, Nia; Islam, Mohammed Mafizul; Djong, Tjong Hon; Igawa, Takeshi; Daicus, M Belabut; Yong, Hoi Sen; Wanichanon, Ratanasate; Khan, Md Mukhlesur Rahman; Iskandar, Djoko T; Nishioka, Midori; Sumida, Masayuki

    2010-03-01

    To elucidate genetic divergence and evolutionary relationship in Fejervarya cancrivora from Indonesia and other Asian countries, allozyme and molecular analyses were carried out using 131 frogs collected from 24 populations in Indonesia, Thailand, Bangladesh, Malaysia, and the Philippines. In the allozymic survey, seventeen enzymatic loci were examined for 92 frogs from eight representative localities. The results showed that F. cancrivora is subdivided into two main groups, the mangrove type and the large- plus Pelabuhan ratu types. The average Nei's genetic distance between the two groups was 0.535. Molecular phylogenetic trees based on nucleotide sequences of the 16S rRNA and Cyt b genes and constructed with the ML, MP, NJ, and BI methods also showed that the individuals of F. cancrivora analyzed comprised two clades, the mangrove type and the large plus Pelabuhan ratu / Sulawesi types, the latter further split into two subclades, the large type and the Pelabuhan ratu / Sulawesi type. The geographical distribution of individuals of the three F. cancrivora types was examined. Ten individuals from Bangladesh, Thailand, and the Philippines represented the mangrove type; 34 individuals from Malaysia and Indonesia represented the large type; and 11 individuals from Indonesia represented the Pelabuhan ratu / Sulawesi type. Average sequence divergences among the three types were 5.78-10.22% for the 16S and 12.88-16.38% for Cyt b. Our results suggest that each of the three types can be regarded as a distinct species. PMID:20192690
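
    The genetic distance reported from the allozyme data can be illustrated with Nei's standard genetic distance, D = -ln(I), where I is the normalized genetic identity computed from allele frequencies. The sketch below assumes that this standard (1972) form is the one meant; the abstract does not say which variant was used, and the function name and input layout are hypothetical.

```python
import math


def nei_standard_distance(pop_x, pop_y):
    """Nei's standard genetic distance D = -ln(I) between two populations.

    pop_x and pop_y are lists with one entry per locus (same order in both);
    each entry is a dict mapping allele name -> frequency at that locus.
    Raises ValueError from math.log if the populations share no alleles.
    """
    jx = jy = jxy = 0.0
    for locus_x, locus_y in zip(pop_x, pop_y):
        alleles = set(locus_x) | set(locus_y)
        # Sums over loci; the per-locus averaging cancels in the ratio below.
        jx += sum(locus_x.get(a, 0.0) ** 2 for a in alleles)
        jy += sum(locus_y.get(a, 0.0) ** 2 for a in alleles)
        jxy += sum(locus_x.get(a, 0.0) * locus_y.get(a, 0.0) for a in alleles)
    identity = jxy / math.sqrt(jx * jy)
    return -math.log(identity)


# Hypothetical frequencies: one locus shared, one locus fixed for different alleles.
pop_a = [{"a": 1.0}, {"b": 1.0}]
pop_b = [{"a": 1.0}, {"c": 1.0}]
print(round(nei_standard_distance(pop_a, pop_b), 3))
```

    With these made-up frequencies the identity is 0.5 and the distance is ln 2, about 0.693.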

  13. Genetic divergence analysis of the Common Barn Owl Tyto alba (Scopoli, 1769) and the Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile using COI sequence.

    PubMed

    Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita

    2015-01-01

    In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the world. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from North America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) North America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy. PMID:26668551
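
    The mean p-distance quoted above is the proportion of aligned sites at which two sequences differ, averaged over sequence pairs. A minimal sketch follows, assuming pre-aligned sequences of equal length and skipping sites with gaps or ambiguous bases; the function names and the toy sequences are hypothetical, not data from the study.

```python
def p_distance(seq_a, seq_b, skip_chars="-N?"):
    """Proportion of compared sites at which two aligned sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in skip_chars or b in skip_chars:
            continue  # ignore gaps and ambiguous bases
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def mean_between_group_p_distance(group_a, group_b):
    """Average p-distance over all pairs with one sequence from each group."""
    distances = [p_distance(a, b) for a in group_a for b in group_b]
    return sum(distances) / len(distances)


# Toy aligned fragments (not real COI data).
chile = ["ACGTACGT"]
europe = ["ACGTACGA", "ACGAACGA"]
print(mean_between_group_p_distance(chile, europe))
```

    Multiplying the result by 100 gives the percentage form used in the abstract.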

  15. Genetic divergence analysis of the Common Barn Owl Tyto alba (Scopoli, 1769) and the Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile using COI sequence

    PubMed Central

    Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita

    2015-01-01

    In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the world. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from North America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) North America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy. PMID:26668551

  16. Extensive sequence divergence in the 3' inverted repeat of the chloroplast rbcL gene in non-flowering land plants and algae.

    PubMed

    Calie, P J; Manhart, J R

    1994-09-01

    A stem-loop region is present at the 3' terminus of the chloroplast rbcL mRNA in all taxa surveyed to date. In spinach, this structure has been shown by others to be involved in modulating transcript stability and correct 3' terminus processing, and is a conserved feature of other flowering plant rbcL mRNAs. In Chlamydomonas reinhardtii, an analogous structure has been shown by others to serve as a transcription terminator. Our sequencing data have shown that this region is highly divergent in several non-flowering land plants, as evidenced by representatives from the ferns, conifers, 'fern-allies' and liverworts. To extend our analysis, a computer-assisted survey of the stem-loop region of the 3' flanking region of published chloroplast rbcL genes was undertaken. The flowering plant rbcL inverted repeats (IR) were remarkably conserved in sequence, allowing for precise multiple alignments of both monocot and dicot sequences within a single matrix. Surprisingly, sequences obtained from non-flowering land plants, algae, photosynthetic protists and photosynthetic prokaryotes were extremely variant, in terms of both sequence composition and thermodynamic parameters.

  17. Refugial isolation and divergence in the Narrowheaded Gartersnake species complex (Thamnophis rufipunctatus) as revealed by multilocus DNA sequence data

    USGS Publications Warehouse

    Wood, Dustin A.; Vandergast, A.G.; Espinal, A. Lemos; Fisher, R.N.; Holycross, A.T.

    2011-01-01

    Glacial–interglacial cycles of the Pleistocene are hypothesized as one of the foremost contributors to biological diversification. This is especially true for cold-adapted montane species, where range shifts have had a pronounced effect on population-level divergence. Gartersnakes of the Thamnophis rufipunctatus species complex are restricted to cold headwater streams in the highlands of the Sierra Madre Occidental and southwestern USA. We used coalescent and multilocus phylogenetic approaches to test whether genetic diversification of this montane-restricted species complex is consistent with two prevailing models of range fluctuation for species affected by Pleistocene climate changes. Our concatenated nuDNA and multilocus species analyses recovered evidence for the persistence of multiple lineages that are restricted geographically, despite a mtDNA signature consistent with either more recent connectivity (and introgression) or recent expansion (and incomplete lineage sorting). Divergence times estimated using a relaxed molecular clock and fossil calibrations fall within the Late Pleistocene, and zero gene flow scenarios among current geographically isolated lineages could not be rejected. These results suggest that increased climate shifts in the Late Pleistocene have driven diversification and current range retraction patterns and that the differences between markers reflect the stochasticity of gene lineages (i.e. ancestral polymorphism) rather than gene flow and introgression. These results have important implications for the conservation of T. rufipunctatus (sensu novo), which is restricted to two drainage systems in the southwestern US and has undergone a recent and dramatic decline.

  18. New approaches for computer analysis of nucleic acid sequences.

    PubMed

    Karlin, S; Ghandour, G; Ost, F; Tavare, S; Korn, L J

    1983-09-01

    A new high-speed computer algorithm is outlined that ascertains within and between nucleic acid and protein sequences all direct repeats, dyad symmetries, and other structural relationships. Large repeats, repeats of high frequency, dyad symmetries of specified stem length and loop distance, and their distributions are determined. Significance of homologies is assessed by a hierarchy of permutation procedures. Applications are made to papovaviruses, the human papillomavirus HPV, lambda phage, the human and mouse mitochondrial genomes, and the human and mouse immunoglobulin kappa-chain genes. PMID:6577449
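
    The two structural features named in this abstract can be illustrated with a simple scan: direct repeats are identical words occurring at more than one position, and dyad symmetries are stretches followed, after a loop, by their own reverse complement. The sketch below is a naive illustration using exact matching, not the high-speed algorithm the authors describe; the function names and parameters are hypothetical.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def direct_repeats(seq, k):
    """Map each k-mer that occurs more than once to its start positions."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return {kmer: pos for kmer, pos in positions.items() if len(pos) > 1}


def dyad_symmetries(seq, stem, max_loop):
    """List (left-arm start, loop length) pairs for exact dyad symmetries.

    A hit means the stem-length word starting at the given position is
    followed, after a loop of at most max_loop bases, by its reverse complement.
    """
    hits = []
    for i in range(len(seq) - 2 * stem + 1):
        target = reverse_complement(seq[i:i + stem])
        for loop in range(max_loop + 1):
            j = i + stem + loop  # start of the candidate right arm
            if j + stem > len(seq):
                break
            if seq[j:j + stem] == target:
                hits.append((i, loop))
    return hits


print(direct_repeats("ACGTTTACGT", 4))
print(dyad_symmetries("GGGCAAAAGCCC", stem=4, max_loop=4))
```

    The abstract also mentions assessing significance by permutation procedures; that step is not shown here.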

  19. Purification of a marsupial insulin: amino-acid sequence of insulin from the eastern grey kangaroo Macropus giganteus.

    PubMed

    Treacy, G B; Shaw, D C; Griffiths, M E; Jeffrey, P D

    1989-03-24

    Insulin has been purified from kangaroo pancreas by acidic ethanol extraction, diethyl ether precipitation and gel filtration. The amino-acid sequence of this, the first marsupial insulin to be studied, is reported. It differs from human insulin by only four amino-acid substitutions, all in regions of the molecule previously known to be variable. However, it should be noted that one of these, asparagine for threonine at A8, has not been reported before. Computer comparisons of all 43 insulin sequences reported to date with kangaroo insulin show it to be most closely related to a group of mammalian insulins (dog, pig, cow, human) known to be of high biological potency. The measurement of blood glucose lowering in the rabbit by kangaroo insulin is consistent with this conclusion. Comparisons of amino-acid sequences of other proteins with their kangaroo counterparts show a greater difference, in line with the time of divergence of marsupials. The limited differences observed in insulin and cytochrome c suggest that their structures need to be closely conserved in order to maintain function.

  20. Divergent effects of flavone acetic acid on established versus developing tumour blood flow.

    PubMed Central

    Mahadevan, V.; Hart, I. R.

    1991-01-01

    Flavone acetic acid (FAA) exerts much of its effect by reducing tumour blood flow. Previous studies on FAA-induced changes in blood flow have used established tumours with a functional microvasculature. Using radioactive xenon (¹³³Xe) clearance to monitor local blood flow, we show that the effects of FAA are dependent on the presence of this functional microvasculature, with no evidence that FAA inhibits the actual development of tumour microcirculation. Thus, administration of multiple doses of FAA around the time of tumour cell injection failed to diminish t1/2 values of ¹³³Xe (e.g. t1/2 16 min for FAA vs 14 min for saline controls at 10 days) or to affect tumour volumes (5.55 ± 0.06 cm³ in FAA-treated animals vs 5.7 ± 1.3 cm³ in controls at 25 days). In marked contrast, a single dose of FAA (200 mg kg⁻¹ body weight) 2 weeks after tumour cell injection dramatically extended t1/2 times (47 min for FAA vs 7 min for controls; P < 0.001) and significantly reduced tumour burden. This effect is specific for tumour microvasculature and is not directed simply at new vessels, since a similar treatment of animals with implanted-sponge-induced granulation tissue had no effect on t1/2 times (6.8 ± 1.1 min for FAA at 200 mg kg⁻¹ vs 7.2 ± 1.0 min for saline-treated controls). PMID:1712621

  1. Identification of a Divergent Environmental DNA Sequence Clade Using the Phylogeny of Gregarine Parasites (Apicomplexa) from Crustacean Hosts

    PubMed Central

    Rueckert, Sonja; Simdyanov, Timur G.; Aleoshin, Vladimir V.; Leander, Brian S.

    2011-01-01

    Background: Environmental SSU rDNA surveys have significantly improved our understanding of microeukaryotic diversity. Many of the sequences acquired using this approach are closely related to lineages previously characterized at both morphological and molecular levels, making interpretation of these data relatively straightforward. Some sequences, by contrast, appear to be phylogenetic orphans and are sometimes inferred to represent “novel lineages” of unknown cellular identity. Consequently, interpretation of environmental DNA surveys of cellular diversity relies on an adequately comprehensive database of DNA sequences derived from identified species. Several major taxa of microeukaryotes, however, are still very poorly represented in these databases, and this is especially true for diverse groups of single-celled parasites, such as gregarine apicomplexans. Methodology/Principal Findings: This study attempts to address this paucity of DNA sequence data by characterizing four different gregarine species, isolated from the intestines of crustaceans, at both morphological and molecular levels: Thiriotia pugettiae sp. n. from the graceful kelp crab (Pugettia gracilis), Cephaloidophora cf. communis from two different species of barnacles (Balanus glandula and B. balanus), Heliospora cf. longissima from two different species of freshwater amphipods (Eulimnogammarus verrucosus and E. vittatus), and Heliospora caprellae comb. n. from a skeleton shrimp (Caprella alaskana). SSU rDNA sequences were acquired from isolates of these gregarine species and added to a global apicomplexan alignment containing all major groups of gregarines characterized so far. Molecular phylogenetic analyses of these data demonstrated that all of the gregarines collected from crustacean hosts formed a very strongly supported clade with 48 previously unidentified environmental DNA sequences. Conclusions/Significance: This expanded molecular phylogenetic context enabled us to establish a major clade

  2. Sequence and Ionomic Analysis of Divergent Strains of Maize Inbred Line B73 with an Altered Growth Phenotype

    PubMed Central

    Gahrtz, Manfred; Bucher, Marcel; Scholz, Uwe; Dresselhaus, Thomas

    2014-01-01

    Maize (Zea mays) is the most widely grown crop species in the world and a classical model organism for plant research. The completion of a high-quality reference genome sequence and the advent of high-throughput sequencing have greatly empowered re-sequencing studies in maize. In this study, plants of maize inbred line B73 descended from two different sets of seed material grown for several generations either in the field or in the greenhouse were found to show a different growth phenotype and ionome under phosphate starvation conditions and moreover a different responsiveness towards mycorrhizal fungi of the species Glomus intraradices (syn: Rhizophagus irregularis). Whole genome re-sequencing of individuals from both sets and comparison to the B73 reference sequence revealed three cryptic introgressions on chromosomes 1, 5 and 10 in the line grown in the greenhouse summing up to a total of 5,257 single-nucleotide polymorphisms (SNPs). Transcriptome sequencing of three individuals from each set lent further support to the location of the introgression intervals and confirmed them to be fixed in all sequenced individuals. Moreover, we identified >120 genes differentially expressed between the two B73 lines. We thus have found a nearly-isogenic line (NIL) of maize inbred line B73 that is characterized by an altered growth phenotype under phosphate starvation conditions and an improved responsiveness towards symbiosis with mycorrhizal fungi. Through next-generation sequencing of the genomes and transcriptomes we were able to delineate exact introgression intervals. Putative de novo mutations appeared approximately uniformly distributed along the ten maize chromosomes mainly representing G:C -> A:T transitions. The plant material described in this study will be a valuable tool both for functional studies of genes differentially expressed in both B73 lines and for research on growth behavior especially in response to symbiosis between maize and mycorrhizal fungi. 

  3. Testing models of speciation from genome sequences: divergence and asymmetric admixture in Island South-East Asian Sus species during the Plio-Pleistocene climatic fluctuations

    PubMed Central

    Frantz, Laurent A F; Madsen, Ole; Megens, Hendrik-Jan; Groenen, Martien A M; Lohse, Konrad

    2014-01-01

    In many temperate regions, ice ages promoted range contractions into refugia resulting in divergence (and potentially speciation), while warmer periods led to range expansions and hybridization. However, the impact these climatic oscillations had in many parts of the tropics remains elusive. Here, we investigate this issue using genome sequences of three pig (Sus) species, two of which are found on islands of the Sunda-shelf shallow seas in Island South-East Asia (ISEA). A previous study revealed signatures of interspecific admixture between these Sus species (Genome Biology, 14, 2013, R107). However, the timing, directionality and extent of this admixture remain unknown. Here, we use a likelihood-based model comparison to more finely resolve this admixture history and test whether it was mediated by humans or occurred naturally. Our analyses suggest that interspecific admixture between Sunda-shelf species was most likely asymmetric and occurred long before the arrival of humans in the region. More precisely, we show that these species diverged during the late Pliocene but around 23% of their genomes have been affected by admixture during the later Pleistocene climatic transition. In addition, we show that our method provides a significant improvement over D-statistics, which are uninformative about the direction of admixture. PMID:25294645

  4. Assessing genetic divergence in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism markers.

    PubMed

    Zhang, F; Ge, Y Y; Wang, W Y; Shen, X L; Yu, X Y

    2012-12-03

    Conventional hybridization and selection techniques have aided the development of new ornamental crop cultivars. However, little information is available on the genetic divergence of bromeliad hybrids. In the present study, we investigated the genetic variability in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism (SRAP) markers. The morphological analysis showed that the putative hybrids were intermediate between both parental species with respect to inflorescence characteristics. The 16 SRAP primer combinations yielded 265 bands, among which 154 (57.72%) were polymorphic. The genetic similarity was an average of 0.59 and ranged from 0.21 to 0.87, indicating moderate genetic divergence among the hybrids. The unweighted pair group method with arithmetic average (UPGMA)-based cluster analysis distinguished the hybrids from their parents with a genetic distance coefficient of 0.54. The cophenetic correlation was 0.93, indicating a good fit between the dendrogram and the original distance matrix. The two-dimensional plot from the principal coordinate analysis showed that the hybrids were intermediately dispersed between both parents, corresponding to the results of the UPGMA cluster and the morphological analysis. These results suggest that SRAP markers could help to identify breeders, characterize F(1) hybrids of bromeliads at an early stage, and expedite genetic improvement of bromeliad cultivars.
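
    Dominant markers such as SRAP are usually scored as band presence (1) or absence (0), and pairwise genetic similarity is then computed from shared bands. The sketch below uses the Jaccard coefficient as an assumption, since the abstract does not name the coefficient; the function names and the band profiles are hypothetical.

```python
def jaccard_similarity(bands_a, bands_b):
    """Shared bands divided by bands present in at least one of two profiles."""
    shared = sum(1 for a, b in zip(bands_a, bands_b) if a and b)
    either = sum(1 for a, b in zip(bands_a, bands_b) if a or b)
    if either == 0:
        raise ValueError("no bands present in either profile")
    return shared / either


def polymorphic_fraction(profiles):
    """Fraction of band positions that are not identical across all profiles."""
    columns = list(zip(*profiles))
    polymorphic = sum(1 for col in columns if len(set(col)) > 1)
    return polymorphic / len(columns)


# Made-up band profiles for two parents and one hybrid (four band positions).
parent_1 = [1, 1, 0, 1]
parent_2 = [1, 0, 0, 1]
hybrid = [1, 1, 1, 1]
print(jaccard_similarity(parent_1, parent_2))
print(polymorphic_fraction([parent_1, parent_2, hybrid]))
```

    A matrix of one minus these similarities is the kind of distance matrix that a UPGMA clustering would take as input.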

  5. Assessing genetic divergence in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism markers.

    PubMed

    Zhang, F; Ge, Y Y; Wang, W Y; Shen, X L; Yu, X Y

    2012-01-01

    Conventional hybridization and selection techniques have aided the development of new ornamental crop cultivars. However, little information is available on the genetic divergence of bromeliad hybrids. In the present study, we investigated the genetic variability in interspecific hybrids of Aechmea gomosepala and A. recurvata var. recurvata using inflorescence characteristics and sequence-related amplified polymorphism (SRAP) markers. The morphological analysis showed that the putative hybrids were intermediate between both parental species with respect to inflorescence characteristics. The 16 SRAP primer combinations yielded 265 bands, among which 154 (57.72%) were polymorphic. The genetic similarity was an average of 0.59 and ranged from 0.21 to 0.87, indicating moderate genetic divergence among the hybrids. The unweighted pair group method with arithmetic average (UPGMA)-based cluster analysis distinguished the hybrids from their parents with a genetic distance coefficient of 0.54. The cophenetic correlation was 0.93, indicating a good fit between the dendrogram and the original distance matrix. The two-dimensional plot from the principal coordinate analysis showed that the hybrids were intermediately dispersed between both parents, corresponding to the results of the UPGMA cluster and the morphological analysis. These results suggest that SRAP markers could help to identify breeders, characterize F(1) hybrids of bromeliads at an early stage, and expedite genetic improvement of bromeliad cultivars. PMID:23079994

  6. Karyotype divergence and spreading of 5S rDNA sequences between genomes of two species: darter and emerald gobies ( Ctenogobius , Gobiidae).

    PubMed

    Lima-Filho, P A; Bertollo, L A C; Cioffi, M B; Costa, G W W F; Molina, W F

    2014-01-01

    Karyotype analyses of the cryptobenthic marine species Ctenogobius boleosoma and C. smaragdus were performed by means of classical and molecular cytogenetics, including physical mapping of the multigene 18S and 5S rDNA families. C. boleosoma has 2n = 44 chromosomes (2 submetacentrics + 42 acrocentrics; FN = 46) with a single chromosome pair each carrying 18S and 5S ribosomal sites; whereas C. smaragdus has 2n = 48 chromosomes (2 submetacentrics + 46 acrocentrics; FN = 50), also with a single pair bearing 18S rDNA, but an extensive increase in the number of GC-rich 5S rDNA sites in 21 chromosome pairs. The highly divergent karyotypes among Ctenogobius species contrast with observations in several other marine fish groups, demonstrating an accelerated rate of chromosomal evolution mediated by both chromosomal rearrangements and the extensive dispersion of 5S rDNA sequences in the genome. PMID:24643007

  8. Reduced Representation Libraries from DNA Pools Analysed with Next Generation Semiconductor Based-Sequencing to Identify SNPs in Extreme and Divergent Pigs for Back Fat Thickness.

    PubMed

    Bovo, Samuele; Bertolini, Francesca; Schiavo, Giuseppina; Mazzoni, Gianluca; Dall'Olio, Stefania; Fontanesi, Luca

    2015-01-01

    The aim of this study was to identify single nucleotide polymorphisms (SNPs) that could be associated with back fat thickness (BFT) in pigs. To achieve this goal, we evaluated the potential and limits of an experimental design that combined several methodologies. DNA samples from two groups of Italian Large White pigs with divergent estimated breeding value (EBV) for BFT were separately pooled and sequenced, after preparation of reduced representation libraries (RRLs), using Ion Torrent technology. Using SNAPE for SNP calling in sequenced DNA pools, 39,165 SNPs were identified; one quarter of them were novel variants not reported in dbSNP. Combining sequencing data with Illumina PorcineSNP60 BeadChip genotyping results on the same animals, 661 genomic positions overlapped, with a good approximation of minor allele frequency estimation. A total of 54 SNPs showing enriched alleles in one or the other RRL might be potential markers associated with BFT. Some of these SNPs were close to genes involved in obesity-related phenotypes. PMID:25821781

  9. Reduced Representation Libraries from DNA Pools Analysed with Next Generation Semiconductor Based-Sequencing to Identify SNPs in Extreme and Divergent Pigs for Back Fat Thickness

    PubMed Central

    Bovo, Samuele; Bertolini, Francesca; Schiavo, Giuseppina; Mazzoni, Gianluca; Dall'Olio, Stefania; Fontanesi, Luca

    2015-01-01

    The aim of this study was to identify single nucleotide polymorphisms (SNPs) that could be associated with back fat thickness (BFT) in pigs. To achieve this goal, we evaluated the potential and limits of an experimental design that combined several methodologies. DNA samples from two groups of Italian Large White pigs with divergent estimated breeding value (EBV) for BFT were separately pooled and sequenced, after preparation of reduced representation libraries (RRLs), using Ion Torrent technology. Using SNAPE for SNP calling in sequenced DNA pools, 39,165 SNPs were identified; one quarter of them were novel variants not reported in dbSNP. Combining sequencing data with Illumina PorcineSNP60 BeadChip genotyping results on the same animals, 661 genomic positions overlapped, with a good approximation of minor allele frequency estimation. A total of 54 SNPs showing enriched alleles in one or the other RRL might be potential markers associated with BFT. Some of these SNPs were close to genes involved in obesity-related phenotypes. PMID:25821781

  10. Predicting protein disorder by analyzing amino acid sequence

    PubMed Central

    Yang, Jack Y; Yang, Mary Qu

    2008-01-01

    Background: Many protein regions and some entire proteins have no definite tertiary structure, presenting instead as dynamic, disordered ensembles under different physicochemical circumstances. These proteins and regions are known as Intrinsically Unstructured Proteins (IUP). IUP have been associated with a wide range of protein functions, along with roles in diseases characterized by protein misfolding and aggregation. Results: Identifying IUP is an important task in structural and functional genomics. We extract useful features from sequences and develop machine learning algorithms for this task. We compare our IUP predictor with PONDRs (mainly neural-network-based predictors), disEMBL (also based on neural networks) and Globplot (based on disorder propensity). Conclusion: We find that augmenting features derived from physicochemical properties of amino acids (such as hydrophobicity, complexity, etc.) and using an ensemble method proved beneficial. The IUP predictor is a viable alternative software tool for identifying IUP protein regions and proteins. PMID:18831799

  11. cDNA sequence of a human skeletal muscle ADP/ATP translocator: lack of a leader peptide, divergence from a fibroblast translocator cDNA, and coevolution with mitochondrial DNA genes

    SciTech Connect

    Neckelmann, N.; Li, K.; Wade, R.P.; Shuster, R.; Wallace, D.C.

    1987-11-01

    The authors have characterized a 1400-nucleotide cDNA for the human skeletal muscle ADP/ATP translocator. The deduced amino acid sequence is 94% homologous to the beef heart ADP/ATP translocator protein and contains only a single additional amino-terminal methionine. This implies that the human translocator lacks an amino-terminal targeting peptide, a conclusion substantiated by measuring the molecular weight of the protein synthesized in vitro. A 1400-nucleotide transcript encoding the skeletal muscle translocator was detected on blots of total RNA from human heart, kidney, skeletal muscle, and HeLa cells by hybridization with oligonucleotide probes homologous to the coding region and 3' noncoding region of the cDNA. However, the level of this mRNA varied substantially among tissues. Comparison of our skeletal muscle translocator sequence with that of a recently published human fibroblast translocator cognate revealed that the two proteins are 88% identical and diverged about 275 million years ago. Hence, tissues vary both in the level of expression of individual translocator genes and in differential expression of cognate translocator genes. Comparison of the base substitution rates of the ADP/ATP translocator and the oxidative phosphorylation genes encoded by mitochondrial DNA revealed that the mitochondrial DNA genes fix 10 times more synonymous substitutions and 12 times more replacement substitutions; yet, these nuclear and cytoplasmic respiration genes experience comparable evolutionary constraints. This suggests that the mitochondrial DNA genes are highly prone to deleterious mutations.

  12. Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros species (Ebenaceae) endemic to New Caledonia: many species, little divergence

    PubMed Central

    Turner, Barbara; Paun, Ovidiu; Munzinger, Jérôme; Chase, Mark W.; Samuel, Rosabelle

    2016-01-01

    Background and Aims Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species. Methods Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. Key Results The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. Conclusions In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with

  13. Human liver apolipoprotein B-100 cDNA: complete nucleic acid and derived amino acid sequence.

    PubMed Central

    Law, S W; Grant, S M; Higuchi, K; Hospattankar, A; Lackner, K; Lee, N; Brewer, H B

    1986-01-01

    Human apolipoprotein B-100 (apoB-100), the ligand on low density lipoproteins that interacts with the low density lipoprotein receptor and initiates receptor-mediated endocytosis and low density lipoprotein catabolism, has been cloned, and the complete nucleic acid and derived amino acid sequences have been determined. ApoB-100 cDNAs were isolated from normal human liver cDNA libraries utilizing immunoscreening as well as filter hybridization with radiolabeled apoB-100 oligodeoxynucleotides. The apoB-100 mRNA is 14.1 kilobases long encoding a mature apoB-100 protein of 4536 amino acids with a calculated amino acid molecular weight of 512,723. ApoB-100 contains 20 potential glycosylation sites, and 12 of a total of 25 cysteine residues are located in the amino-terminal region of the apolipoprotein providing a potential globular structure of the amino terminus of the protein. ApoB-100 contains relatively few regions of amphipathic helices, but compared to other human apolipoproteins it is enriched in beta-structure. The delineation of the entire human apoB-100 sequence will now permit a detailed analysis of the conformation of the protein, the low density lipoprotein receptor binding domain(s), and the structural relationship between apoB-100 and apoB-48 and will provide the basis for the study of genetic defects in apoB-100 in patients with dyslipoproteinemias. PMID:3464946

  14. Hydrogen Exchange Mass Spectrometry of Related Proteins with Divergent Sequences: A Comparative Study of HIV-1 Nef Allelic Variants

    NASA Astrophysics Data System (ADS)

    Wales, Thomas E.; Poe, Jerrod A.; Emert-Sedlak, Lori; Morgan, Christopher R.; Smithgall, Thomas E.; Engen, John R.

    2016-06-01

    Hydrogen exchange mass spectrometry can be used to compare the conformation and dynamics of proteins that are similar in tertiary structure. If relative deuterium levels are measured, differences in sequence, deuterium forward- and back-exchange, peptide retention time, and protease digestion patterns all complicate the data analysis. We illustrate what can be learned from such data sets by analyzing five variants (Consensus G2E, SF2, NL4-3, ELI, and LTNP4) of the HIV-1 Nef protein, both alone and when bound to the human Hck SH3 domain. Regions with similar sequence could be compared between variants. Although many of the hydrogen exchange features were preserved across the five proteins, the kinetics of Nef binding to Hck SH3 were not the same. These observations may be related to biological function, particularly for ELI Nef where we also observed an impaired ability to downregulate CD4 surface presentation. The data illustrate some of the caveats that must be considered for comparison experiments and provide a framework for investigations of other protein relatives, families, and superfamilies with HX MS.

  15. Structural Variant Detection by Large-scale Sequencing Reveals New Evolutionary Evidence on Breed Divergence between Chinese and European Pigs

    PubMed Central

    Zhao, Pengju; Li, Junhui; Kang, Huimin; Wang, Haifei; Fan, Ziyao; Yin, Zongjun; Wang, Jiafu; Zhang, Qin; Wang, Zhiquan; Liu, Jian-Feng

    2016-01-01

    In this study, we performed genome-wide SV detection among the genomes of thirteen pigs from diverse breeds of Chinese and European origin by next-generation sequencing, and constructed a single-nucleotide resolution map involving 56,930 putative SVs. We first identified an SV hotspot spanning a 35 Mb region on the X chromosome specifically in the genomes of individuals of Chinese origin. Further scrutinizing this region using large-scale sequencing data from an additional 111 individuals, we obtained confirmatory evidence for our initial finding. Moreover, thirty-five SV-related genes within the hotspot region, which are important for reproductive ability, showed significantly different evolutionary rates between breeds of Chinese and European origin. The SV hotspot identified herein offers novel evidence for assessing phylogenetic relationships, and likely explains the genetic differences in corresponding phenotypes and features between Chinese and European pig breeds. Furthermore, we employed various SVs to infer the genetic structure of the individuals surveyed. We found that SVs can clearly detect differences in genetic background among individuals. This suggests that genome-wide SVs can capture the majority of genetic variation and be applied to cladistic analyses. Characterizing whole-genome SVs demonstrated that SVs are significantly enriched/depleted with various genomic features. PMID:26729041

  16. Gene duplication and divergence produce divergent MHC genotypes without disassortative mating.

    PubMed

    Dearborn, Donald C; Gager, Andrea B; McArthur, Andrew G; Gilmour, Morgan E; Mandzhukova, Elena; Mauck, Robert A

    2016-09-01

    Genes of the major histocompatibility complex (MHC) exhibit heterozygote advantage in immune defence, which in turn can select for MHC-disassortative mate choice. However, many species lack this expected pattern of MHC-disassortative mating. A possible explanation lies in evolutionary processes following gene duplication: if two duplicated MHC genes become functionally diverged from each other, offspring will inherit diverse multilocus genotypes even under random mating. We used locus-specific primers for high-throughput sequencing of two expressed MHC Class II B genes in Leach's storm-petrels, Oceanodroma leucorhoa, and found that exon 2 alleles fall into two gene-specific monophyletic clades. We tested for disassortative vs. random mating at these two functionally diverged Class II B genes, using multiple metrics and different subsets of exon 2 sequence data. With good statistical power, we consistently found random assortment of mates at MHC. Despite random mating, birds had MHC genotypes with functionally diverged alleles, averaging 13 amino acid differences in pairwise comparisons of exon 2 alleles within individuals. To test whether this high MHC diversity in individuals is driven by evolutionary divergence of the two duplicated genes, we built a phylogenetic permutation model. The model showed that genotypic diversity was strongly impacted by sequence divergence between the most common allele of each gene, with a smaller additional impact of monophyly of the two genes. Divergence of allele sequences between genes may have reduced the benefits of actively seeking MHC-dissimilar mates, in which case the evolutionary history of duplicated genes is shaping the adaptive landscape of sexual selection. PMID:27376487

  17. Divergent synthesis of chiral heterocycles via sequencing of enantioselective three-component reactions and one-pot subsequent cyclization reactions.

    PubMed

    Tang, Min; Xing, Dong; Huang, Haoxi; Hu, Wenhao

    2015-07-01

    A highly efficient sequencing of catalytic asymmetric three-component reactions of alcohols, diazo compounds and aldimines/aldehydes with one-pot subsequent cyclization reactions is reported. The development of a robust and versatile Rh(II)/Zr(IV)-BINOL co-catalytic system not only gives high diastereo- and enantioselective control of the three-component reaction, but also shows excellent functional-group tolerance that allows a wide range of functionalities to be pre-installed in each component and readily undergo one-pot subsequent cyclization reactions, thus providing rapid and diversity-oriented synthesis (DOS) of different types of chiral nitrogen- and/or oxygen-containing polyfunctional heterocycles. PMID:25864421

  18. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nucleic Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  19. Characterization of a newly identified mycobacterial tautomerase with promiscuous dehalogenase and hydratase activities reveals a functional link to a recently diverged cis-3-chloroacrylic acid dehalogenase.

    PubMed

    Baas, Bert-Jan; Zandvoort, Ellen; Wasiel, Anna A; Quax, Wim J; Poelarends, Gerrit J

    2011-04-12

    The enzyme cis-3-chloroacrylic acid dehalogenase (cis-CaaD) is found in a bacterial pathway that degrades a synthetic nematocide, cis-1,3-dichloropropene, introduced in the 20th century. The previously determined crystal structure of cis-CaaD and its promiscuous phenylpyruvate tautomerase (PPT) activity link this dehalogenase to the tautomerase superfamily, a group of homologous proteins that are characterized by a catalytic amino-terminal proline and a β-α-β structural fold. The low-level PPT activity of cis-CaaD, which may be a vestige of the function of its progenitor, prompted us to search the databases for a homologue of cis-CaaD that was annotated as a putative tautomerase and test both its PPT and cis-CaaD activity. We identified a mycobacterial cis-CaaD homologue (designated MsCCH2) that shares key sequence and active site features with cis-CaaD. Kinetic and 1H NMR spectroscopic studies show that MsCCH2 functions as an efficient PPT and exhibits low-level promiscuous dehalogenase activity, processing both cis- and trans-3-chloroacrylic acid. To further probe the active site of MsCCH2, the enzyme was incubated with 2-oxo-3-pentynoate (2-OP). At pH 8.5, MsCCH2 is inactivated by 2-OP due to the covalent modification of Pro-1, suggesting that Pro-1 functions as a nucleophile at pH 8.5 and attacks 2-OP in a Michael-type reaction. At pH 6.5, however, MsCCH2 exhibits hydratase activity and converts 2-OP to acetopyruvate, which implies that Pro-1 is cationic at pH 6.5 and not functioning as a nucleophile. At pH 7.5, the hydratase and inactivation reactions occur simultaneously. From these results, it can be inferred that Pro-1 of MsCCH2 has a pKa value that lies in between that of a typical tautomerase (pKa of Pro-1∼6) and that of cis-CaaD (pKa of Pro-1∼9). The shared activities and structural features, coupled with the intermediate pKa of Pro-1, suggest that MsCCH2 could be characteristic of an evolutionary intermediate along the past route for the

  20. Rapid Sequence and Expression Divergence Suggest Selection for Novel Function in Primate-Specific KRAB-ZNF Genes

    PubMed Central

    Nowick, Katja; Hamilton, Aaron T.; Zhang, Huimin; Stubbs, Lisa

    2010-01-01

    Recent segmental duplications (SDs), arising from duplication events that occurred within the past 35–40 My, have provided a major resource for the evolution of proteins with primate-specific functions. KRAB zinc finger (KRAB-ZNF) transcription factor genes are overrepresented among genes contained within these recent human SDs. Here, we examine the structural and functional diversity of the 70 human KRAB-ZNF genes involved in the most recent primate SD events including genes that arose in the hominid lineage. Despite their recent advent, many parent–daughter KRAB-ZNF gene pairs display significant differences in zinc finger structure and sequence, expression, and splicing patterns, each of which could significantly alter the regulatory functions of the paralogous genes. Paralogs that emerged on the lineage to humans and chimpanzees have undergone more evolutionary changes per unit of time than genes already present in the common ancestor of rhesus macaques and great apes. Taken together, these data indicate that a substantial fraction of the recently evolved primate-specific KRAB-ZNF gene duplicates have acquired novel functions that may possibly define novel regulatory pathways and suggest an active ongoing selection for regulatory diversity in primates. PMID:20573777

  1. Determining divergence times with a protein clock: update and reevaluation

    NASA Technical Reports Server (NTRS)

    Feng, D. F.; Cho, G.; Doolittle, R. F.; Bada, J. L. (Principal Investigator)

    1997-01-01

    A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.

  2. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza.

    PubMed

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  3. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  5. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    PubMed

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Random mutations and natural selection have driven the evolution of the protein amino acid sequences that we observe at present in nature. The question of which is the dominant force in protein evolution still lacks an unambiguous answer. Random mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that a well-tuned amino acid sequence could reasonably be indistinguishable from a random one. In order to study the possibility of discriminating between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones. PMID:26656109

  7. Genomewide ancestry and divergence patterns from low-coverage sequencing data reveal a complex history of admixture in wild baboons.

    PubMed

    Wall, Jeffrey D; Schlebusch, Stephen A; Alberts, Susan C; Cox, Laura A; Snyder-Mackler, Noah; Nevonen, Kimberly A; Carbone, Lucia; Tung, Jenny

    2016-07-01

    Naturally occurring admixture has now been documented in every major primate lineage, suggesting its key role in primate evolutionary history. Active primate hybrid zones can provide valuable insight into this process. Here, we investigate the history of admixture in one of the best-studied natural primate hybrid zones, between yellow baboons (Papio cynocephalus) and anubis baboons (Papio anubis) in the Amboseli ecosystem of Kenya. We generated a new genome assembly for yellow baboon and low-coverage genomewide resequencing data from yellow baboons, anubis baboons and known hybrids (n = 44). Using a novel composite likelihood method for estimating local ancestry from low-coverage data, we found high levels of genetic diversity and genetic differentiation between the parent taxa, and excellent agreement between genome-scale ancestry estimates and a priori pedigree, life history and morphology-based estimates (r² = 0.899). However, even putatively unadmixed Amboseli yellow individuals carried a substantial proportion of anubis ancestry, presumably due to historical admixture. Further, the distribution of shared vs. fixed differences between a putatively unadmixed Amboseli yellow baboon and an unadmixed anubis baboon, both sequenced at high coverage, is inconsistent with simple isolation-migration or equilibrium migration models. Our findings suggest a complex process of intermittent contact that has occurred multiple times in baboon evolutionary history, despite no obvious fitness costs to hybrids or major geographic or behavioural barriers. In combination with the extensive phenotypic data available for baboon hybrids, our results provide valuable context for understanding the history of admixture in primates, including in our own lineage. PMID:27145036

  8. Sequencing of Pax6 loci from the elephant shark reveals a family of Pax6 genes in vertebrate genomes, forged by ancient duplications and divergences.

    PubMed

    Ravi, Vydianathan; Bhatia, Shipra; Gautier, Philippe; Loosli, Felix; Tay, Boon-Hui; Tay, Alice; Murdoch, Emma; Coutinho, Pedro; van Heyningen, Veronica; Brenner, Sydney; Venkatesh, Byrappa; Kleinjan, Dirk A

    2013-01-01

    Pax6 is a developmental control gene essential for eye development throughout the animal kingdom. In addition, Pax6 plays key roles in other parts of the CNS, olfactory system, and pancreas. In mammals a single Pax6 gene encoding multiple isoforms delivers these pleiotropic functions. Here we provide evidence that the genomes of many other vertebrate species contain multiple Pax6 loci. We sequenced Pax6-containing BACs from the cartilaginous elephant shark (Callorhinchus milii) and found two distinct Pax6 loci. Pax6.1 is highly similar to mammalian Pax6, while Pax6.2 encodes a paired-less Pax6. Using synteny relationships, we identify homologs of this novel paired-less Pax6.2 gene in lizard and in frog, as well as in zebrafish and in other teleosts. In zebrafish two full-length Pax6 duplicates were known previously, originating from the fish-specific genome duplication (FSGD) and expressed in divergent patterns due to paralog-specific loss of cis-elements. We show that teleosts other than zebrafish also maintain duplicate full-length Pax6 loci, but differences in gene and regulatory domain structure suggest that these Pax6 paralogs originate from a more ancient duplication event and are hence renamed as Pax6.3. Sequence comparisons between mammalian and elephant shark Pax6.1 loci highlight the presence of short- and long-range conserved noncoding elements (CNEs). Functional analysis demonstrates the ancient role of long-range enhancers for Pax6 transcription. We show that the paired-less Pax6.2 ortholog in zebrafish is expressed specifically in the developing retina. Transgenic analysis of elephant shark and zebrafish Pax6.2 CNEs with homology to the mouse NRE/Pα internal promoter revealed highly specific retinal expression. Finally, morpholino depletion of zebrafish Pax6.2 resulted in a "small eye" phenotype, supporting a role in retinal development. In summary, our study reveals that the pleiotropic functions of Pax6 in vertebrates are served by a divergent

  9. Switchable stereocontrolled divergent synthesis induced by aza-Michael addition of deactivated primary amines under acid catalysis.

    PubMed

    Amara, Z; Drège, E; Troufflard, C; Retailleau, P; Tran Huu-Dau, M-E; Joseph, D

    2014-11-24

    Switchable tandem intramolecular aza-Michael/Michael and double aza-Michael reactions allow the oriented synthesis of highly functionalised cyclic skeletons. Conjugate addition of deactivated anilines triggers chemo- and stereo-divergent ring-closure reaction pathways with a striking selectivity depending on reaction conditions.

  10. Expression of transfected vimentin genes in differentiating murine erythroleukemia cells reveals divergent cis-acting regulation of avian and mammalian vimentin sequences.

    PubMed Central

    Ngai, J; Bond, V C; Wold, B J; Lazarides, E

    1987-01-01

    We studied the expression of transfected chicken and hamster vimentin genes in murine erythroleukemia (MEL) cells. MEL cells normally repress the levels of endogenous mouse vimentin mRNA during inducer-mediated differentiation, resulting in a subsequent loss of vimentin filaments. Expression of vimentin in differentiating MEL cells reflects the disappearance of vimentin filaments during mammalian erythropoiesis in vivo. In contrast, chicken erythroid cells express high levels of vimentin mRNA and vimentin filaments during terminal differentiation. We demonstrate here that chicken vimentin mRNA levels increase significantly in differentiating transfected MEL cells, whereas similarly transfected hamster vimentin genes are negatively regulated. In conjunction with in vitro nuclear run-on transcription experiments, these results suggest that the difference in vimentin expression in avian and mammalian erythropoiesis is due to a divergence of cis-linked vimentin sequences that are responsible for transcriptional and posttranscriptional regulation of vimentin gene expression. Transfected chicken vimentin genes produce functional vimentin protein and stable vimentin filaments during MEL cell differentiation, further demonstrating that the accumulation of vimentin filaments is determined by the abundance of newly synthesized vimentin. PMID:3481037

  11. Genetic Characterization of Betacoronavirus Lineage C Viruses in Bats Reveals Marked Sequence Divergence in the Spike Protein of Pipistrellus Bat Coronavirus HKU5 in Japanese Pipistrelle: Implications for the Origin of the Novel Middle East Respiratory Syndrome Coronavirus

    PubMed Central

    Lau, Susanna K. P.; Li, Kenneth S. M.; Tsang, Alan K. L.; Lam, Carol S. F.; Ahmed, Shakeel; Chen, Honglin; Chan, Kwok-Hung

    2013-01-01

    While the novel Middle East respiratory syndrome coronavirus (MERS-CoV) is closely related to Tylonycteris bat CoV HKU4 (Ty-BatCoV HKU4) and Pipistrellus bat CoV HKU5 (Pi-BatCoV HKU5) in bats from Hong Kong, and other potential lineage C betacoronaviruses in bats from Africa, Europe, and America, its animal origin remains obscure. To better understand the role of bats in its origin, we examined the molecular epidemiology and evolution of lineage C betacoronaviruses among bats. Ty-BatCoV HKU4 and Pi-BatCoV HKU5 were detected in 29% and 25% of alimentary samples from lesser bamboo bat (Tylonycteris pachypus) and Japanese pipistrelle (Pipistrellus abramus), respectively. Sequencing of their RNA polymerase (RdRp), spike (S), and nucleocapsid (N) genes revealed that MERS-CoV is more closely related to Pi-BatCoV HKU5 in RdRp (92.1% to 92.3% amino acid [aa] identity) but is more closely related to Ty-BatCoV HKU4 in S (66.8% to 67.4% aa identity) and N (71.9% to 72.3% aa identity). Although both viruses were under purifying selection, the S of Pi-BatCoV HKU5 displayed marked sequence polymorphisms and more positively selected sites than that of Ty-BatCoV HKU4, suggesting that Pi-BatCoV HKU5 may generate variants to occupy new ecological niches along with its host in diverse habitats. Molecular clock analysis showed that they diverged from a common ancestor with MERS-CoV at least several centuries ago. Although MERS-CoV may have diverged from potential lineage C betacoronaviruses in European bats more recently, these bat viruses were unlikely to be the direct ancestor of MERS-CoV. Intensive surveillance for lineage C betaCoVs in Pipistrellus and related bats with diverse habitats and other animals in the Middle East may fill the evolutionary gap. PMID:23720729

  12. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  13. Amino acid sequence of horseshoe crab, Tachypleus tridentatus, striated muscle troponin C.

    PubMed

    Kobayashi, T; Kagami, O; Takagi, T; Konishi, K

    1989-05-01

    The amino acid sequence of troponin C obtained from horseshoe crab, Tachypleus tridentatus, striated muscle was determined by sequence analysis and alignments of chemically and enzymatically cleaved peptides. Troponin C is composed of 153 amino acid residues with a blocked N-terminus and contains no tryptophan or cysteine residue. The site I, one of the four Ca2+-binding sites, is considered to have lost its ability to bind Ca2+ owing to the replacements of certain amino acid residues.

  14. Amino acid sequence of myoglobin from the chiton Liolophura japonica and a phylogenetic tree for molluscan globins.

    PubMed

    Suzuki, T; Furukohri, T; Okamoto, S

    1993-02-01

    Myoglobin was isolated from the radular muscle of the chiton Liolophura japonica, a primitive archigastropodic mollusc. Liolophura contains three monomeric myoglobins (I, II, and III), and the complete amino acid sequence of myoglobin I has been determined. It is composed of 145 amino acid residues, and the molecular mass was calculated to be 16,070 D. The E7 distal histidine, which is replaced by valine or glutamine in several molluscan globins, is conserved in Liolophura myoglobin. The autoxidation rate at physiological conditions indicated that Liolophura oxymyoglobin is fairly stable when compared with other molluscan myoglobins. The amino acid sequence of Liolophura myoglobin shows low homology (11-21%) with molluscan dimeric myoglobins and hemoglobins, but shows higher homology (26-29%) with monomeric myoglobins from the gastropodic molluscs Aplysia, Dolabella, and Bursatella. A phylogenetic tree was constructed from 19 molluscan globin sequences. The tree separated them into two distinct clusters, a cluster for muscle myoglobins and a cluster for erythrocyte or gill hemoglobins. The myoglobin cluster is divided further into two subclusters, corresponding to monomeric and dimeric myoglobins, respectively. Liolophura myoglobin was placed on the branch of monomeric myoglobin lineage, showing that it diverged earlier from other monomeric myoglobins. The hemoglobin cluster is also divided into two subclusters. One cluster contains homodimeric, heterodimeric, tetrameric, and didomain chains of erythrocyte hemoglobins of the blood clams Anadara, Scapharca, and Barbatia. Of special interest is the other subcluster. It consists of three hemoglobin chains derived from the bacterial symbiont-harboring clams Calyptogena and Lucina, in which hemoglobins are supposed to play an important role in maintaining the symbiosis with sulfide bacteria.

  15. Adaptive gene expression divergence inferred from population genomics.

    PubMed

    Holloway, Alisha K; Lawniczak, Mara K N; Mezey, Jason G; Begun, David J; Jones, Corbin D

    2007-10-01

    Detailed studies of individual genes have shown that gene expression divergence often results from adaptive evolution of regulatory sequence. Genome-wide analyses, however, have yet to unite patterns of gene expression with polymorphism and divergence to infer population genetic mechanisms underlying expression evolution. Here, we combined genomic expression data--analyzed in a phylogenetic context--with whole genome light-shotgun sequence data from six Drosophila simulans lines and reference sequences from D. melanogaster and D. yakuba. These data allowed us to use molecular population genetics to test for neutral versus adaptive gene expression divergence on a genomic scale. We identified recent and recurrent adaptive evolution along the D. simulans lineage by contrasting sequence polymorphism within D. simulans to divergence from D. melanogaster and D. yakuba. Genes that evolved higher levels of expression in D. simulans have experienced adaptive evolution of the associated 3' flanking and amino acid sequence. Concomitantly, these genes are also decelerating in their rates of protein evolution, which is in agreement with the finding that highly expressed genes evolve slowly. Interestingly, adaptive evolution in 5' cis-regulatory regions did not correspond strongly with expression evolution. Our results provide a genomic view of the intimate link between selection acting on a phenotype and associated genic evolution.

  16. Fatty acid composition of subcutaneous adipose tissue from entire male pigs with extremely divergent levels of boar taint compounds--an exploratory study.

    PubMed

    Mörlein, Daniel; Tholen, Ernst

    2015-01-01

    This exploratory study investigated the variability of fatty acid composition in entire male pigs with extremely divergent levels of boar taint compounds. Fatty acids were quantified in back fat samples from 20 selected carcasses of Pietrain*F1 sired boars (average carcass weight 84 kg) with extremely low (LL) or extremely high (HH) levels of androstenone, skatole, and indole. Concentrations of polyunsaturated fatty acids (PUFA) were significantly (p<0.05) increased in LL boars (23.4%) compared to HH boars (19.7%). This was mainly due to increased levels of linoleic acid (C18:2 n-6) and α-linolenic acid (C18:3 n-3). Correspondingly, saturated fatty acids (SFA) were significantly lower (p<0.05) in LL boars (35.2%) compared to HH boars (37.7%). The findings are discussed with respect to potential effects on flavor formation in boar fat and meat. Further research is needed to study the gender specificity and the interplay of the synthesis and the metabolism of steroids, lipids, and the clearance of skatole in pigs. PMID:25280356

  17. Trichomonas vaginalis acidic phospholipase A2: isolation and partial amino acid sequence.

    PubMed

    Escobedo-Guajardo, Brenda L; González-Salazar, Francisco; Palacios-Corona, Rebeca; Torres de la Cruz, Víctor M; Morales-Vallarta, Mario; Mata-Cárdenas, Benito D; Garza-González, Jesús N; Rivera-Silva, Gerardo; Vargas-Villarreal, Javier

    2013-12-01

    Sexually transmitted diseases are a major cause of acute disease worldwide, and trichomoniasis is the most common and curable disease, generating more than 170 million cases annually worldwide. Trichomonas vaginalis is the causal agent of trichomoniasis and has the ability to destroy in vitro cell monolayers of the vaginal mucosa, where the phospholipases A2 (PLA2) have been reported as potential virulence factors. These enzymes have been partially characterized from the subcellular fraction S30 of pathogenic T. vaginalis strains. The main objective of this study was to purify a phospholipase A2 from T. vaginalis, make a partial characterization, obtain a partial amino acid sequence, and determine its enzymatic participation as a hemolytic factor causing lysis of erythrocytes. Trichomonas S30, RF30 and UFF30 sub-fractions from the GT-15 strain have the capacity to hydrolyze [2-(14)C-PA]-PC at pH 6.0. Proteins from the UFF30 sub-fraction were separated by affinity chromatography into two eluted fractions with detectable PLA2 activity. The EDTA-eluted fraction was analyzed by HPLC using on-line HPLC-tandem mass spectrometry and two protein peaks were observed at 8.2 and 13 kDa. Peptide sequences were identified from the proteins present in the eluted EDTA UFF30 fraction; bioinformatic analysis using Protein Link Global Server charged with T. vaginalis protein database suggests that eluted peptides correspond to a putative ubiquitin protein in the 8.2 kDa fraction and a phospholipase preserved in the 13 kDa fraction. The EDTA-eluted fraction hydrolyzed [2-(14)C-PA]-PC and lysed erythrocytes from Sprague-Dawley rats in a time- and dose-dependent manner. The acidic hemolytic activity decreased by 84% with the addition of 100 μM of Rosenthal's inhibitor. PMID:24338313

  19. tax and rex Sequences of bovine leukaemia virus from globally diverse isolates: rex amino acid sequence more variable than tax.

    PubMed

    McGirr, K M; Buehring, G C

    2005-02-01

    Bovine leukaemia virus (BLV) is an important agricultural problem with high costs to the dairy industry. Here, we examine the variation of the tax and rex genes of BLV. The tax and rex genes share 420 bases and have overlapping reading frames. The tax gene encodes a protein that functions as a transactivator of the BLV promoter, is required for viral replication, acts on cellular promoters, and is responsible for oncogenesis. The rex facilitates the export of viral mRNAs from the nucleus and regulates transcription. We have sequenced five new isolates of the tax/rex gene. We examined the five new and three previously published tax/rex DNA and predicted amino acid sequences of BLV isolates from cattle in representative regions worldwide. The highest variation among nucleic acid sequences for tax and rex was 7% and 5%, respectively; among predicted amino acid sequences for Tax and Rex, 9% and 11%, respectively. Significantly more nucleotide changes resulted in predicted amino acid changes in the rex gene than in the tax gene (P ≤ 0.0006). This variability is higher than previously reported for any region of the viral genome. This research may also have implications for the development of Tax-based vaccines. PMID:15702995

  20. A nucleic acid sequence-based amplification system for detection of Listeria monocytogenes hlyA sequences.

    PubMed Central

    Blais, B W; Turner, G; Sooknanan, R; Malek, L T

    1997-01-01

    A nucleic acid sequence-based amplification system primarily targeting mRNA from the Listeria monocytogenes hlyA gene was developed. This system enabled the detection of low numbers (< 10 CFU/g) of L. monocytogenes cells inoculated into a variety of dairy and egg products after 48 h of enrichment in modified listeria enrichment broth. PMID:8979357

  1. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  2. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  3. The amino acid sequence of elephant (Elephas maximus) myoglobin and the phylogeny of Proboscidea.

    PubMed

    Dene, H; Goodman, M; Romero-Herrera, A E

    1980-02-13

    The complete amino acid sequence of skeletal myoglobin from the Asian elephant (Elephas maximus) is reported. The functional significance of variations seen when this sequence is compared with that of sperm whale myoglobin is explored in the light of the crystallographic model available for the latter molecule. The phylogenetic implications of the elephant myoglobin amino acid sequence are evaluated by using the maximum parsimony technique. A similar analysis is also presented which incorporates all of the proteins sequenced from the elephant. These results are discussed with respect to current views on proboscidean phylogeny.

  4. Facile Analysis and Sequencing of Linear and Branched Peptide Boronic Acids by MALDI Mass Spectrometry

    PubMed Central

    Crumpton, Jason; Zhang, Wenyu; Santos, Webster

    2011-01-01

    Interest in peptides incorporating boronic acid moieties is increasing due to their potential as therapeutics/diagnostics for a variety of diseases such as cancer. The utility of peptide boronic acids may be expanded with access to vast libraries that can be deconvoluted rapidly and economically. Unfortunately, current detection protocols using mass spectrometry are laborious and confounded by boronic acid trimerization, which requires time-consuming analysis of dehydration products. These issues are exacerbated when the peptide sequence is unknown, as with de novo sequencing, and especially when multiple boronic acid moieties are present. Thus, a rapid, reliable and simple method for peptide identification is of utmost importance. Herein, we report the identification and sequencing of linear and branched peptide boronic acids containing up to five boronic acid groups by matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS). Protocols for preparation of pinacol boronic esters were adapted for efficient MALDI analysis of peptides. Additionally, a novel peptide boronic acid detection strategy was developed in which 2,5-dihydroxybenzoic acid (DHB) served as both matrix and derivatizing agent in a convenient, in situ, on-plate esterification. Finally, we demonstrate that DHB-modified peptide boronic acids from a single bead can be analyzed by MALDI-MSMS analysis, validating our approach for the identification and sequencing of branched peptide boronic acid libraries. PMID:21449540

  5. Evolution of an Enzyme from a Noncatalytic Nucleic Acid Sequence.

    PubMed

    Gysbers, Rachel; Tram, Kha; Gu, Jimmy; Li, Yingfu

    2015-01-01

    The mechanism by which enzymes arose from both abiotic and biological worlds remains an unsolved natural mystery. We postulate that an enzyme can emerge from any sequence of any functional polymer under permissive evolutionary conditions. To support this premise, we have arbitrarily chosen a 50-nucleotide DNA fragment encoding for the Bos taurus (cattle) albumin mRNA and subjected it to test-tube evolution to derive a catalytic DNA (DNAzyme) with RNA-cleavage activity. After only a few weeks, a DNAzyme with significant catalytic activity has surfaced. Sequence comparison reveals that seven nucleotides are responsible for the conversion of the noncatalytic sequence into the enzyme. Deep sequencing analysis of DNA pools along the evolution trajectory has identified individual mutations as the progressive drivers of the molecular evolution. Our findings demonstrate that an enzyme can indeed arise from a sequence of a functional polymer via permissive molecular evolution, a mechanism that may have been exploited by nature for the creation of the enormous repertoire of enzymes in the biological world today. PMID:26091540

  6. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  7. Amino acid sequence similarity of the VP7 protein of human rotavirus HCR3 to that of canine and feline rotaviruses.

    PubMed

    Li, B; Clark, H F; Gouvea, V

    1994-01-01

    The sequence of the VP7 gene of human rotavirus strain HCR3 was determined and its predicted amino acid sequence was compared with that of other rotavirus strains. The VP7 gene is 1062 nucleotides long and contains a single open reading frame of 981 nucleotides capable of encoding a protein of 326 amino acids. The VP7 amino acid sequence similarity of strain HCR3 to those of various human and animal G3 serotypes ranged from 88.7 to 99.4%, and from 60.4 to 88.3% to strains representing each of the other 13 G serotypes. Alignment of four variable regions [VR4, VR5(A), VR7(B) and VR8(C)] of HCR3 with those of G3 strains of different host species showed that HCR3 possesses a sequence almost identical to that of canine rotaviruses and feline rotavirus strain CAT97 in all four regions. A considerable divergence in regions VR4, VR7(B) and VR8(C) was found with strains of human, mouse, monkey, horse and rabbit rotaviruses. This observation together with results of our previous study on VP4 indicated that human rotavirus HCR3 is genetically more closely related to animal rotaviruses than to other human rotaviruses. PMID:8113730
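
    The lengths quoted in this record can be checked with a line of arithmetic: 981 nucleotides make 327 codons, and 326 amino acids follow if the last codon is a stop codon. The short sketch below assumes that reading (the abstract does not say so explicitly); the function name `protein_length` is hypothetical and used only for illustration.

```python
def protein_length(orf_nt: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an open reading frame of orf_nt nucleotides."""
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_nt // 3
    # Assumption: the quoted ORF length counts the terminating stop codon,
    # which does not contribute an amino acid.
    return codons - 1 if includes_stop else codons


print(protein_length(981))  # 327 codons, one of them assumed to be the stop
```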

  8. Infectious salmon anaemia virus (ISAV) isolated from the ISA disease outbreaks in Chile diverged from ISAV isolates from Norway around 1996 and was disseminated around 2005, based on surface glycoprotein gene sequences

    PubMed Central

    Kibenge, Frederick SB; Godoy, Marcos G; Wang, Yingwei; Kibenge, Molly JT; Gherardelli, Valentina; Mansilla, Soledad; Lisperger, Angelica; Jarpa, Miguel; Larroquete, Geraldine; Avendaño, Fernando; Lara, Marcela; Gallardo, Alicia

    2009-01-01

    ISA outbreaks that started in June 2007. The genes of the F and HE glycoproteins were cloned and sequenced for 51 and 78 new isolates, respectively. An extensive comparative analysis of ISAV F and HE sequence data, including reference isolates sampled from Norway, Faroe Islands, Scotland, USA, and Canada was performed. Based on phylogenetic analysis of concatenated ISAV F and HE genes of 103 individual isolates, the isolates from the ISA outbreaks in Chile grouped in their own cluster of 7 distinct strains within Genotype I (European genotype) of ISAV, with the closest relatedness to Norwegian ISAVs isolated in 1997. The phylogenetic software program, BACKTRACK, estimated the Chile isolates diverged from Norway isolates about 1996 and, therefore, had been present in Chile for some time before the recent outbreaks. Analysis of the deduced F protein sequence showed 43 of 51 Chile isolates with an 11-amino acid insert between 265N and 266Q, with 100% sequence identity with Genotype I ISAV RNA segment 2. Twenty four different HE-HPRs, including HPR0, were detected, with HPR7b making up 79.7%. This is considered a manifestation of ISAV quasispecies HE protein sequence diversity. Conclusion: Taken together, these findings suggest that the ISA outbreaks were caused by virus that was already present in Chile that mutated to new strains. This is the first comprehensive report tracing ISAV from Europe to South America. PMID:19558648

  9. Statistics of divergence times.

    PubMed

    Haubold, B; Wiehe, T

    2001-07-01

    Given the number of nucleotide substitutions between two species (K) and the substitution rate nu, the expectation of the corresponding divergence time is usually calculated as K/(2 nu). This is strictly true only if nu is regarded as a constant because the ratio of two random variables, such as K/(2 nu), has distributional properties different from those of the distribution of K. Therefore, both the mean and any confidence interval for divergence times are unknown in this situation. We model the distribution of K and nu using the Gamma distribution and calculate the mean and 95% confidence interval for the corresponding divergence time. These calculations are compared with results obtained by bootstrapping sequence data from the model plant Arabidopsis thaliana and its relatives. We show that for nonoverlapping pairs of phylogenetic distances, our method approaches the bootstrap results very closely. In contrast, regarding the mutation rate as a constant leads to strong underestimation of the confidence interval. An implementation of our method of computing divergence times is accessible through a web interface at http://www.soft.ice.mpg.de/cite.
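
    The point about K/(2 nu) can be illustrated numerically. The sketch below is not the authors' analytical method: it substitutes a simple Monte Carlo simulation, drawing K and nu from Gamma distributions and comparing the resulting divergence times with the case where nu is held constant. All names (`gamma_params`, `divergence_time_samples`, `interval_95`) and all parameter values are hypothetical, chosen only for illustration.

```python
import random
import statistics


def gamma_params(mean, sd):
    """Convert a mean and standard deviation to Gamma (shape, scale)."""
    return (mean / sd) ** 2, sd ** 2 / mean


def divergence_time_samples(k_mean, k_sd, nu_mean, nu_sd=None, n=20000, seed=1):
    """Sorted samples of T = K / (2 * nu).

    K is Gamma-distributed; nu is Gamma-distributed when nu_sd is given,
    otherwise it is treated as the constant nu_mean.
    """
    rng = random.Random(seed)
    k_shape, k_scale = gamma_params(k_mean, k_sd)
    samples = []
    for _ in range(n):
        k = rng.gammavariate(k_shape, k_scale)
        if nu_sd is None:
            nu = nu_mean
        else:
            nu = rng.gammavariate(*gamma_params(nu_mean, nu_sd))
        samples.append(k / (2 * nu))
    samples.sort()
    return samples


def interval_95(sorted_samples):
    """Empirical central 95% interval of already-sorted samples."""
    n = len(sorted_samples)
    return sorted_samples[int(0.025 * n)], sorted_samples[int(0.975 * n)]


# Hypothetical inputs: K = 0.1 substitutions/site (sd 0.02),
# nu = 1e-8 substitutions/site/year (sd 3e-9).
variable_nu = divergence_time_samples(0.1, 0.02, 1e-8, 3e-9)
constant_nu = divergence_time_samples(0.1, 0.02, 1e-8)

print("naive K/(2 nu):       ", 0.1 / (2 * 1e-8))
print("mean, nu random:      ", statistics.mean(variable_nu))
print("95% CI, nu random:    ", interval_95(variable_nu))
print("95% CI, nu constant:  ", interval_95(constant_nu))
```

    Under these assumed inputs, the interval obtained with a random nu should come out wider than the one obtained with a constant nu, in line with the abstract's statement that treating the rate as a constant underestimates the confidence interval.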

  11. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) compared with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) compared with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference in binding free energy change (-0.4 kcal/mol) at subsite A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly). PMID:9836434

  12. cDNA-derived amino acid sequences of myoglobins from nine species of whales and dolphins.

    PubMed

    Iwanami, Kentaro; Mita, Hajime; Yamamoto, Yasuhiko; Fujise, Yoshihiro; Yamada, Tadasu; Suzuki, Tomohiko

    2006-10-01

    We determined the myoglobin (Mb) cDNA sequences of nine cetaceans, of which six are the first reports of Mb sequences: sei whale (Balaenoptera borealis), Bryde's whale (Balaenoptera edeni), pygmy sperm whale (Kogia breviceps), Stejneger's beaked whale (Mesoplodon stejnegeri), Longman's beaked whale (Indopacetus pacificus), and melon-headed whale (Peponocephala electra), and three confirm the previously determined chemical amino acid sequences: sperm whale (Physeter macrocephalus), common minke whale (Balaenoptera acutorostrata) and pantropical spotted dolphin (Stenella attenuata). We found two types of Mb in the skeletal muscle of pantropical spotted dolphin: Mb I with the same amino acid sequence as that deposited in the protein database, and Mb II, which differs at two amino acid residues compared with Mb I. Using an alignment of the amino acid or cDNA sequences of cetacean Mb, we constructed a phylogenetic tree by the NJ method. Clustering of cetacean Mb amino acid and cDNA sequences essentially follows the classical taxonomy of cetaceans, suggesting that Mb sequence data is valid for classification of cetaceans at least to the family level. PMID:16962803

  13. Contrasting responses of root morphology and root-exuded organic acids to low phosphorus availability in three important food crops with divergent root traits

    PubMed Central

    Wang, Yan-Liang; Almvik, Marit; Clarke, Nicholas; Eich-Greatorex, Susanne; Øgaard, Anne Falk; Krogstad, Tore; Lambers, Hans; Clarke, Jihong Liu

    2015-01-01

    Phosphorus (P) is an important element for crop productivity and is widely applied in fertilizers. Most P fertilizers applied to land are sorbed onto soil particles, so research on improving plant uptake of less easily available P is important. In the current study, we investigated the responses in root morphology and root-exuded organic acids (OAs) to low available P (1 μM P) and sufficient P (50 μM P) in barley, canola and micropropagated seedlings of potato—three important food crops with divergent root traits, using a hydroponic plant growth system. We hypothesized that the dicots canola and tuber-producing potato and the monocot barley would respond differently under various P availabilities. WinRHIZO and liquid chromatography triple quadrupole mass spectrometry results suggested that under low P availability, canola developed longer roots and exhibited the fastest root exudation rate for citric acid. Barley showed a reduction in root length and root surface area and an increase in root-exuded malic acid under low-P conditions. Potato exuded relatively small amounts of OAs under low P, while there was a marked increase in root tips. Based on the results, we conclude that different crops show divergent morphological and physiological responses to low P availability, having evolved specific traits of root morphology and root exudation that enhance their P-uptake capacity under low-P conditions. These results could underpin future efforts to improve P uptake of the three crops that are of importance for future sustainable crop production. PMID:26286222

  14. Contrasting responses of root morphology and root-exuded organic acids to low phosphorus availability in three important food crops with divergent root traits.

    PubMed

    Wang, Yan-Liang; Almvik, Marit; Clarke, Nicholas; Eich-Greatorex, Susanne; Øgaard, Anne Falk; Krogstad, Tore; Lambers, Hans; Clarke, Jihong Liu

    2015-08-17

    Phosphorus (P) is an important element for crop productivity and is widely applied in fertilizers. Most P fertilizers applied to land are sorbed onto soil particles, so research on improving plant uptake of less easily available P is important. In the current study, we investigated the responses in root morphology and root-exuded organic acids (OAs) to low available P (1 μM P) and sufficient P (50 μM P) in barley, canola and micropropagated seedlings of potato, three important food crops with divergent root traits, using a hydroponic plant growth system. We hypothesized that the dicots canola and tuber-producing potato and the monocot barley would respond differently under various P availabilities. WinRHIZO and liquid chromatography triple quadrupole mass spectrometry results suggested that under low P availability, canola developed longer roots and exhibited the fastest root exudation rate for citric acid. Barley showed a reduction in root length and root surface area and an increase in root-exuded malic acid under low-P conditions. Potato exuded relatively small amounts of OAs under low P, while there was a marked increase in root tips. Based on the results, we conclude that different crops show divergent morphological and physiological responses to low P availability, having evolved specific traits of root morphology and root exudation that enhance their P-uptake capacity under low-P conditions. These results could underpin future efforts to improve P uptake of the three crops that are of importance for future sustainable crop production.

  15. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    PubMed Central

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  16. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    PubMed Central

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  18. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid.

    PubMed

    Cao, Guangxiang; Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involved in the biosynthesis of clavulanic acid. PMID:27660792

  19. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid

    PubMed Central

    Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involved in the biosynthesis of clavulanic acid. PMID:27660792

  20. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    PubMed

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  1. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    PubMed

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  2. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    PubMed

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  3. Theoretical foundations for quantitative paleogenetics. III - The molecular divergence of nucleic acids and proteins for the case of genetic events of unequal probability

    NASA Technical Reports Server (NTRS)

    Holmquist, R.; Pearl, D.

    1980-01-01

    Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed base replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.

  4. Total synthesis of dihydrolysergic acid and dihydrolysergol: development of a divergent synthetic strategy applicable to rapid assembly of D-ring analogs

    PubMed Central

    Lee, Kiyoun; Poudel, Yam B.; Glinkerman, Christopher M.; Boger, Dale L.

    2015-01-01

    The total syntheses of dihydrolysergic acid and dihydrolysergol are detailed based on a Pd(0)-catalyzed intramolecular Larock indole cyclization for the preparation of the embedded tricyclic indole (ABC ring system) and a subsequent powerful inverse electron demand Diels–Alder reaction of 5-carbomethoxy-1,2,3-triazine with a ketone-derived enamine for the introduction of a functionalized pyridine, serving as the precursor for a remarkably diastereoselective reduction to the N-methylpiperidine D-ring. By design, the use of the same ketone-derived enamine and a set of related complementary heterocyclic azadiene [4 + 2] cycloaddition reactions permitted the late stage divergent preparation of a series of alternative heterocyclic derivatives not readily accessible by more conventional approaches. PMID:26273113

  5. Amino acid sequence of a new mitochondrially synthesized proteolipid of the ATP synthase of Saccharomyces cerevisiae.

    PubMed Central

    Velours, J; Esparza, M; Hoppe, J; Sebald, W; Guerin, B

    1984-01-01

    The purification and the amino acid sequence of a proteolipid translated on ribosomes in yeast mitochondria is reported. This protein, which is a subunit of the ATP synthase, was purified by extraction with chloroform/methanol (2/1) and subsequent chromatography on phosphocellulose and reverse phase h.p.l.c. A mol. wt. of 5500 was estimated by chromatography on Bio-Gel P-30 in 80% formic acid. The complete amino acid sequence of this protein was determined by automated solid phase Edman degradation of the whole protein and of fragments obtained after cleavage with cyanogen bromide. The sequence analysis indicates a length of 48 amino acid residues. The calculated mol. wt. of 5870 corresponds to the value found by gel chromatography. This polypeptide contains three basic residues and no negatively charged side chain. The three basic residues are clustered at the C terminus. The primary structure of this protein is in full agreement with the predicted amino acid sequence of the putative polypeptide encoded by the mitochondrial aap1 gene recently discovered in Saccharomyces cerevisiae. Moreover, this protein shows 50% homology with the amino acid sequence of a putative polypeptide encoded by an unidentified reading frame also discovered near the mitochondrial ATPase subunit 6 gene in Aspergillus nidulans. PMID:6323165

  6. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations.

    PubMed

    Abascal, Federico; Zardoya, Rafael; Telford, Maximilian J

    2010-07-01

    We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.
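The core idea described above, aligning at the amino acid level and then carrying that alignment back to the nucleotides, can be sketched in a few lines. This is an illustrative sketch only, not TranslatorX's implementation: the function name `thread_codons`, the dictionary-based inputs, and the toy sequences are all assumptions, and the protein alignment is taken as already computed by some external aligner.

```python
def thread_codons(protein_alignment, nucleotide_seqs, gap="-"):
    """Build a codon alignment from a protein alignment.

    protein_alignment: dict of name -> aligned protein string (with gaps).
    nucleotide_seqs:   dict of name -> unaligned coding sequence whose
                       length is exactly three times the residue count.
    Each aligned residue is replaced by its codon and each gap
    by three gap characters.
    """
    codon_alignment = {}
    for name, aligned_protein in protein_alignment.items():
        dna = nucleotide_seqs[name]
        n_residues = sum(1 for aa in aligned_protein if aa != gap)
        if len(dna) != 3 * n_residues:
            raise ValueError(f"{name}: nucleotide length does not match protein length")

        codons = []
        pos = 0  # index of the next unused nucleotide
        for aa in aligned_protein:
            if aa == gap:
                codons.append(gap * 3)
            else:
                codons.append(dna[pos:pos + 3])
                pos += 3
        codon_alignment[name] = "".join(codons)
    return codon_alignment


# Toy example (hypothetical sequences): seq1 lacks the third residue of seq2.
proteins = {"seq1": "MK-L", "seq2": "MKTL"}
coding = {"seq1": "ATGAAACTG", "seq2": "ATGAAGACCCTG"}
for name, row in thread_codons(proteins, coding).items():
    print(name, row)
```

Because gaps are always inserted in multiples of three, the reading frame is preserved in the nucleotide alignment, which is what makes the per-codon-position outputs mentioned in the abstract possible.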

  7. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor. PMID:2708331

  8. Homology of amino acid sequences of rat liver cathepsins B and H with that of papain.

    PubMed Central

    Takio, K; Towatari, T; Katunuma, N; Teller, D C; Titani, K

    1983-01-01

    The amino acid sequences of rat liver lysosomal thiol endopeptidases, cathepsins B and H, are presented and compared with that of the plant thiol protease papain. The 252-residue sequence of cathepsin B and the 220-residue sequence of cathepsin H were determined largely by automated Edman degradation of their intact polypeptide chains and of the two chains of each enzyme generated by limited proteolysis. Subfragments of the chains were produced by enzymatic digestion and by chemical cleavage of methionyl and tryptophanyl bonds. Comparison of the amino acid sequences of cathepsins B and H with each other and with that of papain demonstrates a striking homology among their primary structures. Sequence identity is extremely high in regions which, according to the three-dimensional structure of papain, constitute the catalytic site. The results not only reveal the first structural features of mammalian thiol endopeptidases but also provide insight into the evolutionary relationships among plant and mammalian thiol proteases. PMID:6574504

  9. Complete cDNA and derived amino acid sequence of human factor V.

    PubMed Central

    Jenny, R J; Pittman, D D; Toole, J J; Kriz, R W; Aldape, R A; Hewick, R M; Kaufman, R J; Mann, K G

    1987-01-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A) tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approximately 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approximately 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approximately 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues. PMID:3110773

  10. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A) tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  11. Recognition of protein phosphorylation site based on amino acids sequence features

    NASA Astrophysics Data System (ADS)

    Zhang, Ying; Ding, Changjiang; Lu, Jun

    2012-09-01

    Protein phosphorylation is one of the most important reversible post-translational modifications (PTMs), and the theoretical recognition of phosphorylation sites is an important topic in computational biology. In this paper, we use the amino acid composition, position-dependent residue statistics, and non-adjacent residue pair frequencies as recognition parameters, and use Jensen-Shannon divergence with quadratic discriminant analysis (JSDQD) to predict phosphorylation sites. The 7-fold cross-validation test accuracies for the CK2, PKA and PKC kinase families are 90%, 90% and 86%, respectively.
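For readers unfamiliar with the divergence measure named above, the following is a minimal sketch of the Jensen-Shannon divergence between two amino acid composition vectors. It covers only that one ingredient; the quadratic discriminant step and the paper's actual feature construction are not reproduced, and the helper names and toy peptides are assumptions made for illustration.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(peptide):
    """Relative frequency of each of the 20 standard amino acids in a peptide."""
    counts = [peptide.count(aa) for aa in AMINO_ACIDS]
    total = sum(counts)
    if total == 0:
        raise ValueError("peptide contains no standard amino acids")
    return [c / total for c in counts]


def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two discrete distributions.

    Returns a value between 0 (identical) and 1 (no shared support).
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]

    def kl(a, b):
        # Kullback-Leibler divergence; terms with a_i == 0 contribute nothing.
        return sum(ai * math.log2(ai / bi) for ai, bi in zip(a, b) if ai > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


# Toy windows around two hypothetical candidate sites.
window_a = composition("RRASVSLEE")
window_b = composition("GGDSEEDDE")
print(round(jensen_shannon_divergence(window_a, window_b), 3))
```

Unlike the plain Kullback-Leibler divergence, this measure is symmetric and stays finite when a residue is absent from one of the two windows, which makes it convenient for sparse composition vectors.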

  12. Using intron sequence comparisons in the triose-phosphate isomerase gene to study the divergence of the fall armyworm host strains

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Noctuid moth, Spodoptera frugiperda (the fall armyworm), is endemic to the Western Hemisphere and appears to be undergoing sympatric speciation to produce two subpopulations that differ in their choice of host plants. The diverging “rice strain” and “corn strain” are morphologically indistinguis...

  13. Amino acid sequence heterogeneity of the chromosomal encoded Borrelia burgdorferi sensu lato major antigen P100.

    PubMed

    Fellinger, W; Farencena, A; Redl, B; Sambri, V; Cevenini, R; Stöffler, G

    1995-04-01

    The entire nucleotide sequence of the chromosomally encoded major antigen p100 of the European Borrelia garinii isolate B29 was determined, and the deduced amino acid sequence was compared to the homologous antigen p83 of the North American Borrelia burgdorferi sensu stricto strain B31 and the p100 of the European Borrelia afzelii (group VS461) strain PKo. p100 of strain B29 shows 87% amino acid sequence identity to strain B31 and 79.2% to strain PKo, whereas p100 of strains B31 and PKo show 62.5% identity to each other. In addition, partial nucleotide sequences of the most heterogeneous region of the p100 gene of two other Borrelia garinii isolates (PBi and VS286) have been determined and the deduced amino acid sequences were compared with all p100 sequences of Borrelia garinii published so far. We found an amino acid sequence identity between 88.6 and 100% within the same genospecies. The N-terminal part of the p100 proteins is highly conserved whereas a strikingly heterogeneous region within the C-terminal part of the proteins was observed.
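Identity percentages such as those reported above are computed from a pairwise alignment. The sketch below shows one common convention, counting identical residues over the columns where neither sequence has a gap; the abstract does not state which convention the authors used, so this choice, the function name, and the toy sequences are assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two aligned sequences of equal length.

    Columns in which either sequence has a gap are skipped, so the
    denominator is the number of residue-to-residue columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy alignment (hypothetical sequences): three of four residues agree.
print(percent_identity("MKTL", "MKSL"))
```

Other conventions divide by the full alignment length or by the shorter sequence, which gives lower values for gapped alignments, so identities from different studies are comparable only when the denominator is the same.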

  14. Divergent reactivity in palladium-catalyzed annulation with diarylamines and α,β-unsaturated acids: direct access to substituted 2-quinolinones and indoles.

    PubMed

    Kancherla, Rajesh; Naveen, Togati; Maiti, Debabrata

    2015-06-01

    A palladium-catalyzed CH activation strategy has been successfully employed for exclusive synthesis of a variety of 3-substituted indoles. A [3+3] annulation for synthesizing substituted 2-quinolinones was recently developed by reaction of α,β-unsaturated carboxylic acids with diarylamines under acidic conditions. In the present work, an analogous [3+2] annulation is achieved from the same set of starting materials under basic conditions to generate 1,3-disubstituted indoles exclusively. Mechanistic studies revealed an ortho-palladation-π-coordination-β-migratory insertion-β-hydride elimination reaction sequence to be operative under the reaction conditions. PMID:25941155

  15. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  16. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  17. Mitochondrial divergence between slow- and fast-aging garter snakes.

    PubMed

    Schwartz, Tonia S; Arendsee, Zebulun W; Bronikowski, Anne M

    2015-11-01

    Mitochondrial function has long been hypothesized to be intimately involved in aging processes, either directly through declining efficiency of mitochondrial respiration and ATP production with advancing age, or indirectly, e.g., through increased mitochondrial production of damaging free radicals with age. Yet we lack a comprehensive understanding of the evolution of mitochondrial genotypes and phenotypes across diverse animal models, particularly in species that have extremely labile physiology. Here, we measure mitochondrial genome-types and transcription in ecotypes of garter snakes (Thamnophis elegans) that are adapted to disparate habitats and have diverged in aging rates and lifespans despite residing in close proximity. Using two RNA-seq datasets, we (1) reconstruct the garter snake mitochondrial genome sequence and bioinformatically identify regulatory elements, (2) test for divergence of mitochondrial gene expression between the ecotypes and in response to heat stress, and (3) test for sequence divergence in mitochondrial protein-coding regions in these slow-aging (SA) and fast-aging (FA) naturally occurring ecotypes. At the nucleotide sequence level, we confirmed two (duplicated) mitochondrial control regions, one of which contains a glucocorticoid response element (GRE). Gene expression of protein-coding genes was higher in FA snakes relative to SA snakes for most genes, but was neither affected by heat stress nor an interaction between heat stress and ecotype. SA and FA ecotypes had unique mitochondrial haplotypes with amino acid substitutions in both CYTB and ND5. The CYTB amino acid change (Isoleucine → Threonine) was highly segregated between ecotypes. This divergence of mitochondrial haplotypes between SA and FA snakes contrasts with nuclear gene-flow estimates, but correlates with previously reported divergence in mitochondrial function (mitochondrial oxygen consumption, ATP production, and reactive oxygen species consequences).

  18. Ligation with nucleic acid sequence-based amplification.

    PubMed

    Ong, Carmichael; Tai, Warren; Sarma, Aartik; Opal, Steven M; Artenstein, Andrew W; Tripathi, Anubhav

    2012-01-01

    This work presents a novel method for detecting nucleic acid targets using a ligation step along with an isothermal, exponential amplification step. We use an engineered ssDNA with two variable regions on the ends, allowing us to design the probe for optimal reaction kinetics and primer binding. This two-part probe is ligated by T4 DNA Ligase only when both parts bind adjacently to the target. The assay demonstrates that the expected 72-nt RNA product appears only when the synthetic target, T4 ligase, and both probe fragments are present during the ligation step. An extraneous 38-nt RNA product also appears due to linear amplification of unligated probe (P3), but its presence does not cause a false-positive result. In addition, 40 mmol/L KCl in the final amplification mix was found to be optimal. It was also found that increasing P5 in excess of P3 helped with ligation and reduced the extraneous 38-nt RNA product. The assay was also tested with a single nucleotide polymorphism target, changing one base at the ligation site. The assay was able to yield a negative signal despite only a single-base change. Finally, using P3 and P5 with longer binding sites results in increased overall sensitivity of the reaction, showing that increasing ligation efficiency can improve the assay overall. We believe that this method can be used effectively for a number of diagnostic assays. PMID:22449695

  19. The amino acid sequence of mitogenic lectin-B from the roots of pokeweed (Phytolacca americana).

    PubMed

    Yamaguchi, K; Yurino, N; Kino, M; Ishiguro, M; Funatsu, G

    1997-04-01

    The complete amino acid sequence of pokeweed lectin-B (PL-B) has been analyzed by first sequencing seven lysylendopeptidase peptides derived from the reduced and S-pyridylethylated PL-B and then connecting them by analyzing the arginylendopeptidase peptides from the reduced and S-carboxymethylated PL-B. PL-B consists of 295 amino acid residues and two oligosaccharides linked to Asn96 and Asn139, and has a molecular mass of 34,493 Da. PL-B is composed of seven repetitive chitin-binding domains having 48-79% sequence homology with each other. Twelve amino acid residues including eight cysteine residues in these domains are absolutely conserved in all other chitin-binding domains of plant lectins and class I chitinases. Also, it was strongly suggested that the extremely high hemagglutinating and mitogenic activities of PL-B may be ascribed to its seven-domain structure.

  20. ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids.

    PubMed

    Ashkenazy, Haim; Erez, Elana; Martz, Eric; Pupko, Tal; Ben-Tal, Nir

    2010-07-01

    It is informative to detect highly conserved positions in proteins and nucleic acid sequence/structure since they are often indicative of structural and/or functional importance. ConSurf (http://consurf.tau.ac.il) and ConSeq (http://conseq.tau.ac.il) are two well-established web servers for calculating the evolutionary conservation of amino acid positions in proteins using an empirical Bayesian inference, starting from protein structure and sequence, respectively. Here, we present the new version of the ConSurf web server that combines the two independent servers, providing an easier and more intuitive step-by-step interface, while offering the user more flexibility during the process. In addition, the new version of ConSurf calculates the evolutionary rates for nucleic acid sequences. The new version is freely available at: http://consurf.tau.ac.il/.

  1. Amino acid repeats cause extraordinary coding sequence variation in the social amoeba Dictyostelium discoideum.

    PubMed

    Scala, Clea; Tian, Xiangjun; Mehdiabadi, Natasha J; Smith, Margaret H; Saxer, Gerda; Stephens, Katie; Buzombo, Prince; Strassmann, Joan E; Queller, David C

    2012-01-01

    Protein sequences are normally the most conserved elements of genomes owing to purifying selection to maintain their functions. We document an extraordinary amount of within-species protein sequence variation in the model eukaryote Dictyostelium discoideum stemming from triplet DNA repeats coding for long strings of single amino acids. D. discoideum has a very large number of such strings, many of which are polyglutamine repeats, the same sequence that causes various neurological disorders in humans, such as Huntington's disease. We show here that D. discoideum coding repeat loci are highly variable among individuals, making D. discoideum a candidate for the most variable proteome. The coding repeat loci are not significantly less variable than similar non-coding triplet repeats. This pattern is consistent with these amino-acid repeats being largely non-functional sequences evolving primarily by mutation and drift. PMID:23029418

  2. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences]

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.
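
    The two randomness measures named in the abstract (unequal residue frequencies and unequal adjacent-pair frequencies) can be illustrated with a minimal Python sketch. The function names, the use of log2(20) as the maximum entropy, and the shuffling baseline are assumptions made for illustration; this is not the original procedure, only one plausible reading of it.

```python
import math
import random
from collections import Counter

ALPHABET_SIZE = 20  # assumption: the 20 standard amino acids set the maximum entropy


def shannon_entropy(counts):
    """Entropy in bits of the empirical distribution stored in a Counter."""
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def redundancy_measures(seq):
    """Return (d1, d2, r) for one protein sequence.

    d1: divergence from equiprobable residues, log2(20) - H1.
    d2: divergence from independence of adjacent residues, H1 - H(next | current).
    r : (d1 + d2) / log2(20), a redundancy that is 0 for an ideal random chain.
    """
    h_max = math.log2(ALPHABET_SIZE)
    h1 = shannon_entropy(Counter(seq))
    pair_counts = Counter(zip(seq, seq[1:]))
    first_counts = Counter(seq[:-1])
    # Conditional entropy of the next residue given the current one:
    # H(next | current) = H(pair) - H(current)
    h_cond = shannon_entropy(pair_counts) - shannon_entropy(first_counts)
    d1 = h_max - h1
    d2 = h1 - h_cond
    return d1, d2, (d1 + d2) / h_max


def shuffled_baseline(seq, trials=200, seed=0):
    """Monte Carlo estimate of the mean d2 for random chains that keep the
    length and composition of seq (hypothetical helper)."""
    rng = random.Random(seed)
    residues = list(seq)
    values = []
    for _ in range(trials):
        rng.shuffle(residues)
        values.append(redundancy_measures("".join(residues))[1])
    return sum(values) / trials
```

    Comparing d2 for a real sequence with shuffled_baseline for the same sequence gives a rough sense of how much apparent pair structure is explained by short chain length alone.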

  3. Shark myoglobins. II. Isolation, characterization and amino acid sequence of myoglobin from Galeorhinus japonicus.

    PubMed

    Suzuki, T; Suzuki, T; Yata, T

    1985-01-01

    Native oxymyoglobin (MbO2) was isolated from red muscle of G. japonicus by chromatographic separation from metmyoglobin (metMb) on DEAE-cellulose and the amino acid sequence of the major chain was determined with the aid of sequence homology with that of G. australis. It was shown to differ in amino acid sequence from that of G. australis by 10 replacements, to be acetylated at the amino terminus and to contain glutamine at the distal (E7) residue. It was also shown to have a spectrum very similar to that of mammalian MbO2. However, the pH-dependence for the autoxidation of MbO2 was seen to be quite different from that of sperm whale (Physeter catodon) MbO2. Although the sequence homology between sperm whale and G. japonicus myoglobins is about 40%, their hydropathy profiles were very similar, indicating that they have a similar geometry in their globin folding.

  4. Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns

    PubMed Central

    2007-01-01

    We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882

  6. The amino acid sequences of two alpha chains of hemoglobins from Komodo dragon Varanus komodoensis and phylogenetic relationships of amniotes.

    PubMed

    Fushitani, K; Higashiyama, K; Moriyama, E N; Imai, K; Hosokawa, K

    1996-09-01

    To elucidate phylogenetic relationships among amniotes and the evolution of alpha globins, hemoglobins were analyzed from the Komodo dragon (Komodo monitor lizard) Varanus komodoensis, the world's largest extant lizard, inhabiting Komodo Islands, Indonesia. Four unique globin chains (alpha A, alpha D, beta B, and beta C) were isolated in an equal molar ratio by high performance liquid chromatography from the hemolysate. The amino acid sequences of two alpha chains were determined. The alpha D chain has a glutamine at E7 as does an alpha chain of a snake, Liophis miliaris, but the alpha A chain has a histidine at E7 like the majority of hemoglobins. Phylogenetic analyses of 19 globins including two alpha chains of Komodo dragon and ones from representative amniotes showed the following results: (1) the alpha chains of squamates (snakes and lizards), which have a glutamine at E7, are clustered with the embryonic alpha globin family, which typically includes the alpha D chain from birds; (2) birds form a sister group with other reptiles but not with mammals; (3) the genes for embryonic and adult types of alpha globins were possibly produced by duplication of the ancestral alpha gene before ancestral amniotes diverged, indicating that each of the present amniotes might carry descendants of the two types of alpha globin genes; (4) squamates first split off from the ancestor of other reptiles and birds. PMID:8752011

  7. Visible sensing of nucleic acid sequences using a genetically encodable unmodified mRNA probe.

    PubMed

    Narita, Atsushi; Ogawa, Kazumasa; Sando, Shinsuke; Aoyama, Yasuhiro

    2006-01-01

    We previously reported a molecular beacon-mRNA (MB-mRNA) strategy for nucleic acid detection/sensing in a cell-free translation system using unmodified RNA as a probe. Here, we report that combining this strategy with RNase H activity, which induces an additional process of irreversible cleavage of the MB domain, achieves improved sequence selectivity (one-nucleotide selectivity) and enhanced sensitivity. This improved system enabled visible sensing of a target nucleic acid sequence at single-nucleotide resolution under isothermal conditions.

  8. Amino acid and cDNA sequences of lysozyme from Hyalophora cecropia

    PubMed Central

    Engström, Å.; Xanthopoulos, K. G.; Boman, H. G.; Bennich, H.

    1985-01-01

    The amino acid and cDNA sequences of lysozyme from the giant silk moth Hyalophora cecropia have been determined. This enzyme is one of several immune proteins produced by the diapausing pupae after injection of bacteria. Cecropia lysozyme is composed of 120 amino acids, has a mol. wt. of 13.8 kd and shows great similarity with vertebrate lysozymes of the chicken type. The amino acid residues responsible for the catalytic activity and for the binding of substrate are essentially conserved. Three allelic variants of the Cecropia enzyme are identified. A comparison of the chicken and the Cecropia lysozymes shows that there is a 40% identity at both the amino acid and the nucleotide level. Some evolutionary aspects of the sequence data are discussed. PMID:16453632

  9. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66.

    PubMed

    Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan

    2016-06-01

    Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data have been deposited at DDBJ/EMBL/GenBank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids. PMID:27222814

  11. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species for which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and the 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocyte enhancer factor 2 (MEF2), were discovered in intron 1. In the 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was observed in identical form only in ovine, with one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. In particular, the octapeptide repeat region was observed in all species except frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences.

  12. AcalPred: a sequence-based tool for discriminating between acidic and alkaline enzymes.

    PubMed

    Lin, Hao; Chen, Wei; Ding, Hui

    2013-01-01

    The structure and activity of enzymes are influenced by the pH of their surroundings. Although many enzymes work well in the pH range from 6 to 8, some specific enzymes have good efficiencies only in acidic (pH<5) or alkaline (pH>9) solution. Studies have demonstrated that the activities of enzymes correlate with their primary sequences. Judging an enzyme's adaptation to an acidic or alkaline environment from its amino acid sequence is crucial for clarifying molecular mechanisms and designing highly efficient enzymes. In this study, we developed a sequence-based method to discriminate acidic enzymes from alkaline enzymes. Analysis of variance was used to choose the optimized discriminating features derived from g-gap dipeptide compositions, and a support vector machine was used to establish the prediction model. In rigorous jackknife cross-validation, an overall accuracy of 96.7% was achieved. The method can correctly predict 96.3% of acidic and 97.1% of alkaline enzymes. Comparison with previous methods shows that the proposed method is more accurate. On the basis of this proposed method, we have built an online web-server called AcalPred which can be freely accessed from the website (http://lin.uestc.edu.cn/server/AcalPred). We believe that AcalPred will become a powerful tool for studying enzyme adaptation to acidic or alkaline environments.
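
    The g-gap dipeptide composition mentioned above can be sketched in a few lines of Python. This covers only the feature-extraction step; the analysis-of-variance feature selection and the support vector machine are not shown. The function name and the handling of non-standard residues are assumptions made for illustration, not the AcalPred implementation.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def g_gap_dipeptide_composition(seq, g):
    """Return a dict with the frequency of each of the 400 residue pairs,
    counting pairs (seq[i], seq[i + g + 1]) that have g residues between them.

    g = 0 gives the ordinary adjacent dipeptide composition.
    """
    counts = {a + b: 0 for a, b in product(AMINO_ACIDS, repeat=2)}
    n_windows = len(seq) - g - 1
    if n_windows <= 0:
        raise ValueError("sequence too short for this gap")
    counted = 0
    for i in range(n_windows):
        key = seq[i] + seq[i + g + 1]
        if key in counts:  # assumption: pairs with non-standard residues are skipped
            counts[key] += 1
            counted += 1
    return {k: (v / counted if counted else 0.0) for k, v in counts.items()}
```

    Each sequence then becomes a 400-dimensional vector for a chosen g, which is the kind of input a feature-selection step and a classifier could consume.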

  13. AcalPred: A Sequence-Based Tool for Discriminating between Acidic and Alkaline Enzymes

    PubMed Central

    Lin, Hao; Chen, Wei; Ding, Hui

    2013-01-01

    The structure and activity of enzymes are influenced by the pH of their surroundings. Although many enzymes work well in the pH range from 6 to 8, some specific enzymes have good efficiencies only in acidic (pH<5) or alkaline (pH>9) solution. Studies have demonstrated that the activities of enzymes correlate with their primary sequences. Judging an enzyme's adaptation to an acidic or alkaline environment from its amino acid sequence is crucial for clarifying molecular mechanisms and designing highly efficient enzymes. In this study, we developed a sequence-based method to discriminate acidic enzymes from alkaline enzymes. Analysis of variance was used to choose the optimized discriminating features derived from g-gap dipeptide compositions, and a support vector machine was used to establish the prediction model. In rigorous jackknife cross-validation, an overall accuracy of 96.7% was achieved. The method can correctly predict 96.3% of acidic and 97.1% of alkaline enzymes. Comparison with previous methods shows that the proposed method is more accurate. On the basis of this proposed method, we have built an online web-server called AcalPred which can be freely accessed from the website (http://lin.uestc.edu.cn/server/AcalPred). We believe that AcalPred will become a powerful tool for studying enzyme adaptation to acidic or alkaline environments. PMID:24130738

  14. Antibody-specific model of amino acid substitution for immunological inferences from alignments of antibody sequences.

    PubMed

    Mirsky, Alexander; Kazandjian, Linda; Anisimova, Maria

    2015-03-01

    Antibodies are glycoproteins produced by the immune system as a dynamically adaptive line of defense against invading pathogens. Very elegant and specific mutational mechanisms allow B lymphocytes to produce a large and diversified repertoire of antibodies, which is modified and enhanced throughout adulthood. One of these mechanisms is somatic hypermutation, which stochastically mutates nucleotides in the antibody genes, forming new sequences with different properties and, eventually, higher affinity and selectivity to the pathogenic target. As somatic hypermutation involves fast mutation of antibody sequences, this process can be described using a Markov substitution model of molecular evolution. Here, using large sets of antibody sequences from mice and humans, we infer an empirical amino acid substitution model AB, which is specific to antibody sequences. Compared with existing general amino acid models, we show that the AB model provides a significantly better description of the somatic evolution of mouse and human antibody sequences, as demonstrated on large next generation sequencing (NGS) antibody data. General amino acid models are reflective of conservation at the protein level due to functional constraints, with the most frequent amino acid exchanges taking place between residues with the same or similar physicochemical properties. In contrast, within the variable part of antibody sequences we observed an elevated frequency of exchanges between amino acids with distinct physicochemical properties. This is indicative of a sui generis mutational mechanism, specific to antibody somatic hypermutation. We illustrate this property of antibody sequences by a comparative analysis of the network modularity implied by the AB model and general amino acid substitution models. We recommend using the new model for computational studies of antibody sequence maturation, including inference of alignments and phylogenetic trees describing antibody somatic hypermutation in

  15. The value of short amino acid sequence matches for prediction of protein allergenicity.

    PubMed

    Silvanovich, Andre; Nemeth, Margaret A; Song, Ping; Herman, Rod; Tagliani, Laura; Bannon, Gary A

    2006-03-01

    Typically, genetically engineered crops contain traits encoded by one or a few newly expressed proteins. The allergenicity assessment of newly expressed proteins is an important component in the safety evaluation of genetically engineered plants. One aspect of this assessment involves sequence searches that compare the amino acid sequence of the protein to all known allergens. Analyses are performed to determine the potential for immunologically based cross-reactivity where IgE directed against a known allergen could bind to the protein and elicit a clinical reaction in sensitized individuals. Bioinformatic searches are designed to detect global sequence similarity and short contiguous amino acid sequence identity. It has been suggested that potential allergen cross-reactivity may be predicted by identifying matches as short as six to eight contiguous amino acids between the protein of interest and a known allergen. A series of analyses were performed, and match probabilities were calculated for different size peptides to determine if there was a scientifically justified search window size that identified allergen sequence characteristics. Four probability modeling methods were tested: (1) a mock protein and a mock allergen database, (2) a mock protein and genuine allergen database, (3) a genuine allergen and genuine protein database, and (4) a genuine allergen and genuine protein database combined with a correction for repeating peptides. These analyses indicated that short amino acid sequence matches of eight amino acids or fewer, when used to identify proteins as potential cross-reactive allergens, are a product of chance and add little value to allergy assessments for newly expressed proteins.
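
    Why short matches are likely to arise by chance can be illustrated with a back-of-the-envelope calculation. The Python sketch below is not one of the four probability models tested in the study; it assumes independent residues, uniform frequencies by default, a database treated as one long string, and a Poisson approximation. The function names and the sizes in the usage loop are hypothetical.

```python
import math


def expected_chance_matches(query_len, db_residues, k, freqs=None):
    """Expected number of identical k-residue windows shared by a query
    protein and a database purely by chance.

    freqs: optional dict of residue -> background frequency; if omitted,
    all 20 amino acids are assumed equally likely.
    """
    if freqs is None:
        p_same = 1 / 20
    else:
        # Probability that two independently drawn residues are identical
        p_same = sum(f * f for f in freqs.values())
    query_windows = max(query_len - k + 1, 0)
    db_windows = max(db_residues - k + 1, 0)
    return query_windows * db_windows * p_same ** k


def prob_at_least_one(expected):
    """Poisson approximation to the chance of seeing one or more matches."""
    return 1 - math.exp(-expected)


if __name__ == "__main__":
    # Hypothetical sizes: a 300-residue protein against 500,000 database residues
    for k in (6, 7, 8):
        e = expected_chance_matches(query_len=300, db_residues=500_000, k=k)
        print(k, e, prob_at_least_one(e))
```

    Under these assumptions each extra residue in the window lowers the expected number of chance matches by a factor of 20, which is why the choice of window size dominates the outcome of such searches.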

  16. Comparison of the amino acid sequence of the major immunogen from three serotypes of foot and mouth disease virus.

    PubMed Central

    Makoff, A J; Paynter, C A; Rowlands, D J; Boothroyd, J C

    1982-01-01

    Cloned cDNA molecules from three serotypes of FMDV have been sequenced around the VP1-coding region. The predicted amino acid sequences for VP1 were compared with the published sequences and variable regions identified. The amino acid sequences were also analysed for hydrophilic regions. Two of the variable regions, numbered 129-160 and 193-204 overlapped hydrophilic regions, and were therefore identified as potentially immunogenic. These regions overlap regions shown by others to be immunogenic. PMID:6298715

  17. Divergence of cuticular hydrocarbons in two sympatric grasshopper species and the evolution of fatty acid synthases and elongases across insects

    PubMed Central

    Finck, Jonas; Berdan, Emma L.; Mayer, Frieder; Ronacher, Bernhard; Geiselhardt, Sven

    2016-01-01

    Cuticular hydrocarbons (CHCs) play a major role in the evolution of reproductive isolation between insect species. The CHC profiles of two closely related sympatric grasshopper species, Chorthippus biguttulus and C. mollis, differ mainly in the position of the first methyl group in major methyl-branched CHCs. The position of methyl branches is determined either by a fatty acid synthase (FAS) or by elongases. Both protein families showed an expansion in insects. Interestingly, the FAS family showed several lineage-specific expansions, especially in insect orders with highly diverse methyl-branched CHC profiles. We found five putative FASs and 12 putative elongases in the reference transcriptomes for both species. A dN/dS test showed no evidence for positive selection acting on FASs and elongases in these grasshoppers. However, one candidate FAS showed species-specific transcriptional differences and may contribute to the shift of the methyl-branch position between the species. In addition, four elongases were differentially expressed between the sexes. Our study indicates that complex methyl-branched CHC profiles are linked to an expansion of FAS genes, but that species differences can also be mediated at the transcriptional level. PMID:27677406

  18. Quantitative detection of Aspergillus spp. by real-time nucleic acid sequence-based amplification.

    PubMed

    Zhao, Yanan; Perlin, David S

    2013-01-01

    Rapid and quantitative detection of Aspergillus from clinical samples may facilitate an early diagnosis of invasive pulmonary aspergillosis (IPA). As nucleic acid-based detection is a viable option, we demonstrate that Aspergillus burdens can be rapidly and accurately detected by a novel real-time nucleic acid assay other than qPCR that combines nucleic acid sequence-based amplification (NASBA) with molecular beacon (MB) technology. Here, we detail a real-time NASBA assay to determine quantitative Aspergillus burdens in lungs and bronchoalveolar lavage (BAL) fluids of rats with experimental IPA.

  19. Unifying bacteria from decaying wood with various ubiquitous Gibbsiella species as G. acetica sp. nov. based on nucleotide sequence similarities and their acetic acid secretion.

    PubMed

    Geider, Klaus; Gernold, Marina; Jock, Susanne; Wensing, Annette; Völksch, Beate; Gross, Jürgen; Spiteller, Dieter

    2015-12-01

    Bacteria were isolated from necrotic apple and pear tree tissue and from dead wood in Germany and Austria as well as from pear tree exudate in China. They were selected for growth at 37 °C, screened for levan production and then characterized as Gram-negative, facultatively anaerobic rods. Alignments of nucleotide sequences from 16S rRNA genes and the housekeeping genes dnaJ, gyrB, recA and rpoB, BLAST searches and phenotypic data confirmed by MALDI-TOF analysis showed that these bacteria belong to the genus Gibbsiella and resemble strains isolated from diseased oaks in Britain and Spain. Gibbsiella-specific PCR primers were designed from the proline isomerase and the levansucrase genes. Acid secretion was investigated by screening for halo formation on calcium carbonate agar, and the compound was identified by NMR as acetic acid. Its production by Gibbsiella spp. strains was also determined in culture supernatants by GC/MS analysis after derivatization with pentafluorobenzyl bromide. Some strains were differentiated by the PFGE patterns of SpeI digests and by sequence analyses of the lsc and the ppiD genes, and the Chinese Gibbsiella strain was most divergent. The newly investigated bacteria as well as Gibbsiella quercinecans, Gibbsiella dentisursi and Gibbsiella papilionis, isolated in Britain, Spain, Korea and Japan, are taxonomically related Enterobacteriaceae that tolerate and secrete acetic acid. We therefore propose to unify them in the species Gibbsiella acetica sp. nov.

  20. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923)

    PubMed Central

    Clément, Benjamin; Lopes Ferreira, Nicolas

    2016-01-01

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain. PMID:26941139

  1. Amino acid sequence of the encephalitogenic basic protein from human myelin

    PubMed Central

    Carnegie, P. R.

    1971-01-01

    Myelin from the central nervous system contains an unusual basic protein, which can induce experimental autoimmune encephalomyelitis. The basic protein from human brain was digested with trypsin and other enzymes and the sequence of the 170 amino acids was determined. The localization of the encephalitogenic determinants was described. Possible roles for the protein in the structure and function of myelin are discussed. PMID:4108501

  2. Sequence-specific formation of d-amino acids in a monoclonal antibody during light exposure.

    PubMed

    Mozziconacci, Olivier; Schöneich, Christian

    2014-11-01

    The photoirradiation of a monoclonal antibody 1 (mAb1) at λ = 254 nm and λmax = 305 nm resulted in the sequence-specific generation of d-Val, d-Tyr, and potentially d-Ala and d-Arg, in the heavy chain sequence [95-101] YCARVVY. d-Amino acid formation is most likely the product of reversible intermediary carbon-centered radical formation at the (α)C-positions of the respective amino acids ((α)C(•) radicals) through the action of Cys thiyl radicals (CysS(•)). The latter can be generated photochemically either through direct homolysis of cystine or through photoinduced electron transfer from Trp and/or Tyr residues. The potential of mAb1 sequences to undergo epimerization was first evaluated through covalent H/D exchange during photoirradiation in D2O, and proteolytic peptides exhibiting deuterium incorporation were monitored by HPLC-MS/MS analysis. Subsequently, mAb1 was photoirradiated in H2O, and peptides, for which deuterium incorporation in D2O had been documented, were purified by HPLC and subjected to hydrolysis and amino acid analysis. Importantly, not all peptide sequences which incorporated deuterium during photoirradiation in D2O also exhibited photoinduced d-amino acid formation. For example, the heavy chain sequence [12-18] VQPGGSL showed significant deuterium incorporation during photoirradiation in D2O, but no photoinduced formation of d-amino acids was detected. Instead this sequence contained ca. 22% d-Val in both a photoirradiated and a control sample. This observation could indicate that d-Val may have been generated either during production and/or storage or during sample preparation. While sample preparation did not lead to the formation of d-Val or other d-amino acids in the control sample for the heavy chain sequence [95-101] YCARVVY, we may have to consider that during hydrolysis N-terminal residues (such as in VQPGGSL) may be more prone to epimerization. We conclude that the photoinduced, radical-dependent formation of d-amino acids

  3. The complete amino acid sequence of chitinase-B from the leaves of pokeweed (Phytolacca americana).

    PubMed

    Tanigawa, M; Yamagami, T; Funatsu, G

    1995-05-01

    The complete amino acid sequence of pokeweed leaf chitinase-B (PLC-B) has been determined by first sequencing all 19 tryptic peptides derived from the reduced and S-carboxymethylated (RCm-) PLC-B and then connecting them by analyzing the chymotryptic peptides from three fragments produced by cyanogen bromide cleavage of RCm-PLC-B. PLC-B consists of 274 amino acid residues and has a molecular mass of 29,473 Da. Six cysteine residues are linked by disulfide bonds between Cys20 and Cys67, Cys50 and Cys57, and Cys159 and Cys188. From 58-68% sequence homology of PLC-B with five class III chitinases, it was concluded that PLC-B is a basic class III chitinase.

  4. Creation of a data base for sequences of ribosomal nucleic acids and detection of conserved restriction endonucleases sites through computerized processing.

    PubMed Central

    Patarca, R; Dorta, B; Ramirez, J L

    1982-01-01

    As part of a project pertaining to the organization of ribosomal genes in Kinetoplastidae, we have created a data base for published sequences of ribosomal nucleic acids, with information in Spanish. As a first step in their processing, we have written a computer program which introduces the new feature of determining the length of the fragments produced after single or multiple digestion with any of the known restriction enzymes. With this information we have detected conserved SAU 3A sites: (i) at the 5' end of the 5.8S rRNA and at the 3' end of the small subunit rRNA, both included in similar larger sequences; (ii) in the 5.8S rRNA of vertebrates (a second one), which is not present in lower eukaryotes, showing a clear evolutionary divergence; and, (iii) at the 5' terminal of the small subunit rRNA, included in a larger conserved sequence. The possible biological importance of these sequences is discussed. PMID:6278402
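
    The fragment-length calculation this abstract describes can be illustrated with a minimal Python sketch. It is not the authors' program: the enzyme table, the function names, and the toy sequence are assumptions made for illustration, and only the forward strand of a linear DNA sequence is scanned (both sites used here are palindromic).

```python
import re

# Recognition site and cut offset within the site (0 = cut before the first base).
# Sau3AI cuts before the G of GATC; EcoRI cuts between G and AATTC.
ENZYMES = {
    "Sau3AI": ("GATC", 0),
    "EcoRI": ("GAATTC", 1),
}

def cut_positions(seq, enzyme):
    """Return the 0-based cut positions of one enzyme on a linear DNA sequence."""
    site, offset = ENZYMES[enzyme]
    # A lookahead is used so that overlapping sites would all be reported.
    return [m.start() + offset for m in re.finditer(f"(?={site})", seq.upper())]

def fragment_lengths(seq, enzymes):
    """Return fragment lengths, in 5'-to-3' order, after single or multiple digestion."""
    cuts = sorted({p for e in enzymes for p in cut_positions(seq, e)})
    bounds = [0] + [c for c in cuts if 0 < c < len(seq)] + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]

toy = "AAGATCCCGAATTCTTGATCAA"  # made-up sequence, not taken from the data base
print(fragment_lengths(toy, ["Sau3AI"]))           # [2, 14, 6]
print(fragment_lengths(toy, ["Sau3AI", "EcoRI"]))  # [2, 7, 7, 6]
```

    Comparing such fragment-length lists across aligned rRNA genes is one plausible way conserved sites could be spotted; the abstract does not say how the comparison was actually done.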

  5. Pyruvate decarboxylase from Pisum sativum. Properties, nucleotide and amino acid sequences.

    PubMed

    Mücke, U; Wohlfarth, T; Fiedler, U; Bäumlein, H; Rücknagel, K P; König, S

    1996-04-15

    To study the molecular structure and function of pyruvate decarboxylase (PDC) from plants, the protein was isolated from pea seeds and partially characterised. The active enzyme which occurs in the form of higher oligomers consists of two different subunits appearing in SDS/PAGE and mass spectroscopy experiments. For further experiments, like X-ray crystallography, it was necessary to elucidate the protein sequence. Partial cDNA clones encoding pyruvate decarboxylase from seeds of Pisum sativum cv. Miko have been obtained by means of polymerase chain reaction techniques. The first sequences were found using degenerate oligonucleotide primers designed according to conserved amino acid sequences of known pyruvate decarboxylases. The missing parts of one cDNA were amplified applying the 3'- and 5'-rapid amplification of cDNA ends systems. The amino acid sequence deduced from the entire cDNA sequence displays strong similarity to pyruvate decarboxylases from other organisms, especially from plants. A molecular mass of 64 kDa was calculated for this protein correlating with estimations for the smaller subunit of the oligomeric enzyme. The PCR experiments led to at least three different clones representing the middle part of the PDC cDNA indicating the existence of three isozymes. Two of these isoforms could be confirmed on the protein level by sequencing tryptic peptides. Only anaerobically treated roots showed a positive signal for PDC mRNA in Northern analysis although the cDNA from imbibed seeds was successfully used for PCR.

  6. Divergent synthesis of 4-epi-fagomine, 3,4-dihydroxypipecolic acid, and a dihydroxyindolizidine and their β-galactosidase inhibitory and immunomodulatory activities.

    PubMed

    Kumar, K S Ajish; Rathee, J S; Subramanian, M; Chattopadhyay, S

    2013-08-01

    A divergent asymmetric synthesis of the titled iminosugars has been formulated starting from a chiral homoallyl alcohol as the versatile intermediate. The homoallyl alcohol was prepared by a highly diastereoselective Barbier reaction on a d-glucose-derived aldehyde. The protection of its hydroxyl function followed by reductive ozonolysis of the olefin and a subsequent one-pot three-step protocol involving a Staudinger reaction, reductive amination, and benzyloxy carbonyl protection yielded an important bicyclic furanopiperidine derivative. This was converted to the target compounds by following standard reactions. Among the synthesized compounds, 4-epi-fagomine (2b) was the best β-galactosidase inhibitor, and it also prevented LPS-mediated activation of Raw 264.7 macrophage cells. Its congener, 3,4-dihydroxypipecolic acid (4b) also showed similar trends in its cytokine- and enzyme-inhibitory properties at a low concentration (10 μM) but was proinflammatory at higher concentrations. The bicyclic compound dihydroxyindolizidine (21) reduced the proinflammatory cytokine (IL-1β and TNF-α) levels in the LPS-activated Raw 264.7 cells without showing any enzyme-inhibition activity.

  7. Allelic polymorphism in arabian camel ribonuclease and the amino acid sequence of bactrian camel ribonuclease.

    PubMed

    Welling, G W; Mulder, H; Beintema, J J

    1976-04-01

    Pancreatic ribonucleases from several species (whitetail deer, roe deer, guinea pig, and arabian camel) exhibit more than one amino acid at particular positions in their amino acid sequences. Since these enzymes were isolated from pooled pancreas, the origin of this heterogeneity is not clear. The pancreatic ribonucleases from 11 individual arabian camels (Camelus dromedarius) have been investigated with respect to the lysine-glutamine heterogeneity at position 103 (Welling et al., 1975). Six ribonucleases showed only one basic band and five showed two bands after polyacrylamide gel electrophoresis, suggesting a gene frequency of about 0.75 for the Lys gene and about 0.25 for the Gln gene. The amino acid sequence of bactrian camel (Camelus bactrianus) ribonuclease isolated from individual pancreatic tissue was determined and compared with that of arabian camel ribonuclease. The only difference was observed at position 103. In the ribonucleases from two unrelated bactrian camels, only glutamine was observed at that position. PMID:962846
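
    The gene frequencies quoted above follow from simple allele counting. The sketch below assumes, as the abstract implies but does not state outright, that the six single-band animals are Lys/Lys homozygotes and the five two-band animals are Lys/Gln heterozygotes; the function name is hypothetical.

```python
def allele_frequencies(n_hom_a, n_het, n_hom_b=0):
    """Return (freq_a, freq_b) for a two-allele locus from genotype counts."""
    total_alleles = 2 * (n_hom_a + n_het + n_hom_b)
    freq_a = (2 * n_hom_a + n_het) / total_alleles
    return freq_a, 1 - freq_a

# 11 arabian camels: 6 with one band (assumed Lys/Lys), 5 with two bands (assumed Lys/Gln).
lys, gln = allele_frequencies(n_hom_a=6, n_het=5)
print(f"Lys: {lys:.2f}, Gln: {gln:.2f}")  # Lys: 0.77, Gln: 0.23
```

    Under these assumptions the counts are 17 Lys and 5 Gln alleles out of 22, consistent with the reported values of about 0.75 and 0.25.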

  8. Nucleotide sequence of Crithidia fasciculata cytosol 5S ribosomal ribonucleic acid.

    PubMed

    MacKay, R M; Gray, M W; Doolittle, W F

    1980-11-11

    The complete nucleotide sequence of the cytosol 5S ribosomal ribonucleic acid of the trypanosomatid protozoan Crithidia fasciculata has been determined by a combination of T1-oligonucleotide catalog and gel sequencing techniques. The sequence is: GAGUACGACCAUACUUGAGUGAAAACACCAUAUCCCGUCCGAUUUGUGAAGUUAAGCACCCACAGGCUUAGUUAGUACUGAGGUCAGUGAUGACUCGGGAACCCUGAGUGCCGUACUCCC-OH. This 5S ribosomal RNA is unique in having GAUU in place of the GAAC or GAUC found in all other prokaryotic and eukaryotic 5S RNAs, and thought to be involved in interactions with tRNAs. Comparisons to other eukaryotic cytosol 5S ribosomal RNA sequences indicate that the four major eukaryotic kingdoms (animals, plants, fungi, and protists) are about equally remote from each other, and that the latter kingdom may be the most internally diverse.

  9. Pattern recognition in nucleic acid sequences. II. An efficient method for finding locally stable secondary structures.

    PubMed Central

    Kanehisa, M I; Goad, W B

    1982-01-01

    We present a method for calculating all possible single hairpin loop secondary structures in a nucleic acid sequence by the order of N² operations where N is the total number of bases. Each structure may contain any number of bulges and internal loops. Most natural sequences are found to be indistinguishable from random sequences in the potential of forming secondary structures, which is defined by the frequency of possible secondary structures calculated by the method. There is a strong correlation between the higher G+C content and the higher structure forming potential. Interestingly, the removal of intervening sequences in mRNAs is almost always accompanied by an increase in the G+C content, which may suggest an involvement of structural stabilization in the mRNA maturation. PMID:6174936
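
    To give a feel for how hairpin candidates can be enumerated in quadratic time, here is a deliberately simplified Python sketch. It is not the published method: it finds only perfect stems (no bulges or internal loops), uses no energy model, and the function name and the `min_stem` and `min_loop` parameters are assumptions. A table over all base pairs (i, j) is filled once, so time and memory both grow as N².

```python
# Watson-Crick pairs plus the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}

def hairpin_stems(seq, min_stem=3, min_loop=3):
    """Return (i, j, length) for each maximal perfect hairpin stem.

    Base i+k pairs with base j-k for k = 0 .. length-1 (0-based indices).
    """
    seq = seq.upper()
    n = len(seq)
    # run[i][j] = number of consecutive pairs (i, j), (i+1, j-1), ... going inward.
    run = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):  # short spans first, so inner pairs are ready
        for i in range(n - span):
            j = i + span
            if (seq[i], seq[j]) in PAIRS:
                run[i][j] = 1 + run[i + 1][j - 1]
    stems = []
    for i in range(n):
        for j in range(i + 1, n):
            # Skip stems that are just the inner part of a longer stem.
            extends_outward = i > 0 and j < n - 1 and run[i - 1][j + 1] > 0
            if run[i][j] >= min_stem and not extends_outward:
                stems.append((i, j, run[i][j]))
    return stems

print(hairpin_stems("GGGGAAAACCCC", min_stem=4))  # [(0, 11, 4)]
```

    Counting such stems per sequence would give a crude stand-in for the "structure forming potential" mentioned above, which the authors define from their own, more general structure enumeration.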

  10. Comparison of orthologous and paralogous DNA flanking the wheat high molecular weight glutenin genes: sequence conservation and divergence, transposon distribution, and matrix-attachment regions.

    PubMed

    Anderson, O D; Larka, L; Christoffers, M J; McCue, K F; Gustafson, J P

    2002-04-01

    Extended flanking DNA sequences were characterized for five members of the wheat high molecular weight (HMW) glutenin gene family to understand more of the structure, control, and evolution of these genes. Analysis revealed more sequence conservation among orthologous regions than between paralogous regions, with differences mainly owing to transposition events involving putative retrotransposons and several miniature inverted transposable elements (MITEs). Both gypsy-like long terminal repeat (LTR) and non-LTR retrotransposon sequences are represented in the flanking DNAs. One of the MITEs is a novel class, but another MITE is related to the maize Stowaway family and is widely represented in Triticeae expressed sequence tags (ESTs). Flanking DNA of the longest sequence, a 20 425-bp fragment including and surrounding the HMW-glutenin Bx7 gene, showed additional cereal gene-like sequences both immediately 5' and 3' to the HMW-glutenin coding region. The transcriptional activities of sequences related to these flanking putative genes and the retrotransposon-related regions were indicated by matches to wheat and other Triticeae ESTs. Predictive analysis of matrix-attachment regions (MARs) of the HMW glutenin and several alpha-, gamma-, and omega-gliadin flanking DNAs indicates potential MARs immediately flanking each of the genes. Matrix binding activity in the predicted regions was confirmed for two of the HMW-glutenin genes.

  11. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization.

    PubMed

    Anahtar, Melis N; Bowman, Brittany A; Kwon, Douglas S

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost-effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple sample types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  12. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization

    PubMed Central

    Anahtar, Melis N.; Bowman, Brittany A.; Kwon, Douglas S.

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost-effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple sample types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  13. Design of nucleic acid sequences for DNA computing based on a thermodynamic approach.

    PubMed

    Tanaka, Fumiaki; Kameda, Atsushi; Yamamoto, Masahito; Ohuchi, Azuma

    2005-01-01

    We have developed an algorithm for designing multiple sequences of nucleic acids that have a uniform melting temperature between the sequence and its complement and that do not hybridize non-specifically with each other based on the minimum free energy (ΔG(min)). Sequences that satisfy these constraints can be utilized in computations, various engineering applications such as microarrays, and nano-fabrications. Our algorithm is a random generate-and-test algorithm: it generates a candidate sequence randomly and tests whether the sequence satisfies the constraints. The novelty of our algorithm is that the filtering method uses a greedy search to calculate ΔG(min). This effectively excludes inappropriate sequences before ΔG(min) is calculated, thereby reducing computation time drastically when compared with an algorithm without the filtering. Experimental results in silico showed the superiority of the greedy search over the traditional approach based on the Hamming distance. In addition, experimental results in vitro demonstrated that the experimental free energy (ΔG(exp)) of 126 sequences correlated better with ΔG(min) (|R| = 0.90) than with the Hamming distance (|R| = 0.80). These results validate the rationality of a thermodynamic approach. We implemented our algorithm in a graphic user interface-based program written in Java.
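
    The random generate-and-test loop can be sketched in a few lines of Python. This sketch implements the traditional Hamming-distance baseline that the abstract compares against, not the authors' thermodynamic ΔG(min) test or their greedy filter. A fixed G+C count stands in as a crude proxy for a uniform melting temperature, and all names and thresholds are assumptions.

```python
import random

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]

def hamming(a, b):
    """Number of mismatched positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))

def gc_count(seq):
    return seq.count("G") + seq.count("C")

def design(n_seqs, length=20, min_dist=8, gc=10, max_tries=100_000, seed=0):
    """Randomly generate candidates and keep those that pass every test."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(max_tries):
        if len(accepted) == n_seqs:
            break
        cand = "".join(rng.choice("ACGT") for _ in range(length))
        # Test 1: fixed G+C count (proxy for uniform melting temperature).
        if gc_count(cand) != gc:
            continue
        # Test 2: far from its own reverse complement (avoid self-hybridization).
        if hamming(cand, reverse_complement(cand)) < min_dist:
            continue
        # Test 3: far from every accepted sequence and its reverse complement.
        others = accepted + [reverse_complement(s) for s in accepted]
        if all(hamming(cand, o) >= min_dist for o in others):
            accepted.append(cand)
    return accepted

seqs = design(5)
for s in seqs:
    print(s)
```

    Replacing tests 2 and 3 with a free-energy calculation is where the approach described in the abstract departs from this baseline.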

  14. Retention and loss of amino acid biosynthetic pathways based on analysis of whole-genome sequences.

    PubMed

    Payne, Samuel H; Loomis, William F

    2006-02-01

    Plants and fungi can synthesize each of the 20 amino acids by using biosynthetic pathways inherited from their bacterial ancestors. However, the ability to synthesize nine amino acids (Phe, Trp, Ile, Leu, Val, Lys, His, Thr, and Met) was lost in a wide variety of eukaryotes that evolved the ability to feed on other organisms. Since the biosynthetic pathways and their respective enzymes are well characterized, orthologs can be recognized in whole genomes to understand when in evolution pathways were lost. The pattern of pathway loss and retention was analyzed in the complete genomes of three early-diverging protist parasites, the amoeba Dictyostelium, and six animals. The nine pathways were lost independently in animals, Dictyostelium, Leishmania, Plasmodium, and Cryptosporidium. Seven additional pathways appear to have been lost in one or another parasite, demonstrating that they are dispensable in a nutrition-rich environment. Our predictions of pathways retained and pathways lost based on computational analyses of whole genomes are validated by minimal-medium studies with mammals, fish, worms, and Dictyostelium. The apparent selective advantages of retaining biosynthetic capabilities for amino acids available in the diet are considered.

  15. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken]; SNL

    2016-07-12

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June 2012 in Santa Fe, NM.

  16. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    SciTech Connect

    Patel, Kamlesh D; SNL,

    2012-06-01

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June 2012 in Santa Fe, NM.

  17. Complete Genome Sequence of the Alfalfa latent virus

    PubMed Central

    Shao, Jonathan; Postnikova, Olga A.

    2015-01-01

    The first complete genome sequence of the Alfalfa latent carlavirus (ALV) was obtained by primer walking and Illumina RNA sequencing. The virus differs substantially from the Czech ALV isolate and the Pea streak virus isolate from Wisconsin. The absence of a clear nucleic acid-binding protein indicates ALV divergence from other carlaviruses. PMID:25883281

  18. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    PubMed

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp-85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology. (ABSTRACT TRUNCATED AT 250 WORDS)

  19. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  20. Complete amino acid sequence of a human monocyte chemoattractant, a putative mediator of cellular immune reactions.

    PubMed Central

    Robinson, E A; Yoshimura, T; Leonard, E J; Tanaka, S; Griffin, P R; Shabanowitz, J; Hunt, D F; Appella, E

    1989-01-01

    In a study of the structural basis for leukocyte specificity of chemoattractants, we determined the complete amino acid sequence of human glioma-derived monocyte chemotactic factor (GDCF-2), a peptide that attracts human monocytes but not neutrophils. The choice of a tumor cell product for analysis was dictated by its relative abundance and an amino acid composition indistinguishable from that of lymphocyte-derived chemotactic factor (LDCF), the agonist thought to account for monocyte accumulation in cellular immune reactions. By a combination of Edman degradation and mass spectrometry, it was established that GDCF-2 comprises 76 amino acid residues, commencing at the N terminus with pyroglutamic acid. The peptide contains four half-cystines, at positions 11, 12, 36, and 52, which create a pair of loops, clustered at the disulfide bridges. The relative positions of the half-cystines are almost identical to those of monocyte-derived neutrophil chemotactic factor (MDNCF), a peptide of similar mass but with only 24% sequence identity to GDCF. Thus, GDCF and MDNCF have a similar gross secondary structure because of the loops formed by the clustered disulfides, and their different leukocyte specificities are most likely determined by the large differences in primary sequence. PMID:2648385

  1. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β-lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  2. Nucleotide and amino acid sequences of human intestinal alkaline phosphatase: close homology to placental alkaline phosphatase

    SciTech Connect

    Henthorn, P.S.; Raducha, M.; Edwards, Y.H.; Weiss, M.J.; Slaughter, C.; Lafferty, M.A.; Harris, H.

    1987-03-01

    A cDNA clone for human adult intestinal alkaline phosphatase (ALP) (orthophosphoric-monoester phosphohydrolase (alkaline optimum); EC 3.1.3.1) was isolated from a λgt11 expression library. The cDNA insert of this clone is 2513 base pairs in length and contains an open reading frame that encodes a 528-amino acid polypeptide. This deduced polypeptide contains the first 40 amino acids of human intestinal ALP, as determined by direct protein sequencing. Intestinal ALP shows 86.5% amino acid identity to placental (type 1) ALP and 56.6% amino acid identity to liver/bone/kidney ALP. In the 3'-untranslated regions, intestinal and placental ALP cDNAs are 73.5% identical (excluding gaps). The evolution of this multigene enzyme family is discussed.
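
    Identity figures such as "73.5% identical (excluding gaps)" depend on how gapped alignment columns are counted. The Python sketch below shows one common convention, in which any column containing a gap in either sequence is left out of both the numerator and the denominator. Whether the authors used exactly this convention is an assumption, and the aligned strings are made up.

```python
def percent_identity(aln_a, aln_b, gap="-"):
    """Percent identity over the ungapped columns of a pairwise alignment."""
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(aln_a, aln_b):
        if a == gap or b == gap:
            continue  # columns with a gap are excluded entirely
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared

# Toy alignment: 8 ungapped columns, 7 of them identical.
print(percent_identity("ACGT-ACGTA", "ACCTTACG-A"))  # 87.5
```

    Counting gap columns as mismatches instead would lower the figure, which is why the convention is worth stating alongside any reported percentage.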

  3. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  4. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  5. Reaction sequences in simulated neutralized current acid waste slurry during processing with formic acid

    SciTech Connect

    Smith, H.D.; Wiemers, K.D.; Langowski, M.H.; Powell, M.R.; Larson, D.E.

    1993-11-01

    The Hanford Waste Vitrification Plant (HWVP) is being designed for the Department of Energy to immobilize high-level and transuranic wastes as glass for permanent disposal. Pacific Northwest Laboratory is supporting the HWVP design activities by conducting laboratory-scale studies using a HWVP simulated waste slurry. Conditions which affect the slurry processing chemistry were evaluated in terms of offgas composition and peak generation rate and changes in slurry composition. A standard offgas profile defined in terms of three reaction phases, decomposition of H₂CO₃, destruction of NO₂⁻, and production of H₂ and NH₃ was used as a baseline against which changes were evaluated. The test variables include nitrite concentration, acid neutralization capacity, temperature, and formic acid addition rate. Results to date indicate that pH is an important parameter influencing the N₂O/NOₓ generation ratio; nitrite can both inhibit and activate rhodium as a catalyst for formic acid decomposition to CO₂ and H₂; and a separate reduced metal phase forms in the reducing environment. These data are being compiled to provide a basis for predicting the HWVP feed processing chemistry as a function of feed composition and operation variables, recommending criteria for chemical adjustments, and providing guidelines with respect to important control parameters to consider during routine and upset plant operation.

  6. The complete amino acid sequence of lectin-C from the roots of pokeweed (Phytolacca americana).

    PubMed

    Yamaguchi, K; Mori, A; Funatsu, G

    1995-07-01

    The complete amino acid sequence of pokeweed lectin-C (PL-C) consisting of 126 residues has been determined. PL-C is an acidic simple protein with molecular mass of 13,747 Da and consists of three cysteine-rich domains with 51-63% homology. PL-C shows homology to chitin-binding proteins such as wheat germ agglutinin, and all eight cysteine residues in the three domains of PL-C are completely conserved in all other chitin-binding domains.

  7. Amino-acid sequence of a cooperative, dimeric myoglobin from the gastropod mollusc, Buccinum undatum L.

    PubMed

    Wen, D; Laursen, R A

    1994-10-19

    The complete amino-acid sequence of a dimeric myoglobin from the radular muscle of the gastropod mollusc, Buccinum undatum L. has been determined. The globin, which shows cooperative binding of oxygen, contains 146 amino acids, is N-terminal aminoacetylated, and has histidine residues at positions 65 and 97, corresponding to the heme-binding histidines seen in mammalian myoglobins. It shows about 75% and 50% homology, respectively, with the dimeric molluscan myoglobins from Busycon canaliculatum and Cerithidea rhizophorarum, the former of which also shows weak cooperativity, but much less similarity to other species of myoglobin and hemoglobin.

  8. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of the Salmonella-Escherichia group. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  9. Amino acid sequence of human cholinesterase. Annual report, 30 September 1984-30 September 1985

    SciTech Connect

    Lockridge, O.

    1985-10-01

    The active-site serine residue is located 198 amino acids from the N-terminal. The active-site peptide was isolated from three different genetic types of human serum cholinesterase: from usual, atypical, and atypical-silent genotypes. It was found that the amino acid sequence of the active-site peptide was identical in all three genotypes. Comparison of the complete sequences of cholinesterase from human serum and acetylcholinesterase from the electric organ of Torpedo californica shows an identity of 53%. Cholinesterase is of interest to the Department of Defense because cholinesterase protects against organophosphate poisons of the type used in chemical warfare. The structural results presented here will serve as the basis for cloning the gene for cholinesterase. The potential uses of large amounts of cholinesterase would be for cleaning up spills of organophosphates and possibly for detoxifying exposed personnel.

  10. Amino acid sequence differences in pancreatic ribonucleases from water buffalo breeds from Indonesia and Italy.

    PubMed

    Sidik, A; Martena, B; Beintema, J J

    1979-12-01

    The amino acid sequences of the pancreatic ribonucleases from river-breed water buffaloes from Italy and swamp-breed water buffaloes from Indonesia differ at three positions. One of the differences involves a replacement of asparagine-34, with covalently attached carbohydrate on all molecules, in the river-breed enzyme by serine in the swamp-breed enzyme. The ribonuclease content of the pancreas differs considerably between breeds and is lower in river buffaloes. A ribonuclease preparation from two swamp buffaloes contained a minor glycosylated component. Preliminary evidence was obtained that the amino acid sequence of this component has factors in common with the main component of the swamp-breed ribonuclease and with the river-breed enzyme.

  11. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    NASA Astrophysics Data System (ADS)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2016-10-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  12. On human disease-causing amino acid variants: statistical study of sequence and structural patterns

    PubMed Central

    Alexov, Emil

    2015-01-01

    Statistical analysis was carried out on a large set of naturally occurring human amino acid variations and it was demonstrated that there is a preference for some amino acid substitutions to be associated with diseases. At an amino acid sequence level, it was shown that the disease-causing variants frequently involve drastic changes of amino acid physico-chemical properties of proteins such as charge, hydrophobicity and geometry. Structural analysis of variants involved in diseases and being frequently observed in the human population showed similar trends: disease-causing variants tend to cause more changes of hydrogen bond network and salt bridges as compared with harmless amino acid mutations. Analysis of thermodynamics data reported in the literature, both experimental and computational, indicated that disease-causing variants tend to destabilize proteins and their interactions, which prompted us to investigate the effects of amino acid mutations on large databases of experimentally measured energy changes in unrelated proteins. Although the experimental datasets were linked neither to diseases nor exclusive to human proteins, the observed trends were the same: amino acid mutations tend to destabilize proteins and their interactions. Having in mind that structural and thermodynamics properties are interrelated, it is pointed out that any large change of any of them is anticipated to cause a disease. PMID:25689729

  13. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  14. Comparisons of the Distribution of Nucleotides and Common Sequences in Deoxyribonucleic Acid from Selected Bacteriophages

    PubMed Central

    Skalka, A.; Hanson, P.

    1972-01-01

    Results from comparisons of deoxyribonucleic acid (DNA) from several classes of bacteriophages suggest that most phage chromosomes contain either a homogeneous distribution of nucleotides or are made up of a few, rather large segments of different guanine plus cytosine (G + C) contents which are internally homogeneous. Among those temperate phages tested, most contained segmented DNA. Comparisons of sequence similarities among segments from lambdoid phage DNA species revealed the following order in relatedness to λ: 82 (and 434) > 21 > 424 > φ80. Most common sequences are found in the highest G + C segments, which in λ contain head and tail genes. Hybridization tests with λ and 186 or P2 DNA species verified that the lambdoids and 186 and P2 belong to two distinct groups. There are fewer homologous sequences between the DNA species of coliphages λ and P2 or 186 than there are between the DNA species of coliphage λ and salmonella phage P22. PMID:4553679
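
    The idea of a chromosome built from a few internally homogeneous segments of different G + C content can be illustrated with a sliding-window G + C profile. The sketch below is only an illustration: the 1972 study used physical methods rather than sequence data, and the function names, window size and toy sequence are all hypothetical.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA string."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def windowed_gc(seq, window, step):
    """Return (start, G+C fraction) for each full window along seq."""
    return [
        (start, gc_fraction(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


# Hypothetical toy "chromosome": an AT-only segment followed by a GC-only one.
toy = "AT" * 10 + "GC" * 10
profile = windowed_gc(toy, window=10, step=10)
for start, gc in profile:
    print(start, gc)
```

    A segmented molecule shows up as runs of windows with similar values separated by abrupt steps, whereas a homogeneous one gives a flat profile.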

  15. Structure of the fully modified left-handed cyclohexene nucleic acid sequence GTGTACAC.

    PubMed

    Robeyns, Koen; Herdewijn, Piet; Van Meervelt, Luc

    2008-02-13

    CeNA oligonucleotides consist of a phosphorylated backbone where the deoxyribose sugars are replaced by cyclohexene moieties. The X-ray structure determination and analysis of a fully modified octamer sequence GTGTACAC, which is the first crystal structure of a carbocyclic-based nucleic acid, is presented. This particular sequence was built with left-handed building blocks and crystallizes as a left-handed double helix. The helix can be characterized as belonging to the (mirrored) A-type family. Crystallographic data were processed up to 1.53 Å, and the octamer sequence crystallizes in the space group R32. The sugar puckering is found to adopt the 3H2 half-chair conformation which mimics the C3'-endo conformation of the ribose sugar. The double helices stack on top of each other to form continuous helices, and static disorder is observed due to this end-to-end stacking.

  16. Amino acid sequence of a protease inhibitor isolated from Sarcophaga bullata determined by mass spectrometry.

    PubMed

    Papayannopoulos, I A; Biemann, K

    1992-02-01

    The amino acid sequence of a protease inhibitor isolated from the hemolymph of Sarcophaga bullata larvae was determined by tandem mass spectrometry. Homology considerations with respect to other protease inhibitors with known primary structures assisted in the choice of the procedure followed in the sequence determination and in the alignment of the various peptides obtained from specific chemical cleavage at cysteines and enzyme digests of the S. bullata protease inhibitor. The resulting sequence of 57 residues is as follows: Val Asp Lys Ser Ala Cys Leu Gln Pro Lys Glu Val Gly Pro Cys Arg Lys Ser Asp Phe Val Phe Phe Tyr Asn Ala Asp Thr Lys Ala Cys Glu Glu Phe Leu Tyr Gly Gly Cys Arg Gly Asn Asp Asn Arg Phe Asn Thr Lys Glu Glu Cys Glu Lys Leu Cys Leu.

  17. Fatty Acid Profile and Unigene-Derived Simple Sequence Repeat Markers in Tung Tree (Vernicia fordii)

    PubMed Central

    Zhang, Lin; Jia, Baoguang; Tan, Xiaofeng; Thammina, Chandra S.; Long, Hongxu; Liu, Min; Wen, Shanna; Song, Xianliang; Cao, Heping

    2014-01-01

    Tung tree (Vernicia fordii) provides the sole source of tung oil widely used in industry. Lack of fatty acid composition and molecular markers hinders biochemical, genetic and breeding research. The objectives of this study were to determine fatty acid profiles and develop unigene-derived simple sequence repeat (SSR) markers in tung tree. Fatty acid profiles of 41 accessions showed that the ratio of α-eleostearic acid was increasing continuously with a parallel trend to the amount of tung oil accumulation while the ratios of other fatty acids were decreasing in different stages of the seeds and that α-eleostearic acid (18∶3) consisted of 77% of the total fatty acids in tung oil. Transcriptome sequencing identified 81,805 unigenes from tung cDNA library constructed using seed mRNA and discovered 6,366 SSRs in 5,404 unigenes. The di- and tri-nucleotide microsatellites accounted for 92% of the SSRs with AG/CT and AAG/CTT being the most abundant SSR motifs. Fifteen polymorphic genic-SSR markers were developed from 98 unigene loci tested in 41 cultivated tung accessions by agarose gel and capillary electrophoresis. Genbank database search identified 10 of them putatively coding for functional proteins. Quantitative PCR demonstrated that all 15 polymorphic SSR-associated unigenes were expressed in tung seeds and some of them were highly correlated with oil composition in the seeds. Dendrogram revealed that most of the 41 accessions were clustered according to the geographic region. These new polymorphic genic-SSR markers will facilitate future studies on genetic diversity, molecular fingerprinting, comparative genomics and genetic mapping in tung tree. The lipid profiles in the seeds of 41 tung accessions will be valuable for biochemical and breeding studies. PMID:25167054
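
    The abstract does not say which tool or thresholds were used to detect SSRs in the unigenes. As a minimal sketch of how di- and tri-nucleotide microsatellites can be located, the following uses a regular expression with assumed minimum repeat counts (6 for di-, 5 for tri-nucleotides); the function name, thresholds and toy sequence are hypothetical.

```python
import re


def find_ssrs(seq, min_repeats=((2, 6), (3, 5))):
    """Find perfect di-/tri-nucleotide tandem repeats.

    min_repeats pairs a motif length with an assumed minimum repeat count.
    Returns a sorted list of (start, motif, repeat_count).
    """
    seq = seq.upper()
    hits = []
    for unit_len, n in min_repeats:
        # One motif of unit_len bases, then at least n - 1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return sorted(hits)


# Hypothetical toy sequence with one (AG)8 and one (AAG)6 repeat.
toy = "TTGC" + "AG" * 8 + "CCATC" + "AAG" * 6 + "TCA"
print(find_ssrs(toy))
```

    The reported motif depends on where the scan first meets the repeat, so AG and GA (or AAG and AGA) describe the same repeat class; real pipelines usually normalize motifs to a canonical form such as AG/CT.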

  18. Some properties and amino acid sequence of plastocyanin from a green alga, Ulva arasakii.

    PubMed

    Yoshizaki, F; Fukazawa, T; Mishina, Y; Sugimura, Y

    1989-08-01

    Plastocyanin was purified from a multicellular, marine green alga, Ulva arasakii, by conventional methods to homogeneity. The oxidized plastocyanin showed absorption maxima at 252, 276.8, 460, 595.3, and 775 nm, and shoulders at 259, 265, 269, and 282.5 nm; the ratio A276.8/A595.3 was 1.5. The midpoint redox potential was determined to be 0.356 V at pH 7.0 with a ferri- and ferrocyanide system. The molecular weight was estimated to be 10,200 and 11,000 by SDS-PAGE and by gel filtration, respectively. U. arasakii also has a small amount of cytochrome c6, like Enteromorpha prolifera. The amino acid sequence of U. arasakii plastocyanin was determined by Edman degradation and by carboxypeptidase digestion of the plastocyanin, six tryptic peptides, and five staphylococcal protease peptides. The plastocyanin contained 98 amino acid residues, giving a molecular weight of 10,236 including one copper atom. The complete sequence is as follows: AQIVKLGGDDGALAFVPSKISVAAGEAIEFVNNAGFPHNIVFDEDAVPAGVDADAISYDDYLNSKGETV VRKLSTPGVY G VYCEPHAGAGMKMTITVQ. The sequence of U. arasakii plastocyanin is closest to that of the E. prolifera protein (85% homology). A phylogenetic tree of five algal and two higher plant plastocyanins was constructed by comparing the amino acid differences. The branching order is considered to be as follows: a blue-green alga, unicellular green algae, multicellular green algae, and higher plants. PMID:2509442
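
    The residue count and the sequence-derived molecular weight quoted in this abstract can be checked directly from the printed sequence. The sketch below assumes standard average residue masses (taken from general reference tables, not from the paper), adds one water for the chain termini and one copper atom, and removes the spaces in the sequence as printed; the variable names are hypothetical.

```python
# Average residue masses in Da (standard reference values; an assumption here).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153   # one H2O for the free N- and C-termini
COPPER = 63.546   # one bound copper atom, as stated in the abstract

# Sequence as printed in the abstract; embedded spaces are removed below.
printed = ("AQIVKLGGDDGALAFVPSKISVAAGEAIEFVNNAGFPHNIVFDEDAVPAGVDADAISYDDYLNSKGETV "
           "VRKLSTPGVY G VYCEPHAGAGMKMTITVQ")
sequence = printed.replace(" ", "")

n_residues = len(sequence)
mol_weight = sum(RESIDUE_MASS[aa] for aa in sequence) + WATER + COPPER

print(n_residues, round(mol_weight))
```

    Under these assumptions the result should land close to the reported 98 residues and molecular weight of 10,236; small differences can arise from the mass table used.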

  19. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment. PMID:23485423

  20. Complete amino acid sequence of chitinase-A from leaves of pokeweed (Phytolacca americana).

    PubMed

    Yamagami, T; Tanigawa, M; Ishiguro, M; Funatsu, G

    1998-04-01

    The complete amino acid sequence of pokeweed leaf chitinase-A was determined. First all 11 tryptic peptides from the reduced and S-carboxymethylated form of the enzyme were sequenced. Then the same form of the enzyme was cleaved with cyanogen bromide, giving three fragments. The fragments were digested with chymotrypsin or Staphylococcus aureus V8 protease. Last, the 11 tryptic peptides were put in order. Of seven cysteine residues, six were linked by disulfide bonds (between Cys25 and Cys74, Cys89 and Cys98, and Cys195 and Cys208); Cys176 was free. The enzyme consisted of 208 amino acid residues and had a molecular weight of 22,391. It consisted of only one polypeptide chain without a chitin-binding domain. The length of the chain was almost the same as that of the catalytic domains of class IL chitinases. These findings suggested that this enzyme is a new kind of class IIL chitinase, although its sequence resembles that of catalytic domains of class IL chitinases more than that of the class IIL chitinases reported so far. Discussion on the involvement of specific tryptophan residue in the active site of PLC-A is also given based on the sequence similarity with rye seed chitinase-c.

  1. The amino acid sequence of the aspartate aminotransferase from baker's yeast (Saccharomyces cerevisiae).

    PubMed Central

    Cronin, V B; Maras, B; Barra, D; Doonan, S

    1991-01-01

    1. The single (cytosolic) aspartate aminotransferase was purified in high yield from baker's yeast (Saccharomyces cerevisiae). 2. Amino-acid-sequence analysis was carried out by digestion of the protein with trypsin and with CNBr; some of the peptides produced were further subdigested with Staphylococcus aureus V8 proteinase or with pepsin. Peptides were sequenced by the dansyl-Edman method and/or by automated gas-phase methods. The amino acid sequence obtained was complete except for a probable gap of two residues as indicated by comparison with the structures of counterpart proteins in other species. 3. The N-terminus of the enzyme is blocked. Fast-atom-bombardment m.s. was used to identify the blocking group as an acetyl one. 4. Alignment of the sequence of the enzyme with those of vertebrate cytosolic and mitochondrial aspartate aminotransferases and with the enzyme from Escherichia coli showed that about 25% of residues are conserved between these distantly related forms. 5. Experimental details and confirmatory data for the results presented here are given in a Supplementary Publication (SUP 50164, 25 pages) that has been deposited at the British Library Document Supply Centre, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1991) 273, 5. PMID:1859361

  2. [MOLECULAR EVOLUTION OF ION CHANNELS: AMINO ACID SEQUENCES AND 3D STRUCTURES].

    PubMed

    Korkosh, V S; Zhorov, B S; Tikhonov, D B

    2016-01-01

    An integral part of modern evolutionary biology is comparative analysis of structure and function of macromolecules such as proteins. The first and critical step to understand evolution of homologous proteins is their amino acid sequence alignment. However, standard algorithms do not provide unambiguous sequence alignments for proteins of poor homology. More reliable results can be obtained by comparing experimental 3D structures obtained at atomic resolution, for instance, with the aid of X-ray structural analysis. If such structures are lacking, homology modeling is used, which may take into account indirect experimental data on functional roles of individual amino-acid residues. An important problem is that the sequence alignment, which reflects genetic modifications, does not necessarily correspond to the functional homology. The latter depends on three-dimensional structures which are critical for natural selection. Since alignment techniques relying only on the analysis of primary structures carry no information on the functional properties of proteins, including 3D structures into consideration is very important. Here we consider several examples involving ion channels and demonstrate that alignment of their three-dimensional structures can significantly improve sequence alignments obtained by traditional methods.

  3. Subgroup C avian metapneumovirus (MPV) and the recently isolated human MPV exhibit a common organization but have extensive sequence divergence in their putative SH and G genes.

    PubMed

    Toquin, D; de Boisseson, C; Beven, V; Senne, D A; Eterradossi, N

    2003-08-01

    The genes encoding the putative small hydrophobic (SH), attachment (G) and polymerase (L) proteins of the Colorado isolate of subgroup C avian pneumovirus (APV) were entirely or partially sequenced. They all included metapneumovirus (MPV)-like gene start and gene end sequences. The deduced Colorado SH protein shared 26.9 and 21.7 % aa identity with its counterpart in human MPV (hMPV) and APV subgroup A, respectively, but its only significant aa similarities were to hMPV. Conserved features included a common hydrophobicity profile with a unique transmembrane domain and the conservation of most extracellular cysteine residues. The Colorado putative G gene encoded several ORFs, the longer of which encoded a 252 aa long type II glycoprotein with aa similarities to hMPV G only (20.6 % overall aa identity with seven conserved N-terminal residues). The putative Colorado G protein shared, at best, 21.0 % aa identity with its counterparts in the other APV subgroups and did not contain the extracellular cysteine residues and short aa stretch highly conserved in other APVs. The N-terminal end of the Colorado L protein exhibited 73.6 and 54.9 % aa identity with hMPV and APV subgroup A, respectively, with four aa blocks highly conserved among Pneumovirus. Phylogenetic analysis performed on the nt sequences confirmed that the L sequences from MPVs were genetically related, whereas analysis of the G sequences revealed that among MPVs, only APV subgroups A, B and D clustered together, independently of both the Colorado isolate and hMPV, which shared weak genetic relatedness at the G gene level.
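
    Percent amino acid identity figures such as those quoted here are computed from a pairwise alignment. The abstract does not state the alignment program or the denominator used, so the following is only a sketch under one common convention: identical residues divided by the number of columns in which neither sequence has a gap. The function name and the toy aligned sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # gapped columns are excluded under this convention
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical pre-aligned toy sequences (not real SH, G or L protein data).
toy_a = "MKT-LLVAGC"
toy_b = "MRTALL-AGS"
print(percent_identity(toy_a, toy_b))
```

    Other conventions divide by the full alignment length or by the length of the shorter sequence, which gives lower values for gapped alignments, so identity figures from different studies are only comparable when the convention is known.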

  4. A Note on Divergences.

    PubMed

    Liang, Xiao

    2016-10-01

    In many areas of neural computation, like learning, optimization, estimation, and inference, suitable divergences play a key role. In this note, we study the conjecture presented by Amari (2009) and find a counterexample to show that the conjecture does not hold generally. Moreover, we investigate two classes of [Formula: see text]-divergence (Zhang, 2004), weighted f-divergence and weighted [Formula: see text]-divergence, and prove that if a divergence is a weighted f-divergence, as well as a Bregman divergence, then it is a weighted [Formula: see text]-divergence. This result reduces in form to the main theorem established by Amari (2009) when [Formula: see text] [Formula: see text].
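
    For orientation, the two families named in this abstract can be illustrated with their standard unweighted definitions, which are textbook material and not the weighted variants studied in the note. The sketch below checks numerically that the Kullback-Leibler divergence between two normalized distributions is both an f-divergence, with f(t) = t log t, and a Bregman divergence generated by negative entropy. The function names and toy distributions are hypothetical, and all probabilities are assumed strictly positive.

```python
import math


def f_divergence(p, q, f):
    """Standard f-divergence: sum_i q_i * f(p_i / q_i)."""
    return sum(qi * f(pi / qi) for pi, qi in zip(p, q))


def bregman_divergence(p, q, phi, grad_phi):
    """Bregman divergence: phi(p) - phi(q) - <grad phi(q), p - q>."""
    grad = grad_phi(q)
    inner = sum(g * (pi - qi) for g, pi, qi in zip(grad, p, q))
    return phi(p) - phi(q) - inner


def neg_entropy(p):
    return sum(pi * math.log(pi) for pi in p)


def neg_entropy_grad(p):
    return [math.log(pi) + 1.0 for pi in p]


# Hypothetical toy distributions; each sums to 1.
p = [0.5, 0.3, 0.2]
q = [0.4, 0.4, 0.2]

kl_direct = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))
kl_as_f = f_divergence(p, q, lambda t: t * math.log(t))
kl_as_bregman = bregman_divergence(p, q, neg_entropy, neg_entropy_grad)
print(kl_direct, kl_as_f, kl_as_bregman)
```

    The agreement of the three values relies on both vectors summing to one; for unnormalized positive measures the Bregman form acquires an extra linear term.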

  5. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L(+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  6. BeadCons: detection of nucleic acid sequences by flow cytometry.

    PubMed

    Horejsh, Douglas; Martini, Federico; Capobianchi, Maria Rosaria

    2005-11-01

    Molecular beacons are single-stranded nucleic acid structures with a terminal fluorophore and a distal, terminal quencher. These molecules are typically used in real-time PCR assays, but have also been conjugated with solid matrices. This unit describes protocols related to molecular beacon-conjugated beads (BeadCons), whose specific hybridization with complementary target sequences can be resolved by cytometry. Assay sensitivity is achieved through the concentration of fluorescence signal on discrete particles. By using molecular beacons with different fluorophores and microspheres of different sizes, it is possible to construct a fluid array system with each bead corresponding to a specific target nucleic acid. Methods are presented for the design, construction, and use of BeadCons for the specific, multiplexed detection of unlabeled nucleic acids in solution. The use of bead-based detection methods will likely lead to the design of new multiplex molecular diagnostic tools.

  7. Measuring nanometer distances in nucleic acids using a sequence-independent nitroxide probe

    PubMed Central

    Qin, Peter Z; Haworth, Ian S; Cai, Qi; Kusnetzow, Ana K; Grant, Gian Paola G; Price, Eric A; Sowa, Glenna Z; Popova, Anna; Herreros, Bruno; He, Honghang

    2008-01-01

    This protocol describes the procedures for measuring nanometer distances in nucleic acids using a nitroxide probe that can be attached to any nucleotide within a given sequence. Two nitroxides are attached to phosphorothioates that are chemically substituted at specific sites of DNA or RNA. Inter-nitroxide distances are measured using a four-pulse double electron–electron resonance technique, and the measured distances are correlated to the parent structures using a Web-accessible computer program. Four to five days are needed for sample labeling, purification and distance measurement. The procedures described herein provide a method for probing global structures and studying conformational changes of nucleic acids and protein/nucleic acid complexes. PMID:17947978

  8. [Partial sequence homology of FtsZ in phylogenetics analysis of lactic acid bacteria].

    PubMed

    Zhang, Bin; Dong, Xiu-zhu

    2005-10-01

    FtsZ is a structurally conserved protein, which is universal among the prokaryotes. It plays a key role in prokaryote cell division. A partial fragment of the ftsZ gene about 800 bp in length was amplified and sequenced and a partial FtsZ protein phylogenetic tree for the lactic acid bacteria was constructed. By comparing the FtsZ phylogenetic tree with the 16S rDNA tree, it was shown that the two trees were similar in topology. Both trees revealed that Pediococcus spp. were closely related with L. casei group of Lactobacillus spp., but less related with other lactic acid cocci such as Enterococcus and Streptococcus. The results also showed that the discriminative power of FtsZ was higher than that of 16S rDNA for either inter-species or inter-genus and could be a very useful tool in species identification of lactic acid bacteria. PMID:16342751

  9. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had six amino acid substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's pheasant lysozyme. The phylogenetic tree constructed by the comparison of amino acid sequences of phasianoid bird lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group. PMID:1368578

  11. Regulatory Divergence of Transcript Isoforms in a Mammalian Model System

    PubMed Central

    Thybert, David; Stefflova, Klara; Watt, Stephen; Flicek, Paul; Brazma, Alvis; Marioni, John C.; Odom, Duncan T.

    2015-01-01

    Phenotypic differences between species are driven by changes in gene expression and, by extension, by modifications in the regulation of the transcriptome. Investigation of mammalian transcriptome divergence has been restricted to analysis of bulk gene expression levels and gene-internal splicing. Using allele-specific expression analysis in inter-strain hybrids of Mus musculus, we determined the contribution of multiple cellular regulatory systems to transcriptome divergence, including: alternative promoter usage, transcription start site selection, cassette exon usage, alternative last exon usage, and alternative polyadenylation site choice. Between mouse strains, a fifth of genes have variations in isoform usage that contribute to transcriptomic changes, half of which alter encoded amino acid sequence. Virtually all divergence in isoform usage altered the post-transcriptional regulatory instructions in gene UTRs. Furthermore, most genes with isoform differences between strains contain changes originating from multiple regulatory systems. This result indicates widespread cross-talk and coordination exists among different regulatory systems. Overall, isoform usage diverges in parallel with and independently to gene expression evolution, and the cis and trans regulatory contribution to each differs significantly. PMID:26339903

  12. Rates of genomic divergence in humans, chimpanzees and their lice

    PubMed Central

    Johnson, Kevin P.; Allen, Julie M.; Olds, Brett P.; Mugisha, Lawrence; Reed, David L.; Paige, Ken N.; Pittendrigh, Barry R.

    2014-01-01

    The rate of DNA mutation and divergence is highly variable across the tree of life. However, the reasons underlying this variation are not well understood. Comparing the rates of genetic changes between hosts and parasite lineages that diverged at the same time is one way to begin to understand differences in genetic mutation and substitution rates. Such studies have indicated that the rate of genetic divergence in parasites is often faster than that of their hosts when comparing single genes. However, the variation in this relative rate of molecular evolution across different genes in the genome is unknown. We compared the rate of DNA sequence divergence between humans, chimpanzees and their ectoparasitic lice for 1534 protein-coding genes across their genomes. The rate of DNA substitution in these orthologous genes was on average 14 times faster for lice than for humans and chimpanzees. In addition, these rates were positively correlated across genes. Because this correlation only occurred for substitutions that changed the amino acid, this pattern is probably produced by similar functional constraints across the same genes in humans, chimpanzees and their ectoparasites. PMID:24403325

  13. Synthesis, Improved Antisense Activity and Structural Rationale for the Divergent RNA Affinities of 3′-Fluoro Hexitol Nucleic Acid (FHNA and Ara-FHNA) Modified Oligonucleotides

    SciTech Connect

    Egli, Martin; Pallan, Pradeep S.; Allerson, Charles R.; Prakash, Thazha P.; Berdeja, Andres; Yu, Jinghua; Lee, Sam; Watt, Andrew; Gaus, Hans; Bhat, Balkrishen; Swayze, Eric E.; Seth, Punit P.

    2012-03-16

    The synthesis, biophysical, structural, and biological properties of both isomers of 3'-fluoro hexitol nucleic acid (FHNA and Ara-FHNA) modified oligonucleotides are reported. Synthesis of the FHNA and Ara-FHNA thymine phosphoramidites was efficiently accomplished starting from known sugar precursors. Optimal RNA affinities were observed with a 3'-fluorine atom and nucleobase in a trans-diaxial orientation. The Ara-FHNA analog with an equatorial fluorine was found to be destabilizing. However, the magnitude of destabilization was sequence-dependent. Thus, the loss of stability is sharply reduced when Ara-FHNA residues were inserted at pyrimidine-purine (Py-Pu) steps compared to placement within a stretch of pyrimidines (Py-Py). Crystal structures of A-type DNA duplexes modified with either monomer provide a rationalization for the opposing stability effects and point to a steric origin of the destabilization caused by the Ara-FHNA analog. The sequence dependent effect can be explained by the formation of an internucleotide C-F···H-C pseudo hydrogen bond between F3' of Ara-FHNA and C8-H of the nucleobase from the 3'-adjacent adenosine that is absent at Py-Py steps. In animal experiments, FHNA-modified antisense oligonucleotides formulated in saline showed a potent downregulation of gene expression in liver tissue without producing hepatotoxicity. Our data establish FHNA as a useful modification for antisense therapeutics and also confirm the stabilizing influence of F(Py)···H-C(Pu) pseudo hydrogen bonds in nucleic acid structures.

  15. Binding of Plasmodium falciparum Merozoite Surface Proteins DBLMSP and DBLMSP2 to Human Immunoglobulin M Is Conserved among Broadly Diverged Sequence Variants.

    PubMed

    Crosnier, Cécile; Iqbal, Zamin; Knuepfer, Ellen; Maciuca, Sorina; Perrin, Abigail J; Kamuyu, Gathoni; Goulding, David; Bustamante, Leyla Y; Miles, Alistair; Moore, Shona C; Dougan, Gordon; Holder, Anthony A; Kwiatkowski, Dominic P; Rayner, Julian C; Pleass, Richard J; Wright, Gavin J

    2016-07-01

    Diversity at pathogen genetic loci can be driven by host adaptive immune selection pressure and may reveal proteins important for parasite biology. Population-based genome sequencing of Plasmodium falciparum, the parasite responsible for the most severe form of malaria, has highlighted two related polymorphic genes called dblmsp and dblmsp2, which encode Duffy binding-like (DBL) domain-containing proteins located on the merozoite surface but whose function remains unknown. Using recombinant proteins and transgenic parasites, we show that DBLMSP and DBLMSP2 directly and avidly bind human IgM via their DBL domains. We used whole genome sequence data from over 400 African and Asian P. falciparum isolates to show that dblmsp and dblmsp2 exhibit extreme protein polymorphism in their DBL domain, with multiple variants of two major allelic classes present in every population tested. Despite this variability, the IgM binding function was retained across diverse sequence representatives. Although this interaction did not seem to have an effect on the ability of the parasite to invade red blood cells, binding of DBLMSP and DBLMSP2 to IgM inhibited the overall immunoreactivity of these proteins to IgG from patients who had been exposed to the parasite. This suggests that IgM binding might mask these proteins from the host humoral immune system. PMID:27226583

  16. Binding of Plasmodium falciparum Merozoite Surface Proteins DBLMSP and DBLMSP2 to Human Immunoglobulin M Is Conserved among Broadly Diverged Sequence Variants*

    PubMed Central

    Crosnier, Cécile; Iqbal, Zamin; Knuepfer, Ellen; Maciuca, Sorina; Perrin, Abigail J.; Kamuyu, Gathoni; Goulding, David; Bustamante, Leyla Y.; Miles, Alistair; Moore, Shona C.; Dougan, Gordon; Holder, Anthony A.; Kwiatkowski, Dominic P.; Rayner, Julian C.; Pleass, Richard J.; Wright, Gavin J.

    2016-01-01

    Diversity at pathogen genetic loci can be driven by host adaptive immune selection pressure and may reveal proteins important for parasite biology. Population-based genome sequencing of Plasmodium falciparum, the parasite responsible for the most severe form of malaria, has highlighted two related polymorphic genes called dblmsp and dblmsp2, which encode Duffy binding-like (DBL) domain-containing proteins located on the merozoite surface but whose function remains unknown. Using recombinant proteins and transgenic parasites, we show that DBLMSP and DBLMSP2 directly and avidly bind human IgM via their DBL domains. We used whole genome sequence data from over 400 African and Asian P. falciparum isolates to show that dblmsp and dblmsp2 exhibit extreme protein polymorphism in their DBL domain, with multiple variants of two major allelic classes present in every population tested. Despite this variability, the IgM binding function was retained across diverse sequence representatives. Although this interaction did not seem to have an effect on the ability of the parasite to invade red blood cells, binding of DBLMSP and DBLMSP2 to IgM inhibited the overall immunoreactivity of these proteins to IgG from patients who had been exposed to the parasite. This suggests that IgM binding might mask these proteins from the host humoral immune system. PMID:27226583

  17. Conserved and divergent rhythms of crassulacean acid metabolism-related and core clock gene expression in the cactus Opuntia ficus-indica.

    PubMed

    Mallona, Izaskun; Egea-Cortines, Marcos; Weiss, Julia

    2011-08-01

    The cactus Opuntia ficus-indica is a constitutive Crassulacean acid metabolism (CAM) species. Current knowledge of CAM metabolism suggests that the enzyme phosphoenolpyruvate carboxylase kinase (PPCK) is circadian regulated at the transcriptional level, whereas phosphoenolpyruvate carboxylase (PEPC), malate dehydrogenase (MDH), NADP-malic enzyme (NADP-ME), and pyruvate phosphate dikinase (PPDK) are posttranslationally controlled. As little transcriptomic data are available from obligate CAM plants, we created an expressed sequence tag database derived from different organs and developmental stages. Sequences were assembled, compared with sequences in the National Center for Biotechnology Information nonredundant database for identification of putative orthologs, and mapped using Kyoto Encyclopedia of Genes and Genomes Orthology and Gene Ontology. We identified genes involved in circadian regulation and CAM metabolism for transcriptomic analysis in plants grown in long days. We identified stable reference genes for quantitative polymerase chain reaction and found that OfiSAND, like its counterpart in Arabidopsis (Arabidopsis thaliana), and OfiTUB are generally appropriate standards for use in the quantification of gene expression in O. ficus-indica. Three kinds of expression profiles were found: transcripts of OfiPPCK oscillated with a 24-h periodicity; transcripts of the light-active OfiNADP-ME and OfiPPDK genes adapted to 12-h cycles, while transcript accumulation patterns of OfiPEPC and OfiMDH were arrhythmic. Expression of the circadian clock gene OfiTOC1, similar to Arabidopsis, oscillated with a 24-h periodicity, peaking at night. Expression of OfiCCA1 and OfiPRR9, unlike in Arabidopsis, adapted best to a 12-h rhythm, suggesting that circadian clock gene interactions differ from those of Arabidopsis. Our results indicate that the evolution of CAM metabolism could be the result of modified circadian regulation at both the transcriptional and posttranscriptional

  18. N-terminal amino acid sequences and some characteristics of fibrinolytic/hemorrhagic metalloproteinases purified from Bothrops jararaca venom.

    PubMed

    Maruyama, Masugi; Sugiki, Masahiko; Anai, Keita; Yoshida, Etsuo

    2002-08-01

    We determined the N-terminal amino acid sequences of the fibrinolytic/hemorrhagic metalloproteinases (jararafibrases I, III and IV) purified from Bothrops jararaca venom. The N-terminal amino acid sequences of jararafibrase I and its degradation products were identical to those of jararhagin, another hemorrhagic metalloproteinase purified from the same snake venom. Together with enzymatic and immunological properties, we concluded that those two enzymes are identical. The N-terminal amino acid sequence of jararafibrase III was quite similar to C-type lectin isolated from Crotalus atrox, and the protein had a hemagglutinating activity on intact rat red blood cells. PMID:12165326

  19. Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

    PubMed

    Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

    2016-10-01

    In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. PMID:27375218

  20. AdoMet radical proteins--from structure to evolution--alignment of divergent protein sequences reveals strong secondary structure element conservation.

    PubMed

    Nicolet, Yvain; Drennan, Catherine L

    2004-01-01

    Eighteen subclasses of S-adenosyl-l-methionine (AdoMet) radical proteins have been aligned in the first bioinformatics study of the AdoMet radical superfamily to utilize crystallographic information. The recently resolved X-ray structure of biotin synthase (BioB) was used to guide the multiple sequence alignment, and the recently resolved X-ray structure of coproporphyrinogen III oxidase (HemN) was used as the control. Despite the low 9% sequence identity between BioB and HemN, the multiple sequence alignment correctly predicted all but one of the core helices in HemN, and correctly predicted the residues in the enzyme active site. This alignment further suggests that the AdoMet radical proteins may have evolved from half-barrel structures (αβ)4 to three-quarter-barrel structures (αβ)6 to full-barrel structures (αβ)8. It predicts that anaerobic ribonucleotide reductase (RNR) activase, an ancient enzyme that, it has been suggested, serves as a link between the RNA and DNA worlds, will have a half-barrel structure, whereas the three-quarter barrel, exemplified by HemN, will be the most common architecture for AdoMet radical enzymes, and fewer members of the superfamily will join BioB in using a complete (αβ)8 TIM-barrel fold to perform radical chemistry. These differences in barrel architecture also explain how AdoMet radical enzymes can act on substrates that range in size from 10 atoms to 608-residue proteins.

  1. Purification to homogeneity and amino acid sequence analysis of two anionic species of human interleukin 1

    PubMed Central

    1986-01-01

    Two anionic species of human IL-1 have been purified to homogeneity. These molecules were characterized as having pI of 5.4 and 5.2 and molecular weights identical to IL-1/6.8 (17,500). The specific activities of IL-1/5.4 and IL-1/5.2, as measured in the mouse thymocyte co-mitogenic assay, were identical to that of IL-1/6.8, namely 1.2 × 10^7 U/mg, with half-maximal stimulation observed at 2 × 10^-11 M. IL-1/5.4 and IL-1/5.2 were found to be antigenically distinct from IL-1/6.8 in an ELISA. IL-1/5.4 was structurally distinct from IL-1/6.8 based on reverse-phase HPLC or CNBr peptides. Intact IL-1/5.2 and three intact CNBr peptides of IL-1/5.4 were sequenced, with the identification of 74 amino acid residues. These sequences were found to correspond exactly with the amino acid sequence deduced from the IL-1-alpha cDNA reported by March et al. PMID:3487613

  2. Protein meta-functional signatures from combining sequence, structure, evolution, and amino acid property information.

    PubMed

    Wang, Kai; Horst, Jeremy A; Cheng, Gong; Nickle, David C; Samudrala, Ram

    2008-09-26

    Protein function is mediated by different amino acid residues, both their positions and types, in a protein sequence. Some amino acids are responsible for the stability or overall shape of the protein, playing an indirect role in protein function. Others play a functionally important role as part of active or binding sites of the protein. For a given protein sequence, the residues and their degree of functional importance can be thought of as a signature representing the function of the protein. We have developed a combination of knowledge- and biophysics-based function prediction approaches to elucidate the relationships between the structural and the functional roles of individual residues and positions. Such a meta-functional signature (MFS), which is a collection of continuous values representing the functional significance of each residue in a protein, may be used to study proteins of known function in greater detail and to aid in experimental characterization of proteins of unknown function. We demonstrate the superior performance of MFS in predicting protein functional sites and also present four real-world examples to apply MFS in a wide range of settings to elucidate protein sequence-structure-function relationships. Our results indicate that the MFS approach, which can combine multiple sources of information and also give biological interpretation to each component, greatly facilitates the understanding and characterization of protein function.

  3. Mitochondrial DNA sequence divergence among Schizaphis graminum (Hemiptera: Aphididae) clones from cultivated and non-cultivated hosts: haplotype and host associations.

    PubMed

    Anstead, J A; Burd, J D; Shufran, K A

    2002-02-01

    A 1.0 kb region of the mitochondrial cytochrome oxidase subunit I gene from the greenbug aphid, Schizaphis graminum (Rondani), was sequenced for 24 field collected clones from non-cultivated and cultivated hosts. Maximum likelihood, maximum parsimony and neighbour-joining phylogenies were estimated for these clones, plus 12 previously sequenced clones. All three tests produced trees with identical topologies and confirmed the presence of three clades within S. graminum. Clones showed no relationship between biotype and mtDNA haplotype. At least one biotype was found in all three clades, suggesting exchange among clades of genetic material conditioning for crop virulence, or the sharing of a common ancestor. However, there was a relationship between host and haplotype. Clade 1 was the most homogeneous and contained 12 of 16 clones collected from cultivated hosts and five of the six collected from johnsongrass, Sorghum halepense, a congener of cultivated sorghum, S. bicolor. Four of the six clones collected from Agropyron spp. were found in clade 2. Clade 3 contained two clones from wheat, Triticum aestivum, and four from non-cultivated hosts other than Agropyron spp. A partitioning of populations by mtDNA haplotype and host suggests the occurrence of host adapted races in Schizaphis graminum.

  4. Using intron sequence comparisons in the triose-phosphate isomerase gene to study the divergence of the fall armyworm host strains.

    PubMed

    Nagoshi, R N; Meagher, R L

    2016-06-01

    The noctuid moth Spodoptera frugiperda (the fall armyworm) is endemic to the Western Hemisphere and appears to be undergoing sympatric speciation to produce two subpopulations that differ in their choice of host plants. The 'rice strain' and 'corn strain' are morphologically indistinguishable, requiring the use of genetic markers for identification. Because fall armyworm is a major pest of corn and several other agricultural crops, characterizing the strains has important economic consequences. In this study, comparisons were made of the intron sequences from the triose-phosphate isomerase (Tpi) gene isolated from 85 fall armyworm specimens collected from two host plants. Sixteen new strain-specific haplotypes based on intron polymorphisms are described that can facilitate the characterization of fall armyworm populations associated with different host plants. Comparisons of genetic diversity within and between the strains provides evidence that the corn strain is undergoing active selection and supports the proposal of directional interstrain mating occurring in the wild. Comparisons of the polymorphisms indicate that each intron undergoes different patterns of mutation that in some cases corresponds to host plant preferences. The results confirm that intron sequence comparisons are an effective approach to study fall armyworm population genetics. PMID:26991678

  5. Phylogeny of Trypanosoma ( Megatrypanum ) theileri and related trypanosomes reveals lineages of isolates associated with artiodactyl hosts diverging on SSU and ITS ribosomal sequences.

    PubMed

    Rodrigues, A C; Paiva, F; Campaner, M; Stevens, J R; Noyes, H A; Teixeira, M M G

    2006-02-01

    SSU ribosomal sequences of trypanosomes from Brazilian cattle and water buffalo were used to infer phylogenetic relationships between non-pathogenic T. theileri and allied species parasitic in artiodactyls. T. theileri trypanosomes from distinct geographical regions in Brazil and from other countries were tightly clustered into the 'clade T. theileri' distant from the 'T. brucei clade' of pathogenic parasites of artiodactyls, and also distinct from trypanosomes of other mammals. The existence of this monophyletic assemblage (T. theileri clade) composed only by isolates from artiodactyl species justifies the continued recognition of the subgenus T. (Megatrypanum) with T. theileri as its type species. Phylogenies based on SSU and ITS1 ribosomal sequences produced the same branching pattern with isolates from different mammalian hosts clustered in 5 lineages: A, related to water buffalo; B, C and D, to cattle; E, to fallow deer. The pattern of host specificity allied to some congruence between host and parasite phylogenies suggested association of these trypanosomes with their respective hosts. Segregation of cattle isolates into three lineages revealed an overall geographical structure. Moreover, positioning of trypanosomes infecting tabanids in the T. theileri clade is consistent with the role of these flies as important vectors of these trypanosomes.

  6. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    PubMed

    Mohn, W W

    1995-06-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 × 10^6 propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 µmol mg of protein^-1 h^-1. (ABSTRACT TRUNCATED AT 250 WORDS)
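
    The doubling times quoted in this record can be converted to specific growth rates. The sketch below is illustrative only: it assumes simple exponential growth (an assumption not stated in the abstract), and the function name `specific_growth_rate` is hypothetical.

```python
import math

def specific_growth_rate(doubling_time_h):
    """Specific growth rate (per hour) for a given doubling time in hours.

    Assumes exponential growth, N(t) = N0 * exp(mu * t), so that
    mu = ln(2) / doubling_time. This model is an assumption, not a
    statement taken from the abstract.
    """
    if doubling_time_h <= 0:
        raise ValueError("doubling time must be positive")
    return math.log(2) / doubling_time_h

# Doubling times reported in the record for growth on DhA
for strain, t_d in (("DhA-33", 2.7), ("DhA-35", 2.2)):
    print(f"{strain}: mu = {specific_growth_rate(t_d):.3f} per hour")
```

    Under that assumption, the reported doubling times of 2.7 h and 2.2 h correspond to roughly 0.257 and 0.315 per hour.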

  7. Development of a SCAR (sequence-characterised amplified region) marker for acid resistance-related gene in Lactobacillus plantarum.

    PubMed

    Liu, Shu-Wen; Li, Kai; Yang, Shi-Ling; Tian, Shu-Fen; He, Ling

    2015-03-01

    A sequence characterised amplified region marker was developed to determine an acid resistance-related gene in Lactobacillus plantarum. A random amplified polymorphic DNA marker named S116-680 was reported to be closely related to the acid resistance of the strains. The DNA band corresponding to this marker was cloned and sequenced with the induction of specific designed PCR primers. The results of PCR test helped to amplify a clear specific band of 680 bp in the tested acid-resistant strains. S116-680 marker would be useful to explore the acid-resistant mechanism of L. plantarum and to screen desirable malolactic fermentation strains.

  8. Vorticity and Divergence in the Solar Photosphere

    NASA Astrophysics Data System (ADS)

    Wang, Yi; Noyes, Robert W.; Tarbell, Theodore D.; Title, Alan M.

    1995-07-01

    We have studied an outstanding sequence of continuum images of the solar granulation from Pic du Midi Observatory. We have calculated the horizontal vector flow field using a correlation tracking algorithm, and from this determined three scalar fields: the vertical component of the curl, the horizontal divergence, and the horizontal flow speed. The divergence field has substantially longer coherence time and more power than does the curl field. Statistically, curl is better correlated with regions of negative divergence - that is, the vertical vorticity is higher in downflow regions, suggesting excess vorticity in intergranular lanes. The average value of the divergence is largest (i.e., outflow is largest) where the horizontal speed is large; we associate these regions with exploding granules. A numerical simulation of general convection also shows similar statistical differences between curl and divergence. Some individual small bright points in the granulation pattern show large local vorticities.
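
    The three scalar fields named in this abstract can be illustrated with a minimal sketch. It assumes the horizontal flow is already available on a regular grid as two arrays of components; the correlation tracking step is not reproduced, and the function name `curl_div_speed`, the central-difference scheme, and the grid spacings `dx` and `dy` are illustrative choices rather than the authors' method.

```python
import math

def curl_div_speed(vx, vy, dx=1.0, dy=1.0):
    """Vertical curl, horizontal divergence and speed of a 2D flow field.

    vx[j][i] and vy[j][i] are the x and y flow components at row j (y)
    and column i (x). Derivatives use central differences, so results
    cover interior grid points only (the outer ring is dropped).
    """
    ny, nx = len(vx), len(vx[0])
    curl, div, speed = [], [], []
    for j in range(1, ny - 1):
        c_row, d_row, s_row = [], [], []
        for i in range(1, nx - 1):
            dvx_dx = (vx[j][i + 1] - vx[j][i - 1]) / (2 * dx)
            dvx_dy = (vx[j + 1][i] - vx[j - 1][i]) / (2 * dy)
            dvy_dx = (vy[j][i + 1] - vy[j][i - 1]) / (2 * dx)
            dvy_dy = (vy[j + 1][i] - vy[j - 1][i]) / (2 * dy)
            c_row.append(dvy_dx - dvx_dy)   # vertical component of the curl
            d_row.append(dvx_dx + dvy_dy)   # horizontal divergence
            s_row.append(math.hypot(vx[j][i], vy[j][i]))  # horizontal speed
        curl.append(c_row)
        div.append(d_row)
        speed.append(s_row)
    return curl, div, speed

# Synthetic check on a 3x3 grid centred on the origin: pure outflow (vx = x, vy = y)
xs = [[i - 1.0 for i in range(3)] for _ in range(3)]
ys = [[j - 1.0 for _ in range(3)] for j in range(3)]
curl, div, speed = curl_div_speed(xs, ys)
print(curl[0][0], div[0][0], speed[0][0])
```

    In this synthetic outflow the divergence is positive and the curl vanishes, which is the sign convention the abstract uses when it equates large divergence with outflow.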

  9. Vorticity and divergence in the solar photosphere

    NASA Technical Reports Server (NTRS)

    Wang, Yi; Noyes, Robert W.; Tarbell, Theodore D.; Title, Alan M.

    1995-01-01

    We have studied an outstanding sequence of continuum images of the solar granulation from Pic du Midi Observatory. We have calculated the horizontal vector flow field using a correlation tracking algorithm, and from this determined three scalar fields: the vertical component of the curl; the horizontal divergence; and the horizontal flow speed. The divergence field has substantially longer coherence time and more power than does the curl field. Statistically, curl is better correlated with regions of negative divergence - that is, the vertical vorticity is higher in downflow regions, suggesting excess vorticity in intergranular lanes. The average value of the divergence is largest (i.e., outflow is largest) where the horizontal speed is large; we associate these regions with exploding granules. A numerical simulation of general convection also shows similar statistical differences between curl and divergence. Some individual small bright points in the granulation pattern show large local vorticities.

  10. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (IPP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  11. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  12. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  13. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  14. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  15. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD₅₀ 0.6 µg/kg) but without effect upon mice at 15,000 µg/kg (i.p. injection). The reduced, ³H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and R. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.
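
    The residue labels quoted in this abstract (G9, P10, R13, G19, G29, W30 and the six half-cystines) can be checked against the printed sequence. The following is a minimal sketch, assuming 1-based residue numbering from the N-terminus; the helper names `residue_at` and `positions_of` are illustrative and do not come from the record.

```python
# Sequence exactly as printed in the abstract (one-letter amino acid codes).
SH_NI = "AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK"

# Residue labels quoted in the abstract, e.g. "G9" = glycine at position 9.
QUOTED_LABELS = ["G9", "P10", "R13", "G19", "G29", "W30"]


def residue_at(seq: str, position: int) -> str:
    """Return the residue at a 1-based position (assumed numbering)."""
    return seq[position - 1]


def positions_of(seq: str, residue: str) -> list[int]:
    """Return all 1-based positions at which `residue` occurs."""
    return [i for i, aa in enumerate(seq, start=1) if aa == residue]


# True only if every quoted label matches the printed sequence.
labels_match = all(
    residue_at(SH_NI, int(label[1:])) == label[0] for label in QUOTED_LABELS
)
cysteine_positions = positions_of(SH_NI, "C")

print(len(SH_NI), labels_match, cysteine_positions)
```

    Under this numbering the quoted labels agree with the printed sequence, and the count of cysteine residues matches the six half-cystines mentioned.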

  16. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  17. A new antifungal peptide from the seeds of Phytolacca americana: characterization, amino acid sequence and cDNA cloning.

    PubMed

    Shao, F; Hu, Z; Xiong, Y M; Huang, Q Z; Wang, C G; Zhu, R H; Wang, D C

    1999-03-19

    An antifungal peptide from seeds of Phytolacca americana, designated PAFP-s, has been isolated. The peptide is highly basic and consists of 38 residues with three disulfide bridges. Its molecular mass of 3929.0 was determined by mass spectrometry. The complete amino acid sequence was obtained from automated Edman degradation, and cDNA cloning was successfully performed by 3'-RACE. The deduced amino acid sequence of a partial cDNA corresponded to the amino acid sequence from chemical sequencing. PAFP-s exhibited a broad spectrum of antifungal activity, and its activities differed among various fungi. PAFP-s displayed no inhibitory activity towards Escherichia coli. PAFP-s shows significant sequence similarities and the same cysteine motif with Mj-AMPs, antimicrobial peptides from seeds of Mirabilis jalapa belonging to the knottin-type antimicrobial peptide.

  18. Amino acid sequence and variant forms of favin, a lectin from Vicia faba.

    PubMed

    Hopp, T P; Hemperly, J J; Cunningham, B A

    1982-04-25

    We have determined the complete amino acid sequence (182 residues) of the beta chain of favin, the glucose-binding lectin from fava beans (Vicia faba), and have established that the carbohydrate moiety is attached to Asn 168. Together with the sequence of the alpha chain previously reported (Hemperly, J. J., Hopp, T. P., Becker, J. W., and Cunningham, B. A. (1979) J. Biol. Chem. 254, 6803-6810), these data complete the analysis of the primary structure of the lectin. We have also examined minor polypeptides that appear in all preparations of favin. Two lower molecular weight species (Mr = 9,500-11,600) appear to be fragments of the beta chain resulting from cleavage following Asn 76, whereas six high molecular weight forms (Mr = 25,000 or greater) appear to include aggregates of the beta chain and possibly some alternative products of chain processing. PMID:7068646

  19. Pyrosequencing on templates generated by asymmetric nucleic acid sequence-based amplification (asymmetric-NASBA).

    PubMed

    Jia, Huning; Chen, Zhiyao; Wu, Haiping; Ye, Hui; Yan, Zhengyu; Zhou, Guohua

    2011-12-21

    Pyrosequencing is an ideal tool for verifying the sequence of amplicons. To enable pyrosequencing on amplicons from nucleic acid sequence-based amplification (NASBA), asymmetric NASBA with unequal concentrations of T7 promoter primer and reverse transcription primer was proposed. By optimizing the ratio of the two primers and the concentrations of dNTPs and NTPs, the amount of single-stranded cDNA in the amplicons from asymmetric NASBA was found to be increased 12-fold relative to conventional NASBA, as shown by real-time detection with a molecular beacon specific to the cDNA of interest. More than 20 bases have been successfully detected by pyrosequencing on amplicons from asymmetric NASBA using Human parainfluenza virus (HPIV) as an amplification template. The preliminary results indicate that the combination of NASBA with a pyrosequencing system is practical, and should open a new field in clinical diagnosis.

  20. Divergence in function and expression of the NOD26-like intrinsic proteins in plants

    PubMed Central

    Liu, Qingpo; Wang, Huasen; Zhang, Zhonghua; Wu, Jiasheng; Feng, Ying; Zhu, Zhujun

    2009-01-01

    Background: NOD26-like intrinsic proteins (NIPs) that belong to the aquaporin superfamily are plant-specific and exhibit a similar three-dimensional structure. Experimental evidence, however, has revealed that functional divergence should have extensively occurred among NIP genes. It is therefore intriguing to further investigate the evolutionary mechanisms responsible for the functional diversification of the NIP genes. To better understand this process, a comprehensive analysis including phylogenetic, positive selection, functional divergence, and transcriptional analyses was carried out. Results: The origin of NIPs could be dated back to the primitive land plants, and their diversification would be no younger than the emergence time of the moss P. patens. The rapid proliferation of NIPs in plants may be primarily attributed to the segmental chromosome duplication produced by polyploidy and tandem duplications. The maximum likelihood analysis revealed that NIPs should have experienced strong selective pressure for adaptive evolution after gene duplication and/or speciation, prompting the formation of distinct NIP groups. Functional divergence analysis at the amino acid level has provided strong statistical evidence for shifted evolutionary rate and/or radical change of the physiochemical properties of amino acids after gene duplication, and DIVERGE2 has identified the critical amino acid sites that are thought to be responsible for the divergence for further investigation. The expression of plant NIPs displays a distinct tissue-, cell-type-, and development-specific pattern, and their responses to various stress treatments are also quite different. The differences in organization of cis-acting regulatory elements in the promoter regions may partially explain their distinction in expression. Conclusion: A number of analyses both at the DNA and amino acid sequence levels have provided strong evidence that plant NIPs have suffered a high divergence in

  1. Morphological transformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  2. The complete amino acid sequence of R-phycocyanin-I alpha and beta subunits from the red alga Porphyridium cruentum. Structural and phylogenetic relationships of the phycocyanins within the phycobiliprotein families.

    PubMed

    Ducret, A; Sidler, W; Frank, G; Zuber, H

    1994-04-01

    We present here the complete primary structure of R-phycocyanin-I alpha and beta subunits from the red alga Porphyridium cruentum. The alpha chain is composed of 162 amino acid residues (18049 Da, calculated from sequence, including chromophore) and carries a phycocyanobilin pigment covalently linked to Cys84. The beta chain contains 172 amino acids (19344 Da, calculated from sequence, including chromophores) and carries a phycocyanobilin pigment covalently linked at Cys82 and a phycoerythrobilin pigment at Cys153. A gamma-N-methyl asparagine residue was also characterised at position beta 72, similar to other phycobiliprotein beta subunits. R-phycocyanin-I from Porphyridium cruentum shares high sequence identity with C-phycocyanins (69-83%), R-phycocyanins (66-70%) and, to a lesser extent, with phycoerythrocyanins (57-65%) from various sources. The presented phylogenetic trees are based on a comparison of all phycobiliprotein amino acid sequences known so far and confirm the clear affiliation of the R-phycocyanins in the phycocyanin family. In spite of their particular phycobilin pattern, they do not represent intermediate forms between the phycocyanin and the phycoerythrin family. Phycoerythrocyanin, a phycocyanin-related phycobiliprotein adapted to green light harvesting, is also shown to belong to the phycocyanin family. However, the phycoerythrocyanins diverge from phycocyanins in their different function and it is suggested that they should be assigned to a separate group within the phycocyanin family.

  3. The amino-acid sequences of sculpin islet somatostatin-28 and peptide YY.

    PubMed

    Cutfield, S M; Carne, A; Cutfield, J F

    1987-04-01

    Two pancreatic peptides, somatostatin-28 and peptide YY, have been isolated from the Brockmann bodies of the teleost fish Cottus scorpius (daddy sculpin). Following purification by reverse-phase HPLC, each peptide was sequenced completely through to the carboxyl-terminus by gas-phase Edman degradation. Somatostatin-28 was the major form of somatostatin detected and is similar to the gene II product from anglerfish. Peptide YY (36 amino acids) more closely resembles porcine neuropeptide YY and intestinal peptide YY than it does the pancreatic polypeptides. PMID:2883025

  4. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    PubMed

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major groove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  5. Sequence selective double strand DNA cleavage by peptide nucleic acid (PNA) targeting using nuclease S1.

    PubMed Central

    Demidov, V; Frank-Kamenetskii, M D; Egholm, M; Buchardt, O; Nielsen, P E

    1993-01-01

    A novel method for sequence specific double strand DNA cleavage using PNA (peptide nucleic acid) targeting is described. Nuclease S1 digestion of double stranded DNA gives rise to double strand cleavage at an occupied PNA strand displacement binding site, and under optimized conditions complete cleavage can be obtained. The efficiency of this cleavage is enhanced more than 10-fold when a tandem PNA site is targeted, and additionally enhanced if this site is in trans rather than in cis orientation. Thus in effect, the PNA targeting makes the single strand specific nuclease S1 behave like a pseudo restriction endonuclease. PMID:8502550

  6. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  7. WinGene/WinPep: user-friendly software for the analysis of amino acid sequences.

    PubMed

    Hennig, L

    1999-06-01

    WinGene1.0/WinPep1.2 is a pair of Microsoft Windows programs designed to read nucleotide or amino acid sequence data. These versatile programs have the following capabilities: (i) searching for open reading frames and translating them, (ii) assisting the design of primers for PCR and (iii) calculating the molecular weight, isoelectric point and molar absorption coefficients of polypeptides. Furthermore, hydropathic plots and helical wheel displays are easily produced. The programs run with an intuitive Windows interface, contain a comprehensive help file and enable data exchange with other applications by means of the Copy&Paste command. The software is free for academic and noncommercial users.
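
    To illustrate the first capability listed above (open reading frame search and translation), here is a minimal sketch. It is not WinGene's implementation: the function names `translate` and `find_orfs`, the `min_codons` parameter, and the restriction to the three forward reading frames are all assumptions made for the example. It uses the standard genetic code and treats ATG as the only start codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def translate(dna: str) -> str:
    """Translate complete codons from position 0; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def find_orfs(dna: str, min_codons: int = 2) -> list[tuple[int, int, str]]:
    """Find forward-strand ORFs (ATG ... stop) in the three reading frames.

    Returns (start, end, protein) with a 0-based start and an exclusive end
    that includes the stop codon; the protein excludes the stop.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, translate(dna[start:i])))
                start = None
    return orfs


example_orfs = find_orfs("GGATGGCCAAATAGCC")
print(example_orfs)
```

    A fuller tool would also scan the reverse complement and allow alternative start codons; those are left out to keep the sketch short.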

  8. Complete genome sequence of Lactococcus lactis IO-1, a lactic acid bacterium that utilizes xylose and produces high levels of L-lactic acid.

    PubMed

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-04-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly L-lactic acid at high xylose concentrations. From ortholog analysis with five other L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  9. Purification and amino acid sequence of aminopeptidase P from pig kidney.

    PubMed

    Vergas Romero, C; Neudorfer, I; Mann, K; Schäfer, W

    1995-04-01

    Aminopeptidase P from kidney cortex was purified in high yield (recovery greater than or equal to 20%) by a series of column chromatographic steps after solubilization of the membrane-bound glycoprotein with n-butanol. A coupled enzymic assay, using Gly-Pro-Pro-NH-Nap as substrate and dipeptidyl-peptidase IV as auxiliary enzyme, was used to monitor the purification. The purification procedure yielded two forms of aminopeptidase P differing in their carbohydrate composition (glycoforms). Both enzyme preparations were homogeneous as assessed by SDS/PAGE silver staining, and isoelectric focusing. Both forms possessed the same substrate specificity, catalysed the same reaction, and consisted of identical protein chains. The amino acid sequence determined by Edman degradation and mass spectrometry consisted of 623 amino acids. Six N-glycosylation sites, all contained in the N-terminal half of the protein, were characterized. PMID:7744038

  10. Mass spectrometric detection of the amino acid sequence polymorphism of the hepatitis C virus antigen.

    PubMed

    Kaysheva, A L; Ivanov, Yu D; Frantsuzov, P A; Krohin, N V; Pavlova, T I; Uchaikin, V F; Konev, V A; Kovalev, O B; Ziborov, V S; Archakov, A I

    2016-03-01

    A method for detection and identification of the hepatitis C virus antigen (HCVcoreAg) in human serum with consideration for possible amino acid substitutions is proposed. The method is based on a combination of biospecific capturing and concentrating of the target protein on the surface of the chip for atomic force microscope (AFM chip) with subsequent protein identification by tandem mass spectrometric (MS/MS) analysis. Biospecific AFM-capturing of viral particles containing HCVcoreAg from serum samples was performed by use of AFM chips with monoclonal antibodies (anti-HCVcore) covalently immobilized on the surface. Biospecific complexes were registered and counted by AFM. Further MS/MS analysis allowed reliable identification of the HCVcoreAg in the complexes formed on the AFM chip surface. Analysis of MS/MS spectra, taking into account the possible polymorphisms in the amino acid sequence of the HCVcoreAg, enabled us to increase the number of identified peptides.

  11. Draft Genome Sequence of Bacillus subtilis subsp. natto Strain CGMCC 2108, a High Producer of Poly-γ-Glutamic Acid

    PubMed Central

    Tan, Siyuan; Su, Anping; Zhang, Chen; Ren, Yuanyuan

    2016-01-01

    Here, we report the 4.1-Mb draft genome sequence of Bacillus subtilis subsp. natto strain CGMCC 2108, a high producer of poly-γ-glutamic acid (γ-PGA). This sequence will provide further help for the biosynthesis of γ-PGA and will greatly facilitate research efforts in metabolic engineering of B. subtilis subsp. natto strain CGMCC 2108. PMID:27231363

  12. WAViS server for handling, visualization and presentation of multiple alignments of nucleotide or amino acids sequences.

    PubMed

    Zika, Radek; Paces, Jan; Pavlícek, Adam; Paces, Václav

    2004-07-01

    Web Alignment Visualization Server contains a set of web-tools designed for quick generation of publication-quality color figures of multiple alignments of nucleotide or amino acids sequences. It can be used for identification of conserved regions and gaps within many sequences using only common web browsers. The server is accessible at http://wavis.img.cas.cz.

  13. ANTICALIgN: visualizing, editing and analyzing combined nucleotide and amino acid sequence alignments for combinatorial protein engineering.

    PubMed

    Jarasch, Alexander; Kopp, Melanie; Eggenstein, Evelyn; Richter, Antonia; Gebauer, Michaela; Skerra, Arne

    2016-07-01

    ANTICALIgN is an interactive software developed to simultaneously visualize, analyze and modify alignments of DNA and/or protein sequences that arise during combinatorial protein engineering, design and selection. ANTICALIgN combines powerful functions known from currently available sequence analysis tools with unique features for protein engineering, in particular the possibility to display and manipulate nucleotide sequences and their translated amino acid sequences at the same time. ANTICALIgN offers both template-based multiple sequence alignment (MSA), using the unmutated protein as reference, and conventional global alignment, to compare sequences that share an evolutionary relationship. The application of similarity-based clustering algorithms facilitates the identification of duplicates or of conserved sequence features among a set of selected clones. Imported nucleotide sequences from DNA sequence analysis are automatically translated into the corresponding amino acid sequences and displayed, offering numerous options for selecting reading frames, highlighting of sequence features and graphical layout of the MSA. The MSA complexity can be reduced by hiding the conserved nucleotide and/or amino acid residues, thus putting emphasis on the relevant mutated positions. ANTICALIgN is also able to handle suppressed stop codons or even to incorporate non-natural amino acids into a coding sequence. We demonstrate crucial functions of ANTICALIgN in an example of Anticalins selected from a lipocalin random library against the fibronectin extradomain B (ED-B), an established marker of tumor vasculature. Apart from engineered protein scaffolds, ANTICALIgN provides a powerful tool in the area of antibody engineering and for directed enzyme evolution.
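
    The idea of hiding conserved residues so that only mutated positions stand out can be sketched in a few lines. This is an illustration of the general technique only, not the tool's own code: the function name `mask_conserved`, the `.` placeholder, and the toy sequences are assumptions, and the sequences are taken to be already aligned to the reference (equal length, no realignment).

```python
def mask_conserved(reference: str, aligned: list[str], mask: str = ".") -> list[str]:
    """Replace every residue identical to the reference with `mask`.

    All sequences must already be aligned to the reference, i.e. have the
    same length; positions that differ from the reference are kept as-is.
    """
    masked = []
    for seq in aligned:
        if len(seq) != len(reference):
            raise ValueError("aligned sequences must match the reference length")
        masked.append(
            "".join(mask if res == ref else res for res, ref in zip(seq, reference))
        )
    return masked


# Toy data (hypothetical): one unmutated clone and one clone with two changes.
reference = "MKTAYIAK"
clones = ["MKTAYIAK", "MRTAYLAK"]
masked_clones = mask_conserved(reference, clones)
for row in masked_clones:
    print(row)
```

    The same function works on nucleotide strings, since it only compares characters column by column.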

  14. 3-d structure-based amino acid sequence alignment of esterases, lipases and related proteins

    SciTech Connect

    Gentry, M.K.; Doctor, B.P.; Cygler, M.; Schrag, J.D.; Sussman, J.L.

    1993-05-13

    Acetylcholinesterase and butyrylcholinesterase, enzymes with potential as pretreatment drugs for organophosphate toxicity, are members of a larger family of homologous proteins that includes carboxylesterases, cholesterol esterases, lipases, and several nonhydrolytic proteins. A computer-generated alignment of 18 of the proteins, the acetylcholinesterases, butyrylcholinesterases, carboxylesterases, some esterases, and the nonenzymatic proteins has been previously presented. More recently, the three-dimensional structures of two enzymes in this group, acetylcholinesterase from Torpedo californica and lipase from Geotrichum candidum, have been determined. Based on the x-ray structures and the superposition of these two enzymes, it was possible to obtain an improved amino acid sequence alignment of 32 members of this family of proteins. Examination of this alignment reveals that 24 amino acids are invariant in all of the hydrolytic proteins, and an additional 49 are well conserved. Conserved amino acids include those of the active site, the disulfide bridges, the salt bridges, in the core of the proteins, and at the edges of secondary structural elements. Comparison of the three-dimensional structures makes it possible to find a well-defined structural basis for the conservation of many of these amino acids.

  15. Transcriptome sequencing reveals genetic mechanisms underlying the transition between the laying and brooding phases and gene expression changes associated with divergent reproductive phenotypes in chickens.

    PubMed

    Shen, Xu; Bai, Xue; Xu, Jin; Zhou, Min; Xu, Haipin; Nie, Qinghua; Lu, Xuemei; Zhang, Xiquan

    2016-09-01

    Transition from laying to incubation behavior in chickens is an interesting topic in reproductive biology. The decline of incubation behavior in chicken populations has led to considerable phenotypic differences in reproductive traits between breeds. However, the exact genetic mechanism of the reproductive phase transition is still largely unknown, and little is known about the gene expression changes that contribute to the phenotypic differences. We performed mRNA sequencing to investigate the molecular mechanism underlying the transition from laying to brooding and to detect differences in gene regulation underlying the phenotypic diversification using two chicken breeds. The majority of gene expression changes during phase transition were in steroidogenesis and hormone-releasing genes. Brooding chickens shared a conserved pattern of greatly inhibited steroidogenic enzyme genes in the pituitary gland; therefore, low levels of steroidogenic enzymes might result in reproductive defects such as ovary regression and brooding onset. The conserved network responsible for brooding behavior was maintained by steroid biosynthesis and hormonal interactions. Interestingly, three transcription factors, SREBF2, NR5A1 and PGR, act as central signal modulators of steroid biosynthesis and hormonal interactions during the transition from laying to brooding modes at the molecular level. Furthermore, genes correlated with protein synthesis and accumulation showed expression variation between breeds, which might result in different concentrations of and sensitivities to reproduction-related hormones. This study provides new insight into the neuroendocrine system at the molecular level, and helps to understand the genetic and hormonal responses that ultimately translate into behavior in chickens.

  16. Multiple Amino Acid Sequence Alignment Nitrogenase Component 1: Insights into Phylogenetics and Structure-Function Relationships

    PubMed Central

    Howard, James B.; Kechris, Katerina J.; Rees, Douglas C.; Glazer, Alexander N.

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as “core” for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases

  17. Correlations Between Amino Acids at Different Sites in Local Sequences of Protein Fragments with Given Structural Patterns

    NASA Astrophysics Data System (ADS)

    Lu, Wen; Liu, Hai-yan

    2007-02-01

    Ample evidence suggests that the local structures of peptide fragments in native proteins are to some extent encoded by their local sequences. Detecting such local correlations is important but it is still an open question what would be the most appropriate method. This is partly because conventional sequence analyses treat amino acid preferences at each site of a protein sequence independently, while it is often the inter-site interactions that bring about local sequence-structure correlations. Here a new scheme is introduced to capture the correlation between amino acid preferences at different sites for different local structure types. A library of nine-residue fragments is constructed, and the fragments are divided into clusters based on their local structures. For each local structure cluster or type, chi-square tests are used to identify correlated preferences of amino acid combinations at pairs of sites. A score function is constructed including both the single site amino acid preferences and the dual-site amino acid combination preferences, which can be used to identify whether a sequence fragment would have a strong tendency to form a particular local structure in native proteins. The results show that, given a local structure pattern, dual-site amino acid combinations contain different information from single site amino acid preferences. Representative examples show that many of the statistically identified correlations agree with previously-proposed heuristic rules about local sequence-structure correlations, or are consistent with physical-chemical interactions required to stabilize particular local structures. Results also show that such dual-site correlations in the score function significantly improve the Z-score matching a sequence fragment to its native local structure relative to non-native local structures, and certain local structure types are highly predictable from the local sequence alone if inter-site correlations are considered.
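
    The chi-square test on pairs of sites mentioned in this abstract can be illustrated with a plain Pearson chi-square statistic on a contingency table of residue pairs. This is a minimal sketch of the generic test, not the authors' scheme or score function: the function name `chi_square_pair`, the 0-based site indices, and the two-residue toy fragments are assumptions, and the real study works with nine-residue fragments grouped by local structure.

```python
from collections import Counter


def chi_square_pair(
    fragments: list[str], site_i: int, site_j: int
) -> tuple[float, int]:
    """Pearson chi-square statistic for residue pairs at two sites.

    Returns (statistic, degrees_of_freedom). Expected counts assume that the
    residues at the two sites are chosen independently of each other.
    """
    n = len(fragments)
    pair_counts = Counter((f[site_i], f[site_j]) for f in fragments)
    row_counts = Counter(f[site_i] for f in fragments)
    col_counts = Counter(f[site_j] for f in fragments)

    statistic = 0.0
    for a, row_total in row_counts.items():
        for b, col_total in col_counts.items():
            expected = row_total * col_total / n
            observed = pair_counts.get((a, b), 0)
            statistic += (observed - expected) ** 2 / expected

    dof = (len(row_counts) - 1) * (len(col_counts) - 1)
    return statistic, dof


# Toy fragments (hypothetical): fully coupled sites vs. independent sites.
coupled = chi_square_pair(["AG", "AG", "LK", "LK"], 0, 1)
independent = chi_square_pair(["AG", "AK", "LG", "LK"], 0, 1)
print(coupled, independent)
```

    The statistic is zero when the pair counts match the independence expectation and grows as the two sites become coupled. With real data the expected counts in many cells are small, so the usual caution about the chi-square approximation applies.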

  18. A Hybrid Genetic Linkage Map of Two Ecologically and Morphologically Divergent Midas Cichlid Fishes (Amphilophus spp.) Obtained by Massively Parallel DNA Sequencing (ddRADSeq)

    PubMed Central

    Recknagel, Hans; Elmer, Kathryn R.; Meyer, Axel

    2013-01-01

    Cichlid fishes are an excellent model system for studying speciation and the formation of adaptive radiations because of their tremendous species richness and astonishing phenotypic diversity. Most research has focused on African rift lake fishes, although Neotropical cichlid species display much variability as well. Almost one dozen species of the Midas cichlid species complex (Amphilophus spp.) have been described so far and have formed repeated adaptive radiations in several Nicaraguan crater lakes. Here we apply double-digest restriction-site associated DNA sequencing to obtain a high-density linkage map of an interspecific cross between the benthic Amphilophus astorquii and the limnetic Amphilophus zaliosus, which are sympatric species endemic to Crater Lake Apoyo, Nicaragua. A total of 755 RAD markers were genotyped in 343 F2 hybrids. The map resolved 25 linkage groups and spans a total distance of 1427 cM with an average marker spacing distance of 1.95 cM, almost matching the total number of chromosomes (n = 24) in these species. Regions of segregation distortion were identified in five linkage groups. Based on the pedigree of parents to F2 offspring, we calculated a genome-wide mutation rate of 6.6 × 10⁻⁸ mutations per nucleotide per generation. This genetic map will facilitate the mapping of ecomorphologically relevant adaptive traits in the repeated phenotypes that evolved within the Midas cichlid lineage and, as the first linkage map of a Neotropical cichlid, facilitate comparative genomic analyses between African cichlids, Neotropical cichlids and other teleost fishes. PMID:23316439
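
    The map statistics quoted in this abstract can be cross-checked with simple arithmetic. The sketch below assumes (the abstract does not say so) that the average spacing is the total map length divided by the number of marker intervals, where each linkage group with k markers contributes k - 1 intervals; the variable names are illustrative.

```python
# Figures taken from the abstract.
total_map_length_cm = 1427
n_markers = 755
n_linkage_groups = 25

# Assumption: spacing is measured over intervals between adjacent markers,
# so each linkage group contributes (markers in group - 1) intervals.
n_intervals = n_markers - n_linkage_groups
avg_spacing_cm = total_map_length_cm / n_intervals

# Alternative reading for comparison: total length divided by marker count.
per_marker_cm = total_map_length_cm / n_markers

print(n_intervals, round(avg_spacing_cm, 2), round(per_marker_cm, 2))
```

    Under the interval-based assumption the result rounds to the reported 1.95 cM, whereas dividing by the marker count gives a slightly smaller value.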

  19. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    PubMed Central

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT 9110 and Gluconobacter japonicus strain CECT 8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains that oxidize d-glucose effectively. PMID:27365351

  20. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed Central

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-01-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses. PMID:3025846
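
    The relationship between the 1896-base open reading frame and the 631-residue protein can be shown with a small worked example. It assumes the quoted base count includes the terminating stop codon, which the abstract does not say explicitly; the function name is hypothetical.

```python
def protein_length_from_orf(orf_bases, includes_stop=True):
    """Number of amino acids encoded by an open reading frame.

    Three bases make one codon; if the stop codon is counted in the ORF
    length, it does not add a residue to the protein.
    """
    if orf_bases % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bases // 3
    return codons - 1 if includes_stop else codons


residues = protein_length_from_orf(1896)  # 632 codons, one of them a stop
```

    Under that assumption, 1896 bases give 632 codons, and removing the stop codon leaves the 631 amino acids reported.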

  1. Partial amino acid sequences around sulfhydryl groups of soybean beta-amylase.

    PubMed

    Nomura, K; Mikami, B; Morita, Y

    1987-08-01

    Sulfhydryl (SH) groups of soybean beta-amylase were modified with 5-(iodoaceto-amidoethyl)aminonaphthalene-1-sulfonate (IAEDANS) and the SH-containing peptides exhibiting fluorescence were purified after chymotryptic digestion of the modified enzyme. The sequence analysis of the peptides derived from the modification of all SH groups in the denatured enzyme revealed the existence of six SH groups, in contrast to five reported previously. One of them was found to have extremely low reactivity toward SH-reagents without reduction. In the native state, IAEDANS reacted with 2 mol of SH groups per mol of the enzyme (SH1 and SH2) accompanied with inactivation of the enzyme owing to the modification of SH2 located near the active site of this enzyme. The selective modification of SH2 with IAEDANS was attained after the blocking of SH1 with 5,5'-dithiobis-(2-nitrobenzoic acid). The amino acid sequences of the peptides containing SH1 and SH2 were determined to be Cys-Ala-Asn-Pro-Gln and His-Gln-Cys-Gly-Gly-Asn-Val-Gly-Asp-Ile-Val-Asn-Ile-Pro-Ile-Pro-Gln-Trp, respectively.

  2. Detection of piscine nodaviruses by real-time nucleic acid sequence based amplification (NASBA).

    PubMed

    Starkey, William G; Millar, Rose Mary; Jenkins, Mary E; Ireland, Jacqueline H; Muir, K Fiona; Richards, Randolph H

    2004-05-01

    Nucleic acid sequence based amplification (NASBA) is an isothermal nucleic acid amplification procedure based on target-specific primers and probes, and the co-ordinated activity of 3 enzymes: AMV reverse transcriptase, RNase H, and T7 RNA polymerase. We have developed a real-time NASBA procedure for detection of piscine nodaviruses, which have emerged as major pathogens of marine fish. Viral RNA was isolated by guanidine thiocyanate lysis followed by purification on silica particles. Primers were designed to target sequences in the nodavirus capsid protein gene, yielding an amplification product of 120 nucleotides. Amplification products were detected in real-time with a molecular beacon (FAM labelled/methyl-red quenched) that recognised an internal region of the target amplicon. Amplification and detection were performed at 41 degrees C for 90 min in a Corbett Research Rotorgene. Based on the detection of cell culture-derived nodavirus, and a synthetic RNA target, the real-time NASBA procedure was approximately 100-fold more sensitive than single-tube RT-PCR. When used to test a panel of 37 clinical samples (negative, n = 18; positive, n = 19), the real-time NASBA assay correctly identified all 18 negative and 19 positive samples. In comparison, the RT-PCR procedure identified all 18 negative samples, but only 16 of the positive samples. These results suggest that real-time NASBA may represent a sensitive and specific diagnostic procedure for piscine nodaviruses.
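
    The panel results above translate directly into sensitivity and specificity figures. This is a minimal sketch that treats the stated classification of the 37 clinical samples as the reference standard, which is an assumption; the helper name is hypothetical.

```python
def sensitivity_specificity(true_pos, false_neg, true_neg, false_pos):
    """Return (sensitivity, specificity) as fractions between 0 and 1."""
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Counts taken from the abstract: 19 positive and 18 negative samples.
nasba = sensitivity_specificity(true_pos=19, false_neg=0, true_neg=18, false_pos=0)
rt_pcr = sensitivity_specificity(true_pos=16, false_neg=3, true_neg=18, false_pos=0)
```

    On this panel both assays are fully specific, while RT-PCR detects 16 of 19 positives (about 84%) compared with 19 of 19 for real-time NASBA.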

  3. From amino acid sequence to bioactivity: The biomedical potential of antitumor peptides.

    PubMed

    Blanco-Míguez, Aitor; Gutiérrez-Jácome, Alberto; Pérez-Pérez, Martín; Pérez-Rodríguez, Gael; Catalán-García, Sandra; Fdez-Riverola, Florentino; Lourenço, Anália; Sánchez, Borja

    2016-06-01

    Chemoprevention is the use of natural and/or synthetic substances to block, reverse, or retard the process of carcinogenesis. In this field, the use of antitumor peptides is of interest because (i) these molecules are small in size, (ii) they show good cell diffusion and permeability, (iii) they affect one or more specific molecular pathways involved in carcinogenesis, and (iv) they are not usually genotoxic. We searched the Web of Science Database (23/11/2015) to collect papers reporting on bioactive peptides (1691 registers), and further filtered the results with search terms such as "antiproliferative," "antitumoral," or "apoptosis," among others. Works reporting the amino acid sequence of an antiproliferative peptide were kept (60 registers), and this was complemented with the peptides included in CancerPPD, an extensive resource for antiproliferative peptides and proteins. Peptides were grouped according to one of the following mechanisms of action: inhibition of cell migration, inhibition of tumor angiogenesis, antioxidative mechanisms, inhibition of gene transcription/cell proliferation, induction of apoptosis, disorganization of tubulin structure, cytotoxicity, or unknown mechanisms. The main mechanisms of action of those antiproliferative peptides with known amino acid sequences are presented and, finally, their potential clinical usefulness and future challenges in their application are discussed.

  4. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    PubMed

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-01

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  5. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-12-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses.

  6. Quantum skew divergence

    SciTech Connect

    Audenaert, Koenraad M. R.

    2014-11-15

    In this paper, we study the quantum generalisation of the skew divergence, which is a dissimilarity measure between distributions introduced by Lee in the context of natural language processing. We provide an in-depth study of the quantum skew divergence, including its relation to other state distinguishability measures. Finally, we present a number of important applications: new continuity inequalities for the quantum Jensen-Shannon divergence and the Holevo information, and a new and short proof of Bravyi's Small Incremental Mixing conjecture.

  7. [Genomic structure of the autotetraploid oat species Avena macrostachya inferred from comparative analysis of the ITS1 and ITS2 sequences: on the oat karyotype evolution during the early stages of the Avena species divergence].

    PubMed

    Rodionov, A V; Tiupa, N B; Kim, E S; Machs, E M; Loskutov, I G

    2005-05-01

    To examine the genomic structure of Avena macrostachya, internal transcribed spacers, ITS1 and ITS2, as well as nuclear 5.8S rRNA genes from three oat species with AsAs karyotype (A. wiestii, A. hirtula, and A. atlantica), and those from A. longiglumis (AlAl), A. canariensis (AcAc), A. ventricosa (CvCv), A. pilosa, and A. clauda (CpCp) were sequenced. All species of the genus Avena examined represented a monophyletic group (bootstrap index = 98), within which two branches, i.e., species with A- and C-genomes, were distinguished (bootstrap indices = 100). The subject of our study, A. macrostachya, albeit belonging to the phylogenetic branch of C-genome oat species (karyotype with submetacentric and subacrocentric chromosomes), has preserved an isobrachyal karyotype (i.e., one containing metacentric chromosomes), probably typical of the common Avena ancestor. It was suggested to classify the A. macrostachya genome as a specific form of C-genome, Cm-genome. Among the species from other genera studied, Arrhenatherum elatius was found to be the closest to Avena in ITS1 and ITS2 structure. Phylogenetic relationships between Avena and Helictotrichon remain intriguingly uncertain. The HPR389153 sequence from H. pratense genome was closest to the ITS1 sequences specific to the Avena A-genomes (p-distance = 0.0237), while the differences of this sequence from the ITS1 of A. macrostachya reached 0.1221. On the other hand, HAD389117 from H. adsurgens was close to the ITS1 specific to Avena C-genomes (p-distance = 0.0189), while its differences from the A-genome specific ITS1 sequences reached 0.1221. It seems likely that the appearance of highly polyploid (2n = 12-21x) species of H. pratense and H. adsurgens could be associated with interspecific hybridization involving Mediterranean oat species carrying A- and C-genomes. A hypothesis on the pathways of Avena chromosome evolution during the early stages of the oat species divergence is proposed.
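
    The p-distances quoted above are proportions of differing sites between two aligned sequences. The following is a minimal sketch of that calculation; skipping alignment columns that contain a gap (pairwise deletion) is an assumption, since the abstract does not describe gap handling, and the example sequences are invented.

```python
def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of aligned, ungapped sites at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    compared = 0
    differences = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # pairwise deletion: ignore columns with a gap
        compared += 1
        if a != b:
            differences += 1

    if compared == 0:
        raise ValueError("no ungapped sites to compare")
    return differences / compared


# Invented aligned fragments: one mismatch in ten compared sites.
d = p_distance("ACGTACGTAC", "ACGTACGTTC")
```

    Read this way, a p-distance of 0.0237 means that roughly 2.4% of the compared ITS1 positions differ between the two sequences.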

  8. Complete amino acid sequence of the myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani.

    PubMed

    Jones, B N; Wang, C C; Dwulet, F E; Lehman, L D; Meuth, J L; Bogardt, R A; Gurd, F R

    1979-04-25

    The complete amino acid sequence of the major component myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani, was determined by the automated Edman degradation of several large peptides obtained by specific cleavage of the protein. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. By subjecting four of these peptides and the apomyoglobin to automated Edman degradation, over 80% of the primary structure of the protein was obtained. The remainder of the covalent structure was determined by the sequence analysis of peptides that resulted from further digestion of the central cyanogen bromide fragment. This fragment was cleaved at its glutamyl residues with staphylococcal protease and its lysyl residues with trypsin. The action of trypsin was restricted to the lysyl residues by chemical modification of the single arginyl residue of the fragment with 1,2-cyclohexanedione. The primary structure of this myoglobin proved to be identical with that from the Atlantic bottlenosed dolphin and Pacific common dolphin but differs from the myoglobins of the killer whale and pilot whale at two positions. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea. PMID:454657

  9. Purification, amino acid sequence and characterisation of kangaroo IGF-I.

    PubMed

    Yandell, C A; Francis, G L; Wheldrake, J F; Upton, Z

    1998-01-01

    Insulin-like growth factor-I (IGF-I) and IGF-II have been purified to homogeneity from kangaroo (Macropus fuliginosus) serum; this represents the first report of the purification, sequencing and characterisation of marsupial IGFs. N-Terminal protein sequencing reveals that there are six amino acid differences between kangaroo and human IGF-I. Kangaroo IGF-II has been partially sequenced and no differences were found between human and kangaroo IGF-II in the 53 residues identified. Thus the IGFs appear to be remarkably structurally conserved during mammalian radiation. In addition, in vitro characterisation of kangaroo IGF-I demonstrated that the functional properties of human, kangaroo and chicken IGF-I are very similar. In an assay measuring the ability of the proteins to stimulate protein synthesis in rat L6 myoblasts, all IGF-I proteins were found to be equally potent. The ability of all three proteins to compete for binding with radiolabelled human IGF-I to type-1 IGF receptors in L6 myoblasts and in Sminthopsis crassicaudata transformed lung fibroblasts, a marsupial cell line, was comparable. Furthermore, kangaroo and human IGF-I react equally in a human IGF-I RIA using a human reference standard, radiolabelled human IGF-I and a polyclonal antibody raised against recombinant human IGF-I. This study indicates that not only is the primary structure of eutherian and metatherian IGF-I conserved, but also the proteins appear to be functionally similar.

  10. The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.

    PubMed

    White, S H

    1994-04-01

    This paper continues an examination of the hypothesis that modern proteins evolved from random heteropeptide sequences. In support of the hypothesis, White and Jacobs (1993, J Mol Evol 36:79-95) have shown that any sequence chosen randomly from a large collection of nonhomologous proteins has a 90% or better chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. The goal of the present study was to investigate the possibility that the random-origin hypothesis could explain the lengths of modern protein sequences without invoking specific mechanisms such as gene duplication or exon splicing. The sets of sequences examined were taken from the 1989 PIR database and consisted of 1,792 "super-family" proteins selected to have little sequence identity, 623 E. coli sequences, and 398 human sequences. The length distributions of the proteins could be described with high significance by either of two closely related probability density functions: The gamma distribution with parameter 2 or the distribution for the sum of two exponential random independent variables. A simple theory for the distributions was developed which assumes that (1) protoprotein sequences had exponentially distributed random independent lengths, (2) the length dependence of protein stability determined which of these protoproteins could fold into compact primitive proteins and thereby attain the potential for biochemical activity, (3) the useful protein sequences were preserved by the primitive genome, and (4) the resulting distribution of sequence lengths is reflected by modern proteins. The theory successfully predicts the two observed distributions which can be distinguished by the functional form of the dependence of protein stability on length. The theory leads to three interesting conclusions. First, it predicts that a tetra-nucleotide was the signal for primitive translation termination. 
This prediction is

  11. Sequence Design for a Test Tube of Interacting Nucleic Acid Strands.

    PubMed

    Wolfe, Brian R; Pierce, Niles A

    2015-10-16

    We describe an algorithm for designing the equilibrium base-pairing properties of a test tube of interacting nucleic acid strands. A target test tube is specified as a set of desired "on-target" complexes, each with a target secondary structure and target concentration, and a set of undesired "off-target" complexes, each with vanishing target concentration. Sequence design is performed by optimizing the test tube ensemble defect, corresponding to the concentration of incorrectly paired nucleotides at equilibrium evaluated over the ensemble of the test tube. To reduce the computational cost of accepting or rejecting mutations to a random initial sequence, the structural ensemble of each on-target complex is hierarchically decomposed into a tree of conditional subensembles, yielding a forest of decomposition trees. Candidate sequences are evaluated efficiently at the leaf level of the decomposition forest by estimating the test tube ensemble defect from conditional physical properties calculated over the leaf subensembles. As optimized subsequences are merged toward the root level of the forest, any emergent defects are eliminated via ensemble redecomposition and sequence reoptimization. After successfully merging subsequences to the root level, the exact test tube ensemble defect is calculated for the first time, explicitly checking for the effect of the previously neglected off-target complexes. Any off-target complexes that form at appreciable concentration are hierarchically decomposed, added to the decomposition forest, and actively destabilized during subsequent forest reoptimization. For target test tubes representative of design challenges in the molecular programming and synthetic biology communities, our test tube design algorithm typically succeeds in achieving a normalized test tube ensemble defect ≤1% at a design cost within an order of magnitude of the cost of test tube analysis.

  12. Sequence-Specific Electrical Purification of Nucleic Acids with Nanoporous Gold Electrodes.

    PubMed

    Daggumati, Pallavi; Appelt, Sandra; Matharu, Zimple; Marco, Maria L; Seker, Erkin

    2016-06-22

    Nucleic-acid-based biosensors have enabled rapid and sensitive detection of pathogenic targets; however, these devices often require purified nucleic acids for analysis since the constituents of complex biological fluids adversely affect sensor performance. This purification step is typically performed outside the device, thereby increasing sample-to-answer time and introducing contaminants. We report a novel approach using a multifunctional matrix, nanoporous gold (np-Au), which enables both detection of specific target sequences in a complex biological sample and their subsequent purification. The np-Au electrodes modified with 26-mer DNA probes (via thiol-gold chemistry) enabled sensitive detection and capture of complementary DNA targets in the presence of complex media (fetal bovine serum) and other interfering DNA fragments in the range of 50-1500 base pairs. Upon capture, the noncomplementary DNA fragments and serum constituents of varying sizes were washed away. Finally, the surface-bound DNA-DNA hybrids were released by electrochemically cleaving the thiol-gold linkage, and the hybrids were iontophoretically eluted from the nanoporous matrix. The optical and electrophoretic characterization of the analytes before and after the detection-purification process revealed that low target DNA concentrations (80 pg/μL) can be successfully detected in complex biological fluids and subsequently released to yield pure hybrids free of polydisperse digested DNA fragments and serum biomolecules. Taken together, this multifunctional platform is expected to enable seamless integration of detection and purification of nucleic acid biomarkers of pathogens and diseases in miniaturized diagnostic devices.

  13. Amino acid sequence analysis and characterization of a ribonuclease from starfish Asterias amurensis.

    PubMed

    Motoyoshi, Naomi; Kobayashi, Hiroko; Itagaki, Tadashi; Inokuchi, Norio

    2016-09-01

    The aim of this study was to phylogenetically characterize the location of the RNase T2 enzyme in the starfish (Asterias amurensis). We isolated an RNase T2 ribonuclease (RNase Aa) from the ovaries of starfish and determined its amino acid sequence by protein chemistry and cloning cDNA encoding RNase Aa. The isolated protein had 231 amino acid residues, a predicted molecular mass of 25,906 Da, and an optimal pH of 5.0. RNase Aa preferentially released guanylic acid from the RNA. The catalytic sites of the RNase T2 family are conserved in RNase Aa; furthermore, the distribution of the cysteine residues in RNase Aa is similar to that in other animal and plant T2 RNases. RNase Aa is cleaved at two points: 21 residues from the N-terminus and 29 residues from the C-terminus; however, both fragments may remain attached to the protein via disulfide bridges, leading to the maintenance of its conformation, as suggested by circular dichroism spectrum analysis. The phylogenetic analysis revealed that starfish RNase Aa is evolutionarily an intermediate between protozoan and oyster RNases. PMID:26920046

  14. Converting Transaldolase into Aldolase through Swapping of the Multifunctional Acid-Base Catalyst: Common and Divergent Catalytic Principles in F6P Aldolase and Transaldolase.

    PubMed

    Sautner, Viktor; Friedrich, Mascha Miriam; Lehwess-Litzmann, Anja; Tittmann, Kai

    2015-07-28

    Transaldolase (TAL) and fructose-6-phosphate aldolase (FSA) both belong to the class I aldolase family and share a high degree of structural similarity and sequence identity. The molecular basis of the different reaction specificities (transferase vs aldolase) has remained enigmatic. A notable difference between the active sites is the presence of either a TAL-specific Glu (Gln in FSA) or a FSA-specific Tyr (Phe in TAL). Both residues seem to have analogous multifunctional catalytic roles but are positioned at different faces of the substrate locale. We have engineered a TAL double variant (Glu to Gln and Phe to Tyr) with an active site resembling that of FSA. This variant indeed exhibits aldolase activity as its main activity with a catalytic efficiency even larger than that of authentic FSA, while TAL activity is greatly impaired. Structural analysis of this variant in complex with the dihydroxyacetone Schiff base formed upon substrate cleavage identifies the introduced Tyr (genuine in FSA) to catalyze protonation of the central carbanion-enamine intermediate as a key determinant of the aldolase reaction. Our studies pinpoint that the Glu in TAL and the Tyr in FSA, although located at different positions at the active site, similarly act as bona fide acid-base catalysts in numerous catalytic steps, including substrate binding, dehydration of the carbinolamine, and substrate cleavage. We propose that the different spatial positions of the multifunctional Glu in TAL and of the corresponding multifunctional Tyr in FSA relative to the substrate locale are critically controlling reaction specificity through either unfavorable (TAL) or favorable (FSA) geometry of proton transfer onto the common carbanion-enamine intermediate. The presence of both potential acid-base residues, Glu and Tyr, in the active site of TAL has deleterious effects on substrate binding and cleavage, most likely resulting from a differently organized H-bonding network.
Large-scale motions of the

  15. Genetic analysis of bristle loss in hybrids between Drosophila melanogaster and D. simulans provides evidence for divergence of cis-regulatory sequences in the achaete-scute gene complex.

    PubMed

    Skaer, N; Simpson, P

    2000-05-01

    The two closely related species of Drosophila, D. melanogaster and D. simulans, display an identical bristle pattern on the notum, but hybrids between the two are lacking a variable number of bristles. We show that the loss is temperature-dependent and provide evidence for two periods of temperature sensitivity. A first period of heat sensitivity occurs during larval development and corresponds to the time when the prepattern of expression of genes whose products activate achaete-scute in the proneural clusters preceding bristle precursor formation is established. A second period of cold sensitivity corresponds to the time of emergence of the bristle precursor cells and the maintenance of their neural fate, a process requiring high levels of Achaete-Scute. Expression of achaete-scute at these two critical periods depends on cis-regulatory elements of the achaete-scute complex (AS-C). The differences between males, which have only one copy of the X-linked AS-C from D. simulans, and females, which have copies from both parental species, are compared, together with the effects of crossing in different rearrangements of the D. melanogaster AS-C that delete regulatory and/or coding sequences. We provide evidence that bristle loss in the hybrids may result from a decrease in the level of transcription at the AS-C and argue that interaction between trans-acting factors and cis-regulatory elements within the AS-C has diverged between the two species.

  16. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    SciTech Connect

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-07-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO4/PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  17. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning.

    PubMed

    Takahashi, N; Takahashi, Y; Blumberg, B S; Putnam, F W

    1987-07-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO4/PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  18. Amino acid sequences of neuropeptides in the sinus gland of the land crab Cardisoma carnifex: a novel neuropeptide proteolysis site.

    PubMed

    Newcomb, R W

    1987-08-01

    The sinus gland is a major neurosecretory structure in Crustacea. Five peptides, labeled C, D, E, F, and I, isolated from the sinus gland of the land crab have been hypothesized to arise from the incomplete proteolysis at two internal sites on a single biosynthetic intermediate peptide "H", based on amino acid composition additivities and pulse-chase radiolabeling studies. The presence of only a single major precursor for the sinus gland peptides implies that peptide H may be synthesized on a common precursor with crustacean hyperglycemic hormone forms, "J" and "L," and a peptide, "K," similar to peptides with molt inhibiting activity. Here I report amino acid sequences of these peptides. The amino terminal sequence of the parent peptide, H, (and the homologous fragments) proved refractory to Edman degradation. Data from amino acid analysis and carboxypeptidase digestion of the naturally occurring fragments and of fragments produced by endopeptidase digestion were used together with Edman degradation to obtain the sequences. Amino acid analysis of fragments of the naturally occurring "overlap" peptides (those produced by internal cleavage at one site on H) was used to obtain the sequences across the cleavage sites. The amino acid sequence of the land crab peptide H is Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly-Ser-Ala-Glu-Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu. In vivo cleavage at one site involves excision of arginine from the sequence Leu-Arg-Gly, whereas cleavage at the other site involves excision of serine from the sequence Glu-Ser-Leu. Proteolysis at the latter sequence has not been previously reported in intact secretory granules. The aspartate at position 4 is possibly covalently modified.
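
    The two excision sites can be located in the reported peptide H sequence with a short script. This is only an illustrative sketch: the helper names are hypothetical, and the abstract does not say which resulting fragment corresponds to which of the labeled peptides.

```python
# Peptide H sequence as given in the abstract (three-letter codes).
PEPTIDE_H = (
    "Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly-"
    "Ser-Ala-Glu-Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu"
).split("-")


def excised_index(residues, motif):
    """0-based index of the middle residue of a tripeptide motif that must
    occur exactly once in the sequence."""
    hits = [i + 1 for i in range(len(residues) - 2) if residues[i:i + 3] == motif]
    if len(hits) != 1:
        raise ValueError(f"motif {motif} found {len(hits)} times")
    return hits[0]


def fragments(residues, excised):
    """Split the sequence at the excised positions, dropping those residues."""
    pieces, start = [], 0
    for idx in sorted(excised):
        pieces.append(residues[start:idx])
        start = idx + 1
    pieces.append(residues[start:])
    return pieces


ser_site = excised_index(PEPTIDE_H, ["Glu", "Ser", "Leu"])  # serine excised here
arg_site = excised_index(PEPTIDE_H, ["Leu", "Arg", "Gly"])  # arginine excised here

# Enumerate every fragment that cleavage at one or both sites could produce.
distinct = set()
for sites in ([ser_site], [arg_site], [ser_site, arg_site]):
    for piece in fragments(PEPTIDE_H, sites):
        distinct.add("-".join(piece))

complete_digest_lengths = [len(p) for p in fragments(PEPTIDE_H, [ser_site, arg_site])]
```

    In the 36-residue sequence, the serine is residue 11 and the arginine is residue 17. Cleavage at one or both sites can yield five distinct fragments, the same number as the peptides said to derive from H.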

  19. Pathogenic Entamoeba histolytica: cDNA cloning of a histone H3 with a divergent primary structure.

    PubMed

    Födinger, M; Ortner, S; Plaimauer, B; Wiedermann, G; Scheiner, O; Duchêne, M

    1993-06-01

    Entamoeba histolytica has an unusual nuclear structure characterized by a low degree of chromatin condensation and the absence of stainable metaphase chromosomes. Although nucleosome-like particles were observed, no information about histones was available so far. In this paper we describe a cDNA clone with significant homology to H3 histones that was isolated from a library of pathogenic E. histolytica. The complete cDNA encodes a 15-kDa polypeptide, which like the histone sequence from Volvox carteri is shorter by one residue than the human homologue. The amino acid sequence has only 69% identity with human H3.3 histone and 67% identity with the human H3.1 histone. This is the highest degree of sequence divergence observed for any eukaryote H3 histone sequence. Our results indicate that this divergence may contribute to the unusual chromatin structure of E. histolytica. PMID:8341328
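
    Identity figures such as the 69% and 67% quoted above are typically computed from a pairwise alignment. The sketch below shows one common convention, offered as an assumption because the abstract does not say which was used: columns containing a gap in either sequence are skipped, and identity is the share of the remaining columns that match. The function name and the toy sequences are hypothetical and are not histone data.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over ungapped columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration only.
    print(percent_identity("ARTKQ-A", "ARSKQTA"))
```

    Other conventions divide by the alignment length or by the length of the shorter sequence, so a reported percentage is only comparable when the denominator is known.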

  20. Enzyme-free translation of DNA into sequence-defined synthetic polymers structurally unrelated to nucleic acids

    NASA Astrophysics Data System (ADS)

    Niu, Jia; Hili, Ryan; Liu, David R.

    2013-04-01

    The translation of DNA sequences into corresponding biopolymers enables the production, function and evolution of the macromolecules of life. In contrast, methods to generate sequence-defined synthetic polymers with similar levels of control have remained elusive. Here, we report the development of a DNA-templated translation system that enables the enzyme-free translation of DNA templates into sequence-defined synthetic polymers that have no necessary structural relationship with nucleic acids. We demonstrate the efficiency, sequence-specificity and generality of this translation system by oligomerizing building blocks including polyethylene glycol, α-(D)-peptides, and β-peptides in a DNA-programmed manner. Sequence-defined synthetic polymers with molecular weights of 26 kDa containing 16 consecutively coupled building blocks and 90 densely functionalized β-amino acid residues were translated from DNA templates using this strategy. We integrated the DNA-templated translation system developed here into a complete cycle of translation, coding sequence replication, template regeneration and re-translation suitable for the iterated in vitro selection of functional sequence-defined synthetic polymers unrelated in structure to nucleic acids.

  1. Boronic acid functionalized peptidyl synthetic lectins: Combinatorial library design, peptide sequencing, and selective glycoprotein recognition

    PubMed Central

    Bicker, Kevin L.; Sun, Jing; Lavigne, John J.; Thompson, Paul R.

    2011-01-01

    Aberrant glycosylation of cell membrane and secreted glycoproteins is a hallmark of various disease states, including cancer. The natural lectins currently used in the recognition of these glycoproteins are costly, difficult to produce, and unstable towards rigorous use. Herein we describe the design and synthesis of several boronic acid functionalized peptide-based synthetic lectin (SL) libraries, as well as the optimized methodology for obtaining peptide sequences of these SLs. SL libraries were subsequently used to identify SLs with as high as 5-fold selectivity for various glycoproteins. SLs will inevitably find a role in cancer diagnostics, given that they do not suffer from the drawbacks of natural lectins and that the combinatorial nature of these libraries allows for the identification of an SL for nearly any glycosylated biomolecule. PMID:21405093

  2. Kinetics of amyloid aggregation of mammal apomyoglobins and correlation with their amino acid sequences.

    PubMed

    Vilasi, Silvia; Dosi, Roberta; Iannuzzi, Clara; Malmo, Clorinda; Parente, Augusto; Irace, Gaetano; Sirangelo, Ivana

    2006-03-01

    In protein deposition disorders, a normally soluble protein is deposited as insoluble aggregates, referred to as amyloid. The intrinsic effects of specific mutations on the rates of protein aggregation and amyloid formation of unfolded polypeptide chains can be correlated with changes in hydrophobicity, propensity to convert alpha-helical to beta sheet conformation and charge. In this paper, we report the aggregation rates of buffalo, horse and bovine apomyoglobins. The experimental values were compared with the theoretical ones evaluated considering the amino acid differences among the sequences. Our results show that the mutations which play critical roles in the rate-determining step of apomyoglobin aggregation are those located within the N-terminal region of the molecule.

  3. GAWK, a novel human pituitary polypeptide: isolation, immunocytochemical localization and complete amino acid sequence.

    PubMed

    Benjannet, S; Leduc, R; Lazure, C; Seidah, N G; Marcinkiewicz, M; Chrétien, M

    1985-01-16

    During the course of reverse-phase high pressure liquid chromatography (RP-HPLC) purification of a postulated big ACTH (1) from human pituitary gland extracts, a highly purified peptide bearing no resemblance to any known polypeptide was isolated. The complete sequence of this 74 amino acid polypeptide, called GAWK, has been determined. Search on a computer data bank on the possible homology to any known protein or fragment, using a mutation data matrix, failed to reveal any homology greater than 30%. An antibody produced against a synthetic fragment allowed us to detect several immunoreactive forms. The antisera also enabled us to localize the polypeptide, by immunocytochemistry, in the anterior lobe of the pituitary gland.

  4. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes, cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  5. Identification of amino acid sequences in the polyomavirus capsid proteins that serve as nuclear localization signals

    NASA Technical Reports Server (NTRS)

    Chang, D.; Haynes, J. I. Jr; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    The molecular mechanism participating in the transport of newly synthesized proteins from the cytoplasm to the nucleus in mammalian cells is poorly understood. Recently, the nuclear localization signal sequences (NLS) of many nuclear proteins have been identified, and most have been found to be composed of a highly basic amino acid stretch. A genetic "subtractive" and a biochemical "additive" approach were used in our studies to identify the NLS's of the polyomavirus structural capsid proteins. An NLS was identified at the N-terminus (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) of the major capsid protein VP1 and at the C-terminus (Glu307-Glu-Asp-Gly-Pro-Glu-Lys-Lys-Lys-Arg-Arg-Leu318) of the VP2/VP3 minor capsid proteins.

  6. Purification, properties and complete amino acid sequence of the ferredoxin from a green alga, Chlamydomonas reinhardtii.

    PubMed

    Schmitter, J M; Jacquot, J P; de Lamotte-Guéry, F; Beauvallet, C; Dutka, S; Gadal, P; Decottignies, P

    1988-03-01

    The ferredoxin was purified from the green alga, Chlamydomonas reinhardtii. The protein showed typical absorption and circular dichroism spectra of a [2Fe-2S] ferredoxin. When compared with spinach ferredoxin, the C. reinhardtii protein was less effective in the catalysis of NADP+ photoreduction, but its activity was higher in the light activation of C. reinhardtii malate dehydrogenase (NADP). The complete amino acid sequence was determined by automated Edman degradation of the whole protein and of peptides obtained by trypsin and chymotrypsin digestions and by CNBr cleavage. The protein consists of 94 residues, with Tyr at both NH2 and COOH termini. The positions of the four cysteines binding the two iron atoms are similar to those found in other [2Fe-2S] ferredoxins. The primary structure of C. reinhardtii ferredoxin showed a great homology (about 80%) with ferredoxins from two other green algae.

  7. Real-time nucleic acid sequence-based amplification in nanoliter volumes.

    PubMed

    Gulliksen, Anja; Solli, Lars; Karlsen, Frank; Rogne, Henrik; Hovig, Eivind; Nordstrøm, Trine; Sirevåg, Reidun

    2004-01-01

    Real-time nucleic acid sequence-based amplification (NASBA) is an isothermal method specifically designed for amplification of RNA. Fluorescent molecular beacon probes enable real-time monitoring of the amplification process. Successful identification, utilizing the real-time NASBA technology, was performed on a microchip with oligonucleotides at a concentration of 1.0 and 0.1 microM, in 10- and 50-nL reaction chambers, respectively. The microchip was developed in a silicon-glass structure. An instrument providing thermal control and an optical detection system was built for amplification readout. Experimental results demonstrate distinct amplification processes. Miniaturized real-time NASBA in microchips makes high-throughput diagnostics of bacteria, viruses, and cancer markers possible, at reduced cost and without contamination.

  8. Real-time nucleic acid sequence-based amplification assay for detection of hepatitis A virus.

    PubMed

    Abd el-Galil, Khaled H; el-Sokkary, M A; Kheira, S M; Salazar, Andre M; Yates, Marylynn V; Chen, Wilfred; Mulchandani, Ashok

    2005-11-01

    A nucleic acid sequence-based amplification (NASBA) assay in combination with a molecular beacon was developed for the real-time detection and quantification of hepatitis A virus (HAV). A 202-bp, highly conserved 5' noncoding region of HAV was targeted. The sensitivity of the real-time NASBA assay was tested with 10-fold dilutions of viral RNA, and a detection limit of 1 PFU was obtained. The specificity of the assay was demonstrated by testing with other environmental pathogens and indicator microorganisms, with only HAV positively identified. When combined with immunomagnetic separation, the NASBA assay successfully detected as few as 10 PFU from seeded lake water samples. Due to its isothermal nature, its speed, and its similar sensitivity compared to the real-time RT-PCR assay, this newly reported real-time NASBA method will have broad applications for the rapid detection of HAV in contaminated food or water.

  9. Detection of infectious salmon anaemia virus by real-time nucleic acid sequence based amplification.

    PubMed

    Starkey, William G; Smail, David A; Bleie, Hogne; Muir, K Fiona; Ireland, Jacqueline H; Richards, Randolph H

    2006-10-17

    We have developed a real-time nucleic acid sequence based amplification (NASBA) procedure for detection of infectious salmon anaemia virus (ISAV). Primers were designed to target a 124 nucleotide region of ISAV genome segment 8. Amplification products were detected in real-time with a molecular beacon (carboxyfluorescein [FAM]-labelled and methyl-red quenched) that recognised an internal region of the target amplicon. Amplification and detection were performed at 41 °C for 90 min in a Corbett Research Rotorgene. The real-time NASBA assay was compared to a conventional RT-PCR for ISAV detection. From a panel of 45 clinical samples, both assays detected ISAV in the same 19 samples. Based on the detection of a synthetic RNA target, the real-time NASBA procedure was approximately 100x more sensitive than conventional RT-PCR. These results suggest that real-time NASBA may represent a useful diagnostic procedure for ISAV.

  10. Sequence-defined shuttles for targeted nucleic acid and protein delivery.

    PubMed

    Röder, Ruth; Wagner, Ernst

    2014-01-01

    Molecular medicine opens into a space of novel specific therapeutic agents: intracellularly active drugs such as peptides, proteins or nucleic acids, which are not able to cross cell membranes and enter the intracellular space on their own. Through the development of cell-targeted shuttles for specific delivery, this restriction in delivery has the potential to be converted into an advantage. On the one hand, due to the multiple extra- and intracellular barriers, such carrier systems need to be multifunctional. On the other hand, they must be precise and reproducibly manufactured due to pharmaceutical reasons. Here we review the design of precise sequence-defined delivery carriers, including solid-phase synthesized peptides and nonpeptidic oligomers, or nucleotide-based carriers such as aptamers and origami nanoboxes.

  11. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    PubMed Central

    Axelsen, Jacob Bock; Yan, Koon-Kiu; Maslov, Sergei

    2007-01-01

    Background: The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results: We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p^(-γ) with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion: We separately measure the short-term ("raw") duplication and deletion rates r_dup*, r_del* which include gene copies that will be removed soon after the duplication event and their dramatically reduced long-term counterparts r_dup, r_del. High deletion rate among recently duplicated proteins is consistent with a scenario in which they didn't have enough time to significantly change their functional roles and thus are to a large degree disposable. Systematic trends of each of the four duplication/deletion rates with the total number of genes in the genome were analyzed. All but the deletion rate of recent duplicates r_del* were shown to systematically increase with N_genes. Abnormally flat shapes
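
    The power-law fit mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not describe the fitting procedure, so the code below assumes the simplest option: an ordinary least-squares line through the histogram on log-log axes, whose slope is the negative of the exponent γ. The function name is hypothetical, and the histogram is synthetic, generated from an exact power law with γ = 4 so that the fit has a known answer; it is not data from the study.

```python
import math


def fit_power_law_exponent(identities, counts):
    """Estimate gamma in counts ~ identities**(-gamma) by log-log least squares."""
    points = [
        (math.log(p), math.log(c))
        for p, c in zip(identities, counts)
        if p > 0 and c > 0  # logarithms need strictly positive values
    ]
    if len(points) < 2:
        raise ValueError("need at least two non-empty bins")
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return -slope  # counts fall with identity, so gamma is minus the slope


if __name__ == "__main__":
    # Synthetic histogram: bin centres of sequence identity and exact p**-4 counts.
    bins = [0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
    counts = [1000.0 * p ** -4 for p in bins]
    print(round(fit_power_law_exponent(bins, counts), 6))
```

    Least squares on binned, log-transformed counts is known to be a rough estimator for real data, so this should be read as a demonstration of the functional form rather than a recommended analysis.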

  12. Genomic analysis of cultivated barley (Hordeum vulgare) using sequence-tagged molecular markers. Estimates of divergence based on RFLP and PCR markers derived from stress-responsive genes, and simple-sequence repeats (SSRs).

    PubMed

    Maestri, E; Malcevschi, A; Massari, A; Marmiroli, N

    2002-04-01

    Three types of molecular markers have been compared for their utility in evaluating genetic diversity among cultivars of Hordeum vulgare. Restriction fragment length polymorphisms at 71 sites were scored with the aid of probes corresponding to stress-responsive genes from barley and wheat, coding for a low-molecular-weight heat shock protein, a dehydrin, an aldose reductase homolog, and an 18.9-kDa drought-induced protein of unknown function. Indexes of genetic diversity computed in the total sample and within groups of cultivars (two-rowed and six-rowed, winter and spring varieties) indicated high values of genetic differentiation (F(ST) > 15%). A second assessment of genetic diversity was performed by PCR amplification of genomic DNA using as primers 13 arbitrary oligonucleotides derived from sequences of the same stress-responsive genes. A high degree of polymorphism was uncovered using these markers also, but they yielded low values for F(ST) (< 7%) among groups of cultivars. Finally, 15 different simple-sequence repeats (AC or AG) were amplified with primers based on unique flanking sequences. Levels of polymorphism and differentiation between groups of cultivars revealed by these markers were quite high. Ordination techniques applied to measures of genetic distance among cultivars demonstrated a remarkable ability of the RFLPs associated with stress-responsive genes to discriminate on the basis of growth habit. The correlation with production data for the cultivars in different environments was also significant. This "functional genomics" strategy was therefore as informative as the "structural genomics" (SSR-based) approach, but requires the analysis of fewer probes. PMID:11976962

  13. Trypsin inhibitors from ridged gourd (Luffa acutangula Linn.) seeds: purification, properties, and amino acid sequences.

    PubMed

    Haldar, U C; Saha, S K; Beavis, R C; Sinha, N K

    1996-02-01

    Two trypsin inhibitors, LA-1 and LA-2, have been isolated from ridged gourd (Luffa acutangula Linn.) seeds and purified to homogeneity by gel filtration followed by ion-exchange chromatography. The isoelectric point is at pH 4.55 for LA-1 and at pH 5.85 for LA-2. The Stokes radius of each inhibitor is 11.4 Å. The fluorescence emission spectrum of each inhibitor is similar to that of the free tyrosine. The bimolecular rate constant of acrylamide quenching is 1.0 × 10^9 M^-1 sec^-1 for LA-1 and 0.8 × 10^9 M^-1 sec^-1 for LA-2 and that of K2HPO4 quenching is 1.6 × 10^11 M^-1 sec^-1 for LA-1 and 1.2 × 10^11 M^-1 sec^-1 for LA-2. Analysis of the circular dichroic spectra yields 40% alpha-helix and 60% beta-turn for LA-1 and 45% alpha-helix and 55% beta-turn for LA-2. Inhibitors LA-1 and LA-2 consist of 28 and 29 amino acid residues, respectively. They lack threonine, alanine, valine, and tryptophan. Both inhibitors strongly inhibit trypsin by forming enzyme-inhibitor complexes at a molar ratio of unity. A chemical modification study suggests the involvement of arginine of LA-1 and lysine of LA-2 in their reactive sites. The inhibitors are very similar in their amino acid sequences, and show sequence homology with other squash family inhibitors. PMID:8924202

  14. Microfluidic platform for isolating nucleic acid targets using sequence specific hybridization

    PubMed Central

    Wang, Jingjing; Morabito, Kenneth; Tang, Jay X.; Tripathi, Anubhav

    2013-01-01

    The separation of target nucleic acid sequences from biological samples has emerged as a significant process in today's diagnostics and detection strategies. In addition to the possible clinical applications, the fundamental understanding of target and sequence specific hybridization on surface modified magnetic beads is of high value. In this paper, we describe a novel microfluidic platform that utilizes a mobile magnetic field in static microfluidic channels, where single stranded DNA (ssDNA) molecules are isolated via nucleic acid hybridization. We first established efficient isolation of biotinylated capture probe (BP) using streptavidin-coated magnetic beads. Subsequently, we investigated the hybridization of target ssDNA with BP bound to beads and explained these hybridization kinetics using a dual-species kinetic model. The number of hybridized target ssDNA molecules was determined to be about 6.5 times less than that of BP on the bead surface, due to steric hindrance effects. The hybridization of target ssDNA with non-complementary BP bound to beads was also examined, and non-specific hybridization was found to be insignificant. Finally, we demonstrated highly efficient capture and isolation of target ssDNA in the presence of non-target ssDNA, in which as little as 1% target ssDNA can be detected in the mixture. The microfluidic method described in this paper is broadly applicable, especially towards point-of-care biological diagnostic platforms that require binding and separation of known target biomolecules, such as RNA, ssDNA, or protein. PMID:24404041

  15. Detection of Vibrio cholerae by real-time nucleic acid sequence-based amplification.

    PubMed

    Fykse, Else M; Skogan, Gunnar; Davies, William; Olsen, Jaran Strand; Blatny, Janet M

    2007-03-01

    A multitarget molecular beacon-based real-time nucleic acid sequence-based amplification (NASBA) assay for the specific detection of Vibrio cholerae has been developed. The genes encoding the cholera toxin (ctxA), the toxin-coregulated pilus (tcpA; colonization factor), the ctxA toxin regulator (toxR), hemolysin (hlyA), and the 60-kDa chaperonin product (groEL) were selected as target sequences for detection. The beacons for the five different genetic targets were evaluated by serial dilution of RNA from V. cholerae cells. RNase treatment of the nucleic acids eliminated all NASBA, whereas DNase treatment had no effect, showing that RNA and not DNA was amplified. The specificity of the assay was investigated by testing several isolates of V. cholerae, other Vibrio species, and Bacillus cereus, Salmonella enterica, and Escherichia coli strains. The toxR, groEL, and hlyA beacons identified all V. cholerae isolates, whereas the ctxA and tcpA beacons identified the O1 toxigenic clinical isolates. The NASBA assay detected V. cholerae at 50 CFU/ml by using the general marker groEL and tcpA that specifically indicates toxigenic strains. A correlation between cell viability and NASBA was demonstrated for the ctxA, toxR, and hlyA targets. RNA isolated from different environmental water samples spiked with V. cholerae was specifically detected by NASBA. These results indicate that NASBA can be used in the rapid detection of V. cholerae from various environmental water samples. This method has a strong potential for detecting toxigenic strains by using the tcpA and ctxA markers. The entire assay including RNA extraction and NASBA was completed within 3 h.

  16. Phylogenetic analysis of beta-papillomaviruses as inferred from nucleotide and amino acid sequence data.

    PubMed

    Gottschling, Marc; Köhler, Anja; Stockfleth, Eggert; Nindl, Ingo

    2007-01-01

    Human papillomaviruses (HPV) of the beta-group seem to be involved in the pathogenesis of non-melanoma skin cancer. Papillomaviruses are host specific and are considered closely co-evolving with their hosts. Evolutionary incongruence between early genes and late genes has been reported among oncogenic genital alpha-papillomaviruses and considerably challenges phylogenetic reconstructions. We investigated the relationships of 29 beta-HPV (25 types plus four putative new types, subtypes, or variants) as inferred from codon aligned and amino acid sequence data of the genes E1, E2, E6, E7, L1, and L2 using likelihood, distance, and parsimony approaches. An analysis of a L1 fragment included additional nucleotide and amino acid sequences from seven non-human beta-papillomaviruses. Early genes and late genes evolution did not conflict significantly in beta-papillomaviruses based on partition homogeneity tests (p ≥ 0.001). As inferred from the complete genome analyses, beta-papillomaviruses were monophyletic and segregated into four highly supported monophyletic assemblages corresponding to the species 1, 2, 3, and fused 4/5. They basically split into the species 1 and the remainder of beta-papillomaviruses, whose species 3, 4, and 5 constituted the sister group of species 2. beta-Papillomaviruses have been isolated from humans, apes, and monkeys, and phylogenetic analyses of the L1 fragment showed non-human papillomaviruses highly polyphyletic nesting within the HPV species. Thus, host and virus phylogenies were not congruent in beta-papillomaviruses, and multiple invasions across species borders may contribute (additionally to host-linked evolution) to their diversification.

  17. Evolutionary Divergence of Arabidopsis thaliana Classical Peroxidases.

    PubMed

    Kupriyanova, E V; Mamoshina, P O; Ezhova, T A

    2015-10-01

    Polymorphisms of 62 peroxidase genes derived from Arabidopsis thaliana were investigated to evaluate evolutionary dynamics and divergence of peroxidase proteins. By comparing divergence of duplicated genes AtPrx53-AtPrx54 and AtPrx36-AtPrx72 and their products, nucleotide and amino acid substitutions were identified that were apparently targets of positive selection. These substitutions were detected among paralogs of 461 ecotypes from Arabidopsis thaliana. Some of these substitutions are conservative and matched paralogous peroxidases in other Brassicaceae species. These results suggest that after duplication, peroxidase genes evolved under the pressure of positive selection, and amino acid substitutions identified during our study provided divergence of properties and physiological functions in peroxidases. Our predictions regarding functional significance for amino acid residues identified in variable sites of peroxidases may allow further experimental assessment of evolution of peroxidases after gene duplication.

  18. A single molecular beacon probe is sufficient for the analysis of multiple nucleic acid sequences.

    PubMed

    Gerasimova, Yulia V; Hayson, Aaron; Ballantyne, Jack; Kolpashchikov, Dmitry M

    2010-08-16

    Molecular beacon (MB) probes are dual-labeled hairpin-shaped oligodeoxyribonucleotides that are extensively used for real-time detection of specific RNA/DNA analytes. In the MB probe, the loop fragment is complementary to the analyte: therefore, a unique probe is required for the analysis of each new analyte sequence. The conjugation of an oligonucleotide with two dyes and subsequent purification procedures add to the cost of MB probes, thus reducing their application in multiplex formats. Here we demonstrate how one MB probe can be used for the analysis of an arbitrary nucleic acid. The approach takes advantage of two oligonucleotide adaptor strands, each of which contains a fragment complementary to the analyte and a fragment complementary to an MB probe. The presence of the analyte leads to association of MB probe and the two DNA strands in quadripartite complex. The MB probe fluorescently reports the formation of this complex. In this design, the MB does not bind the analyte directly; therefore, the MB sequence is independent of the analyte. In this study one universal MB probe was used to genotype three human polymorphic sites. This approach promises to reduce the cost of multiplex real-time assays and improve the accuracy of single-nucleotide polymorphism genotyping.

  19. Short and Divergent Total Synthesis of (+)-Machaeriol B, (+)-Machaeriol D, (+)-Δ(8)-THC, and Analogues.

    PubMed

    Klotter, Felix; Studer, Armido

    2015-07-13

    Short and highly efficient stereoselective syntheses provide machaeriols and cannabinoids in a divergent approach starting from a common precursor, commercially available (S)-perillic acid. Key features of the novel strategy are a stereospecific palladium-catalyzed decarboxylative arylation and a one-pot sequence comprising a stereoselective hydroboration followed by oxidation or reduction of the corresponding intermediary boranes. The divergent approach is convincingly demonstrated by the five-step syntheses of (+)-machaeriol B, (+)-machaeriol D, and related analogues, and the four-step synthesis of (+)-Δ(8)-THC and an analogue.

  20. Canine amino acid transport system Xc(-): cDNA sequence, distribution and cystine transport activity in lens epithelial cells.

    PubMed

    Maruo, Takuya; Kanemaki, Nobuyuki; Onda, Ken; Sato, Reiichiro; Ichihara, Nobuteru; Ochiai, Hideharu

    2014-04-01

    The cystine transport activity of a lens epithelial cell line originating from a canine mature cataract was investigated. Distinct cystine transport activity was observed, which was inhibited to 28% by extracellular 1 mM glutamate. The cDNA sequences of canine cystine/glutamate exchanger (xCT) and 4F2hc were determined. The predicted amino acid sequences were 527 and 533 amino acid polypeptides, respectively. The amino acid sequences of canine xCT and 4F2hc showed high similarities (>80%) to those of humans. The expression of xCT in the lens epithelial cell line was confirmed by western blot analysis. RT-PCR analysis revealed high level expression only in the brain, and it was below the detectable level in other tissues.

  1. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  3. Amino acid sequence surrounding the chondroitin sulfate attachment site of thrombomodulin regulates chondroitin polymerization.

    PubMed

    Izumikawa, Tomomi; Kitagawa, Hiroshi

    2015-05-01

    Thrombomodulin (TM) is a cell-surface glycoprotein and a critical mediator of endothelial anticoagulant function. TM exists as both a chondroitin sulfate (CS) proteoglycan (PG) form and a non-PG form lacking a CS chain (α-TM); therefore, TM can be described as a part-time PG. Previously, we reported that α-TM bears an immature, truncated linkage tetrasaccharide structure (GlcAβ1-3Galβ1-3Galβ1-4Xyl). However, the biosynthetic mechanism to generate part-time PGs remains unclear. In this study, we used several mutants to demonstrate that the amino acid sequence surrounding the CS attachment site influences the efficiency of chondroitin polymerization. In particular, the presence of acidic residues surrounding the CS attachment site was indispensable for the elongation of CS. In addition, mutants defective in CS elongation did not exhibit anti-coagulant activity, as in the case with α-TM. Together, these data support a model for CS chain assembly in which specific core protein determinants are recognized by a key biosynthetic enzyme involved in chondroitin polymerization.

  4. Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

    PubMed

    Liang, Shaobo; McDonald, Armando G; Coats, Erik R

    2015-11-01

    Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using a set of 0.8 L SBRs with gelatinized PPW at a solids content of 30 to 50 g L(-1) and a solids retention time (SRT) of 2-4 days to optimize yield and productivity. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT. PMID:25708409

  5. A novel phytase with sequence similarity to purple acid phosphatases is expressed in cotyledons of germinating soybean seedlings.

    PubMed

    Hegeman, C E; Grabau, E A

    2001-08-01

    Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases.

  6. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    PubMed

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.
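
    The ladder-sequencing idea in the abstract above can be illustrated with a minimal sketch. It assumes an idealized ladder of neutral monoisotopic peptide masses in which each peak differs from the next by exactly one residue; the function names, the tolerance and the toy peptide are hypothetical and are not taken from the paper.

```python
# Monoisotopic residue masses in Da (standard values; Leu and Ile are isobaric).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L/I": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # mass of H2O added to the summed residues of a free peptide


def peptide_mass(residues):
    """Neutral monoisotopic mass of a peptide given as a list of residue keys."""
    return sum(RESIDUE_MASS[r] for r in residues) + WATER


def read_ladder(masses, tol=0.01):
    """Name the residue lost between consecutive ladder peaks.

    Peaks are sorted from heaviest to lightest; each mass difference is
    matched against the residue table. Unmatched or ambiguous gaps give '?'.
    """
    ordered = sorted(masses, reverse=True)
    residues = []
    for heavier, lighter in zip(ordered, ordered[1:]):
        delta = heavier - lighter
        matches = [aa for aa, m in RESIDUE_MASS.items() if abs(m - delta) <= tol]
        residues.append(matches[0] if len(matches) == 1 else "?")
    return residues


# Toy ladder (hypothetical): G, GA, GAS, GASP, i.e. successive C-terminal losses.
toy_peptide = ["G", "A", "S", "P"]
toy_ladder = [peptide_mass(toy_peptide[:n]) for n in range(1, len(toy_peptide) + 1)]
lost_residues = read_ladder(toy_ladder)
print(lost_residues)  # ['P', 'S', 'A']
```

    Real spectra would also need charge-state handling, peak picking and noise filtering, none of which is attempted here.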

  7. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonaphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  8. The amino-acid sequence of the alpha-crystallin A chains of red kangaroo and Virginia opossum.

    PubMed

    De Jong, W W; Terwindt, E C

    1976-08-16

    The amino acid sequence of the A chain of the eye lens protein alpha-crystallin from the red kangaroo (Macropus rufus) was completely determined by manual Edman degradation of tryptic, thermolytic and cyanogen bromide peptides. The sequence of the alpha-crystallin A chain from the Virginia opossum (Didelphis marsupialis) was deduced from amino acid analyses and partial Edman degradation of peptides. The 173-residue A chains of kangaroo and opossum differ in six positions, whereas comparison with the bovine alpha-crystallin A chain reveals 17 and 22 substitutions, respectively. Most substitutions occur in the COOH-terminal part of the chain.

  9. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    SciTech Connect

    Chang, Soo-Ik; Hammes, G.G.

    1989-11-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homology exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The NADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the NADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.
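
    The distinction the abstract draws between percent identity (67%) and percent matched when conservative substitutions are allowed (78%) can be sketched as follows. The grouping of residues into conservative classes is an assumption made for illustration; the abstract does not say which substitution scheme the authors used, and the toy sequences are hypothetical.

```python
# Assumed conservative-substitution classes (illustrative only).
CONSERVATIVE_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("NQ"), set("ST"), set("AG"),
]


def is_conservative(a, b):
    """True if two different residues fall in the same assumed class."""
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def identity_and_similarity(seq_a, seq_b):
    """Return (fraction identical, fraction identical or conservative).

    Sequences must already be aligned; columns containing a gap ('-')
    are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    compared = identical = similar = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
            similar += 1
        elif is_conservative(a, b):
            similar += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return identical / compared, similar / compared


# Toy aligned fragments (hypothetical, not from either enzyme).
identity, similarity = identity_and_similarity("MKVLDEST", "MRVIDQSA")
print(f"identity = {identity:.0%}, with conservative substitutions = {similarity:.0%}")
```

    In the toy pair, four of eight positions are identical and two more (K/R and L/I) count as conservative, so the second figure is always at least as large as the first.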

  10. Polyvinyl-alcohol-based magnetic beads for rapid and efficient separation of specific or unspecific nucleic acid sequences

    NASA Astrophysics Data System (ADS)

    Oster, Jürgen; Parker, Jeffrey; à Brassard, Lothar

    2001-01-01

    The versatile application of polyvinyl-alcohol-based magnetic M-PVA beads is demonstrated in the separation of genomic DNA, sequence specific nucleic acid purification, and binding of bacteria for subsequent DNA extraction and detection. It is shown that nucleic acids can be obtained in high yield and purity using M-PVA beads, making sample preparation efficient, fast and highly adaptable for automation processes.

  11. Sequence change and phylogenetic signal in muscoid COII DNA sequences.

    PubMed

    Szalanski, Allen L; Owens, Carrie B

    2003-08-01

    The complete DNA sequences of the mtDNA cytochrome oxidase II gene from house fly, Musca domestica, face fly, Musca autumnalis, stable fly, Stomoxys calcitrans, horn fly, Haematobia irritans, and black garbage fly, Hydrotaea aenescens, are reported. The nucleotide sequence codes for a 229 amino acid peptide. The COII sequence is A + T rich (74.1%), with up to 12.3% nucleotide and 8.4% amino acid divergence among the five taxa. Of the 688 nucleotides encoding the gene, 135 nucleotide sites (19.6%) are variable, and 55 (8.0%) are phylogenetically informative. A phylogenetic analysis using three calliphorids as the outgroup taxa indicates that the two haematophagous species, horn fly and stable fly, form a sister group.
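
    The summary statistics quoted in this abstract (A + T content, pairwise divergence, proportion of variable sites) are simple ratios. A minimal sketch follows, assuming pre-aligned sequences and uncorrected pairwise distance; the function names and toy sequences are hypothetical, and the authors' actual software is not stated in the abstract.

```python
def p_distance(seq_a, seq_b):
    """Uncorrected proportion of aligned sites that differ (gap columns skipped)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    compared = differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        differing += a != b
    if compared == 0:
        raise ValueError("no comparable positions")
    return differing / compared


def at_content(seq):
    """Fraction of unambiguous bases that are A or T."""
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no unambiguous bases")
    return sum(c in "AT" for c in bases) / len(bases)


# Toy aligned fragments (hypothetical, not real COII sequence).
toy_a = "ATGAATTTAC"
toy_b = "ATGAACTTTC"
print(f"p-distance = {p_distance(toy_a, toy_b):.1%}")   # 20.0%
print(f"A+T content = {at_content(toy_a):.1%}")         # 80.0%

# Site counts reported in the abstract, expressed as percentages of 688 sites.
variable_pct = 135 / 688
informative_pct = 55 / 688
print(f"variable = {variable_pct:.1%}, informative = {informative_pct:.1%}")  # 19.6%, 8.0%
```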

  12. Pancreatic ribonucleases of mammals with ruminant-like digestion. Amino-acid sequences of hippopotamus and sloth ribonucleases.

    PubMed

    Havinga, J; Beintema, J J

    1980-09-01

    High levels of pancreatic ribonucleases are found in ruminants, species that have a ruminant-like digestion and several species with coecal digestion. Pancreatic ribonucleases from several independently evolved species with ruminant-like digestion were investigated to test a hypothesis that glycosylation of ribonucleases may have some function in species with coecal digestion and that glycosylation of the enzyme may not be advantageous for ruminants. Ribonucleases from the hippopotamus, two-toed sloth and three-toed sloth were isolated by extraction with sulfuric acid and affinity chromatography. Complete amino acid sequences were determined for the ribonucleases from the hippopotamus and two-toed sloth and a partial sequence for the enzyme from the three-toed sloth. The amino acids 75-78 of hippopotamus ribonuclease were positioned by homology with other artiodactyl ribonucleases. In hippopotamus ribonuclease a heterogeneity was found at position 37, half of the molecules containing glutamine and the other half lysine. Hippopotamus ribonuclease differs less from pig and bovine ribonuclease than these differ from each other, because more ancestral characteristics have been retained. Although hippopotamus ribonuclease contains all four Asn-X-Ser/Thr sequences previously found to be glycosylation sites in one or more pancreatic ribonucleases, only the sequence Asn-Met-Thr (34-36) is glycosylated in the variant with glutamine at position 37, while the variant with lysine at this position is carbohydrate-free. Both sloth ribonucleases are completely glycosylated at the sequence Asn-Met-Thr (34-36) with a simple type of carbohydrate chain. The amino acid sequence of two-toed sloth ribonuclease shows some interesting coupled replacements.

  13. Genetic divergence between populations of feral and domestic forms of a mosquito disease vector assessed by transcriptomics

    PubMed Central

    2015-01-01

    Culex pipiens, an invasive mosquito and vector of West Nile virus in the US, has two morphologically indistinguishable forms that differ dramatically in behavior and physiology. Cx. pipiens form pipiens is primarily a bird-feeding temperate mosquito, while the sub-tropical Cx. pipiens form molestus thrives in sewers and feeds on mammals. Because the feral form can diapause during the cold winters but the domestic form cannot, the two Cx. pipiens forms are allopatric in northern Europe and, although viable, hybrids are rare. Cx. pipiens form molestus has spread across all inhabited continents and hybrids of the two forms are common in the US. Here we elucidate the genes and gene families with the greatest divergence rates between these phenotypically diverged mosquito populations, and discuss them in light of their potential biological and ecological effects. After generating and assembling novel transcriptome data for each population, we performed pairwise tests for nonsynonymous divergence (Ka) of homologous coding sequences and examined gene ontology terms that were statistically over-represented in those sequences with the greatest divergence rates. We identified genes involved in digestion (serine endopeptidases), innate immunity (fibrinogens and α-macroglobulins), hemostasis (D7 salivary proteins), olfaction (odorant binding proteins) and chitin binding (peritrophic matrix proteins). By examining molecular divergence between closely related yet phenotypically divergent forms of the same species, our results provide insights into the identity of rapidly-evolving genes between incipient species. Additionally, we found that families of signal transducers, ATP synthases and transcription regulators remained identical at the amino acid level, thus constituting conserved components of the Cx. pipiens proteome. We provide a reference with which to gauge the divergence reported in this analysis by performing a comparison of transcriptome sequences from conspecific

  15. Sperm Bindin Divergence under Sexual Selection and Concerted Evolution in Sea Stars.

    PubMed

    Patiño, Susana; Keever, Carson C; Sunday, Jennifer M; Popovic, Iva; Byrne, Maria; Hart, Michael W

    2016-08-01

    Selection associated with competition among males or sexual conflict between mates can create positive selection for high rates of molecular evolution of gamete recognition genes and lead to reproductive isolation between species. We analyzed coding sequence and repetitive domain variation in the gene encoding the sperm acrosomal protein bindin in 13 diverse sea star species. We found that bindin has a conserved coding sequence domain structure in all 13 species, with several repeated motifs in a large central region that is similar among all sea stars in organization but highly divergent among genera in nucleotide and predicted amino acid sequence. More bindin codons and lineages showed positive selection for high relative rates of amino acid substitution in genera with gonochoric outcrossing adults (and greater expected strength of sexual selection) than in selfing hermaphrodites. That difference is consistent with the expectation that selfing (a highly derived mating system) may moderate the strength of sexual selection and limit the accumulation of bindin amino acid differences. The results implicate both positive selection on single codons and concerted evolution within the repetitive region in bindin divergence, and suggest that both single amino acid differences and repeat differences may affect sperm-egg binding and reproductive compatibility. PMID:27189549

  17. Complete mitochondrial genomes of three neobatrachian anurans: a case study of divergence time estimation using different data and calibration settings.

    PubMed

    Igawa, Takeshi; Kurabayashi, Atsushi; Usuki, Chisako; Fujii, Tamotsu; Sumida, Masayuki

    2008-01-15

    We sequenced the whole mitochondrial (mt) genomes of three neobatrachian species: Japanese tree frog Hyla japonica, Japanese common toad Bufo japonicus, and narrow-mouthed toad Microhyla okinavensis. The gene arrangements of these genomes diverged from that of basal anurans (suborder Archaeobatrachia), but are the same as that of the members of derived frogs (i.e., superfamily Hyloidae and Ranoidae in suborder Neobatrachia), suggesting the one-time occurrence of a gene rearrangement event in an ancestral lineage of derived anurans. Furthermore, several distinct repeat motifs including putative termination-associated sequences (TASs) and conserved sequence blocks (CSBs) were observed in the control regions (CRs) of B. japonicus and H. japonica, while no repeat motifs were found in that of M. okinavensis. Phylogenetic analyses using both nucleotide and amino acid data of mt genes support monophyly of neobatrachians. The estimated divergence time based on amino acid data with multiple reference points suggests that the three living amphibian orders may have originated in the Carboniferous period, and that the divergences of anurans had occurred between the Permian and Tertiary periods. We also checked the influence of the data types and the settings of reference times on divergence time estimation. The resultant divergence times estimated from several datasets and reference time settings suggest that the substitution saturation of nucleotide data may lead to overestimated (i.e., older) branching times, especially for early divergent taxa. We also found a highly accelerated substitution rate in neobatrachian mt genes, and fast substitution possibly resulted in overestimation. To correct this erroneous estimation, it is efficient to apply several reference points among neobatrachians. PMID:17997052

  18. Ribonucleotide reductases: divergent evolution of an ancient enzyme.

    PubMed

    Torrents, Eduard; Aloy, Patrick; Gibert, Isidre; Rodríguez-Trelles, Francisco

    2002-08-01

    Ribonucleotide reductases (RNRs) are uniquely responsible for converting nucleotides to deoxynucleotides in all dividing cells. The three known classes of RNRs operate through a free radical mechanism but differ in the way in which the protein radical is generated. Class I enzymes depend on oxygen for radical generation, class II uses adenosylcobalamin, and the anaerobic class III requires S-adenosylmethionine and an iron-sulfur cluster. Despite their metabolic prominence, the evolutionary origin and relationships between these enzymes remain elusive. This gap in RNR knowledge can, to a major extent, be attributed to the fact that different RNR classes exhibit greatly diverged polypeptide chains, rendering homology assessments inconclusive. Evolutionary studies of RNRs conducted until now have focused on comparison of the amino acid sequence of the proteins, without considering how they fold into space. The present study is an attempt to understand the evolutionary history of RNRs taking into account their three-dimensional structure. We first infer the structural alignment by superposing the equivalent stretches of the three-dimensional structures of representatives of each family. We then use the structural alignment to guide the alignment of all publicly available RNR sequences. Our results support the hypothesis that the three RNR classes diverged from a common ancestor currently represented by the anaerobic class III. Also, lateral transfer appears to have played a significant role in the evolution of this protein family. PMID:12107591

  19. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  20. Detection of Dengue Viral RNA Using a Nucleic Acid Sequence-Based Amplification Assay

    PubMed Central

    Wu, Shuenn-Jue L.; Lee, Eun Mi; Putvatana, Ravithat; Shurtliff, Roxanne N.; Porter, Kevin R.; Suharyono, Wuryadi; Watts, Douglas M.; King, Chwan-Chuen; Murphy, Gerald S.; Hayes, Curtis G.; Romano, Joseph W.

    2001-01-01

    Faster techniques are needed for the early diagnosis of dengue fever and dengue hemorrhagic fever during the acute viremic phase of infection. An isothermal nucleic acid sequence-based amplification (NASBA) assay was optimized to amplify viral RNA of all four dengue virus serotypes by a set of universal primers and to type the amplified products by serotype-specific capture probes. The NASBA assay involved the use of silica to extract viral nucleic acid, which was amplified without thermocycling. The amplified product was detected by a probe-hybridization method that utilized electrochemiluminescence. Using normal human plasma spiked with dengue viruses, the NASBA assay had a detection threshold of 1 to 10 PFU/ml. The sensitivity and specificity of the assay were determined by testing 67 dengue virus-positive and 21 dengue virus-negative human serum or plasma samples. The “gold standard” used for comparison and evaluation was the mosquito C6/36 cell culture assay followed by an immunofluorescent assay. Viral infectivity titers in test samples were also determined by a direct plaque assay in Vero cells. The NASBA assay was able to detect dengue viral RNA in the clinical samples at plaque titers below 25 PFU/ml (the detection limit of the plaque assay). Of the 67 samples found positive by the C6/36 assay, 66 were found positive by the NASBA assay, for a sensitivity of 98.5%. The NASBA assay had a specificity of 100% based on the negative test results for the 21 normal human serum or plasma samples. These results indicate that the NASBA assay is a promising assay for the early diagnosis of dengue infections. PMID:11473994
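
    The sensitivity and specificity figures in this abstract follow directly from the reported counts. A minimal sketch of the arithmetic, treating the C6/36 culture assay as the reference standard as the abstract does (function and variable names are illustrative):

```python
def sensitivity(true_pos, false_neg):
    """Fraction of reference-positive samples the test also calls positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of reference-negative samples the test also calls negative."""
    return true_neg / (true_neg + false_pos)


# Counts from the abstract: 66 of 67 reference-positive samples were detected,
# and all 21 negative samples tested negative.
sens = sensitivity(true_pos=66, false_neg=67 - 66)
spec = specificity(true_neg=21, false_pos=0)
print(f"sensitivity = {sens:.1%}")  # 98.5%
print(f"specificity = {spec:.1%}")  # 100.0%
```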

  1. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    PubMed

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin. PMID:18521668

  2. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk

    PubMed Central

    Meneghel, Julie; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  3. Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence.

    PubMed

    Rao, H B; Zhu, F; Yang, G B; Li, Z R; Chen, Y Z

    2011-07-01

    Sequence-derived structural and physicochemical features have been extensively used for analyzing and predicting structural, functional, expression and interaction profiles of proteins and peptides. PROFEAT has been developed as a web server for computing commonly used features of proteins and peptides from amino acid sequence. To facilitate more extensive studies of protein and peptides, numerous improvements and updates have been made to PROFEAT. We added new functions for computing descriptors of protein-protein and protein-small molecule interactions, segment descriptors for local properties of protein sequences, topological descriptors for peptide sequences and small molecule structures. We also added new feature groups for proteins and peptides (pseudo-amino acid composition, amphiphilic pseudo-amino acid composition, total amino acid properties and atomic-level topological descriptors) as well as for small molecules (atomic-level topological descriptors). Overall, PROFEAT computes 11 feature groups of descriptors for proteins and peptides, and a feature group of more than 400 descriptors for small molecules plus the derived features for protein-protein and protein-small molecule interactions. Our computational algorithms have been extensively tested and used in a number of published works for predicting proteins of specific structural or functional classes, protein-protein interactions, peptides of specific functions and quantitative structure activity relationships of small molecules. PROFEAT is accessible free of charge at http://bidd.cz3.nus.edu.sg/cgi-bin/prof/protein/profnew.cgi.
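
    As an illustration of what a sequence-derived feature looks like, the sketch below computes plain amino acid composition, one of the simplest descriptors of this kind. It is not PROFEAT's code or API; the function name and toy sequence are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def amino_acid_composition(sequence):
    """Return a 20-element feature dict: the fraction of each standard residue.

    Non-standard characters (e.g. 'X', gaps) are ignored.
    """
    counts = Counter(c for c in sequence.upper() if c in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


# Toy sequence (hypothetical).
composition = amino_acid_composition("MKKLLA")
print({aa: round(f, 3) for aa, f in composition.items() if f > 0})
```

    The result is a fixed-length vector regardless of sequence length, which is what makes descriptors like this convenient inputs for statistical or machine-learning models.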

  4. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing

    PubMed Central

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G.

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  5. Draft Genome Sequence of Burkholderia stabilis LA20W, a Trehalose Producer That Uses Levulinic Acid as a Substrate

    PubMed Central

    Sato, Yuya; Koike, Hideaki; Kondo, Susumu; Hori, Tomoyuki; Kanno, Manabu; Kimura, Nobutada; Morita, Tomotake; Kirimura, Kohtaro

    2016-01-01

    Burkholderia stabilis LA20W produces trehalose using levulinic acid (LA) as a substrate. Here, we report the 7.97-Mb draft genome sequence of B. stabilis LA20W, which will be useful in investigations of the enzymes involved in LA metabolism and the mechanism of LA-induced trehalose production. PMID:27491978

  6. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    PubMed

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  7. Draft Genome Sequence of Acetobacter tropicalis Type Strain NBRC16470, a Producer of Optically Pure d-Glyceric Acid

    PubMed Central

    Koike, Hideaki; Sato, Shun; Morita, Tomotake; Fukuoka, Tokuma

    2014-01-01

    Here we report the 3.7-Mb draft genome sequence of Acetobacter tropicalis NBRC16470T, which can produce optically pure d-glyceric acid (d-GA; 99% enantiomeric excess) from raw glycerol feedstock derived from biodiesel fuel production processes. PMID:25523780

  8. Complete genome sequence of Lactobacillus plantarum ZS2058, a probiotic strain with high conjugated linoleic acid production ability.

    PubMed

    Yang, Bo; Chen, Haiqin; Tian, Fengwei; Zhao, Jianxin; Gu, Zhennan; Zhang, Hao; Chen, Yong Q; Chen, Wei

    2015-11-20

    Lactobacillus plantarum ZS2058 was isolated from sauerkraut and shown to synthesize the beneficial metabolite conjugated linoleic acid. The genome contains a 3,197,363-bp chromosome and three plasmids. The sequence will facilitate identification and characterization of the genetic determinants for its putative biological benefits.

  9. Draft Genome Sequence of Cutaneotrichosporon curvatus DSM 101032 (Formerly Cryptococcus curvatus), an Oleaginous Yeast Producing Polyunsaturated Fatty Acids

    PubMed Central

    Hofmeyer, Thomas; Hackenschmidt, Silke; Nadler, Florian; Thürmer, Andrea; Daniel, Rolf

    2016-01-01

    Cutaneotrichosporon curvatus DSM 101032 is an oleaginous yeast that can be isolated from various habitats and is capable of producing substantial amounts of polyunsaturated fatty acids. Here, we present the first draft genome sequence of any C. curvatus species. PMID:27174275

  10. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    PubMed

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-03-03

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes.

  11. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing.

    PubMed

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G; Baylis, Sally A

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  12. Ultra high-throughput nucleic acid sequencing as a tool for virus discovery in the turkey gut.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recently, the use of the next generation of nucleic acid sequencing technology (i.e., 454 pyrosequencing, as developed by Roche/454 Life Sciences) has allowed an in-depth look at the uncultivated microorganisms present in complex environmental samples, including samples with agricultural importance....

  14. Gene duplication and divergence affecting drug content in Cannabis sativa.

    PubMed

    Weiblen, George D; Wenger, Jonathan P; Craft, Kathleen J; ElSohly, Mahmoud A; Mehmedic, Zlatko; Treiber, Erin L; Marks, M David

    2015-12-01

    Cannabis sativa is an economically important source of durable fibers, nutritious seeds, and psychoactive drugs but few economic plants are so poorly understood genetically. Marijuana and hemp were crossed to evaluate competing models of cannabinoid inheritance and to explain the predominance of tetrahydrocannabinolic acid (THCA) in marijuana compared with cannabidiolic acid (CBDA) in hemp. Individuals in the resulting F2 population were assessed for differential expression of cannabinoid synthase genes and were used in linkage mapping. Genetic markers associated with divergent cannabinoid phenotypes were identified. Although phenotypic segregation and a major quantitative trait locus (QTL) for the THCA/CBDA ratio were consistent with a simple model of codominant alleles at a single locus, the diversity of THCA and CBDA synthase sequences observed in the mapping population, the position of enzyme coding loci on the map, and patterns of expression suggest multiple linked loci. Phylogenetic analysis further suggests a history of duplication and divergence affecting drug content. Marijuana is distinguished from hemp by a nonfunctional CBDA synthase that appears to have been positively selected to enhance psychoactivity. An unlinked QTL for cannabinoid quantity may also have played a role in the recent escalation of drug potency. PMID:26189495

  16. Amino acid sequence of Coprinus macrorhizus peroxidase and cDNA sequence encoding Coprinus cinereus peroxidase. A new family of fungal peroxidases.

    PubMed

    Baunsgaard, L; Dalbøge, H; Houen, G; Rasmussen, E M; Welinder, K G

    1993-04-01

    Sequence analysis and cDNA cloning of Coprinus peroxidase (CIP) were undertaken to expand the understanding of the relationships of structure, function and molecular genetics of the secretory heme peroxidases from fungi and plants. Amino acid sequencing of Coprinus macrorhizus peroxidase, and cDNA sequencing of Coprinus cinereus peroxidase showed that the mature proteins are identical in amino acid sequence, 343 residues in size and preceded by a 20-residue signal peptide. Their likely identity to peroxidase from Arthromyces ramosus is discussed. CIP has an 8-residue, glycine-rich N-terminal extension blocked with a pyroglutamate residue which is absent in other fungal peroxidases. The presence of pyroglutamate, formed by cyclization of glutamine, and the finding of a minor fraction of a variant form lacking the N-terminal residue, indicate that signal peptidase cleavage is followed by further enzymic processing. CIP is 40-45% identical in amino-acid sequence to 11 lignin peroxidases from four fungal species, and 42-43% identical to the two known Mn-peroxidases. Like these white-rot fungal peroxidases, CIP has an additional segment of approximately 40 residues at the C-terminus which is absent in plant peroxidases. Although CIP is much more similar to horseradish peroxidase (HRP C) in substrate specificity, specific activity and pH optimum than to white-rot fungal peroxidases, the sequences of CIP and HRP C showed only 18% identity. Hence, CIP qualifies as the first member of a new family of fungal peroxidases. The nine invariant residues present in all plant, fungal and bacterial heme peroxidases are also found in CIP. The present data support the hypothesis that only one chromosomal CIP gene exists. In contrast, a large number of secretory plant and fungal peroxidases are expressed from several peroxidase gene clusters. Analyses of three batches of CIP protein and of 49 CIP clones revealed the existence of only two highly similar alleles indicating less

  17. Cry1Aa binding to the cadherin receptor does not require conserved amino acid sequences in the domain II loops

    PubMed Central

    Fujii, Yuki; Tanaka, Shiho; Otsuki, Manami; Hoshino, Yasushi; Morimoto, Chinatsu; Kotani, Takuya; Harashima, Yuko; Endo, Haruka; Yoshizawa, Yasutaka; Sato, Ryoichi

    2012-01-01

    Characterizing the binding mechanism of Bt (Bacillus thuringiensis) Cry toxin to the cadherin receptor is indispensable to understanding the specific insecticidal activity of this toxin. To this end, we constructed 30 loop mutants by randomly inserting four serial amino acids covering all four receptor binding loops (loops α8, 1, 2 and 3) and analysed their binding affinities for Bombyx mori cadherin receptors via Biacore. High binding affinities were confirmed for all 30 mutants containing loop sequences that differed from those of wild-type. Insecticidal activities were confirmed in at least one mutant from loops 1, 2 and 3, suggesting that there is no critical amino acid sequence for the binding of the four loops to BtR175. When two mutations at different loops were integrated into one molecule, no reduction in binding affinity was observed compared with wild-type sequences. Based on these results, we discussed the binding mechanism of Cry toxin to cadherin protein. PMID:23145814

  18. The amino acid sequence of protein SCMK-B2C from the high-sulphur fraction of wool keratin.

    PubMed

    Elleman, T C

    1972-08-01

    1. The amino acid sequence of a protein from the reduced and carboxymethylated high-sulphur fraction of wool has been determined. 2. The sequence of this S-carboxymethylkerateine (SCMK-B2C) of 151 amino acid residues displays much internal homology and an unusual residue distribution. Thus a ten-residue sequence occurs four times near the N-terminus and five times near the C-terminus with few changes. These regions contain much of the molecule's half-cystine, whereas between them there is a region of 19 residues that are mainly small and devoid of cystine and proline. 3. Certain models of the wool fibre based on its mechanical and physical properties propose a matrix of small compact globular units linked together to form beaded chains. The unusual distribution of the component residues of protein SCMK-B2C suggests structures in the wool-fibre matrix compatible with certain features of the proposed models.

  19. Sequence-Specific Recognition of MicroRNAs and Other Short Nucleic Acids with Solid-State Nanopores.

    PubMed

    Zahid, Osama K; Wang, Fanny; Ruzicka, Jan A; Taylor, Ethan W; Hall, Adam R

    2016-03-01

    The detection and quantification of short nucleic acid sequences has many potential applications in studying biological processes, monitoring disease initiation and progression, and evaluating environmental systems, but is challenging by nature. We present here an assay based on the solid-state nanopore platform for the identification of specific sequences in solution. We demonstrate that hybridization of a target nucleic acid with a synthetic probe molecule enables discrimination between duplex and single-stranded molecules with high efficacy. Our approach requires limited preparation of samples and yields an unambiguous translocation event rate enhancement that can be used to determine the presence and abundance of a single sequence within a background of nontarget oligonucleotides. PMID:26824296

  20. Sequence of cDNA for rat cystathionine gamma-lyase and comparison of deduced amino acid sequence with related Escherichia coli enzymes.

    PubMed Central

    Erickson, P F; Maxwell, I H; Su, L J; Baumann, M; Glode, L M

    1990-01-01

    A cDNA clone for cystathionine gamma-lyase was isolated from a rat cDNA library in lambda gt11 by screening with a monospecific antiserum. The identity of this clone, containing 600 bp proximal to the 3'-end of the gene, was confirmed by positive hybridization selection. Northern-blot hybridization showed the expected higher abundance of the corresponding mRNA in liver than in brain. Two further cDNA clones from a plasmid pcD library were isolated by colony hybridization with the first clone and were found to contain inserts of 1600 and 1850 bp. One of these was confirmed as encoding cystathionine gamma-lyase by hybridization with two independent pools of oligodeoxynucleotides corresponding to partial amino acid sequence information for cystathionine gamma-lyase. The other clone (estimated to represent all but 8% of the 5'-end of the mRNA) was sequenced and its deduced amino acid sequence showed similarity to those of the Escherichia coli enzymes cystathionine beta-lyase and cystathionine gamma-synthase throughout its length, especially to that of the latter. PMID:2201285

  1. Sequence dependent N-terminal rearrangement and degradation of peptide nucleic acid (PNA) in aqueous solution

    NASA Technical Reports Server (NTRS)

    Eriksson, M.; Christensen, L.; Schmidt, J.; Haaima, G.; Orgel, L.; Nielsen, P. E.

    1998-01-01

    The stability of the PNA (peptide nucleic acid) thymine monomer {N-[2-(thymin-1-ylacetyl)]-N-(2-aminoaminoethyl)glycine} and those of various PNA oligomers (5-8-mers) have been measured at room temperature (20 °C) as a function of pH. The thymine monomer undergoes N-acyl transfer rearrangement with a half-life of 34 days at pH 11 as analyzed by 1H NMR; and two reactions, the N-acyl transfer and a sequential degradation, are found by HPLC analysis to occur at measurable rates for the oligomers at pH 9 or above. Dependent on the amino-terminal sequence, half-lives of 350 h to 163 days were found at pH 9. At pH 12 the half-lives ranged from 1.5 h to 21 days. The results are discussed in terms of PNA as a gene therapeutic drug as well as a possible prebiotic genetic material.

  2. The cDNA-derived amino acid sequence of hemoglobin II from Lucina pectinata.

    PubMed

    Torres-Mercado, Elineth; Renta, Jessicca Y; Rodríguez, Yolanda; López-Garriga, Juan; Cadilla, Carmen L

    2003-11-01

    Hemoglobin II from the clam Lucina pectinata is an oxygen-reactive protein with a unique structural organization in the heme pocket involving residues Gln65 (E7), Tyr30 (B10), Phe44 (CD1), and Phe69 (E11). We employed the reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE) methods to synthesize various cDNA(HbII). An initial 300-bp cDNA clone was amplified from total RNA by RT-PCR using degenerate oligonucleotides. Gene-specific primers derived from the HbII-partial cDNA sequence were used to obtain the 5' and 3' ends of the cDNA by RACE. The length of the HbII cDNA, estimated from overlapping clones, was approximately 2114 bases. Northern blot analysis revealed that the mRNA size of HbII agrees with the estimated size using cDNA data. The coding region of the full-length HbII cDNA codes for 151 amino acids. The calculated molecular weight of HbII, including the heme group and acetylated N-terminal residue, is 17,654.07 Da.

  3. Amino acid sequence of rabbit kidney neutral endopeptidase 24.11 (enkephalinase) deduced from a complementary DNA.

    PubMed Central

    Devault, A; Lazure, C; Nault, C; Le Moual, H; Seidah, N G; Chrétien, M; Kahn, P; Powell, J; Mallet, J; Beaumont, A

    1987-01-01

    Neutral endopeptidase (EC 3.4.24.11) is a major constituent of kidney brush border membranes. It is also present in the brain where it has been shown to be involved in the inactivation of opioid peptides, methionine- and leucine-enkephalins. For this reason this enzyme is often called 'enkephalinase'. In order to characterize the primary structure of the enzyme, oligonucleotide probes were designed from partial amino acid sequences and used to isolate clones from kidney cDNA libraries. Sequencing of the cDNA inserts revealed the complete primary structure of the enzyme. Neutral endopeptidase consists of 750 amino acids. It contains a short N-terminal cytoplasmic domain (27 amino acids), a single membrane-spanning segment (23 amino acids) and an extracellular domain that comprises most of the protein mass. The comparison of the primary structure of neutral endopeptidase with that of thermolysin, a bacterial Zn-metallopeptidase, indicates that most of the amino acid residues involved in Zn coordination and catalytic activity in thermolysin are found within highly homologous sequences in neutral endopeptidase. PMID:2440677

  4. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  5. Molecular cloning, coding nucleotides and the deduced amino acid sequence of P-450BM-1 from Bacillus megaterium.

    PubMed

    He, J S; Ruettinger, R T; Liu, H M; Fulco, A J

    1989-12-22

    The gene encoding barbiturate-inducible cytochrome P-450BM-1 from Bacillus megaterium ATCC 14581 has been cloned and sequenced. An open reading frame in the 1.9 kb of cloned DNA correctly predicted the NH2-terminal sequence of P-450BM-1 previously determined by protein sequencing, and, in toto, predicted a polypeptide of 410 amino acid residues with an Mr of 47,439. The sequence is most similar to that of P-450CAM from Pseudomonas putida, but the similarity is less than 27%, so that P-450BM-1 clearly belongs to a new P-450 gene family, distinct especially from that of the P-450 domain of P-450BM-3, a barbiturate-inducible single polypeptide cytochrome P-450:NADPH-P-450 reductase from the same strain of B. megaterium (Ruettinger, R.T., Wen, L.-P. and Fulco, A.J. (1989) J. Biol. Chem. 264, 10987-10995). PMID:2597681

  6. Sample Prep, Workflow Automation and Nucleic Acid Fractionation for Next Generation Sequencing

    SciTech Connect

    Roskey, Mark

    2010-06-03

    Mark Roskey of Caliper LifeSciences discusses how the company's technologies fit into the next generation sequencing workflow on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  7. Evolution of vertebrate IgM: complete amino acid sequence of the constant region of Ambystoma mexicanum mu chain deduced from cDNA sequence.

    PubMed

    Fellah, J S; Wiles, M V; Charlemagne, J; Schwager, J

    1992-10-01

    cDNA clones coding for the constant region of the Mexican axolotl (Ambystoma mexicanum) mu heavy immunoglobulin chain were selected from total spleen RNA, using a cDNA polymerase chain reaction technique. The specific 5'-end primer was an oligonucleotide homologous to the JH segment of Xenopus laevis mu chain. One of the clones, JHA/3, corresponded to the complete constant region of the axolotl mu chain, consisting of a 1362-nucleotide sequence coding for a polypeptide of 454 amino acids followed in the 3' direction by a 179-nucleotide untranslated region and a polyA+ tail. The axolotl C mu is divided into four typical domains (C mu 1-C mu 4) and can be aligned with the Xenopus C mu with an overall identity of 56% at the nucleotide level. Percent identities were particularly high between C mu 1 (59%) and C mu 4 (71%). The C-terminal 20-amino acid segment which constitutes the secretory part of the mu chain is strongly homologous to the equivalent sequences of chondrichthyans and of other tetrapods, including a conserved N-linked oligosaccharide, the penultimate cysteine and the C-terminal lysine. The four C mu domains of 13 vertebrate species ranging from chondrichthyans to mammals were aligned and compared at the amino acid level. The significant number of mu-specific residues which are conserved in each of the four C mu domains argues for a continuous line of evolution of the vertebrate mu chain. This notion was confirmed by the ability to reconstitute a consistent vertebrate evolutionary tree based on the phylogenetic parsimony analysis of the C mu 4 sequences. PMID:1382992

  8. Tsallis and Rényi divergences of generalized Jacobi polynomials

    NASA Astrophysics Data System (ADS)

    Sfetcu, Răzvan-Cornel

    2016-10-01

    We consider some discrete probability distribution $\psi_n(x) = (\psi_{n,1}(x), \psi_{n,2}(x), \ldots, \psi_{n,n}(x))$. We show that, under suitable conditions, the sequence of Tsallis divergences $(D_T(\psi_n(x)))_n$ and the sequence of Rényi divergences $(D_R(\psi_n(x)))_n$ are convergent for any $x \in (-1, 1)$.
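
For readers unfamiliar with the two quantities, the sketch below evaluates the commonly used textbook forms of the Rényi and Tsallis divergences between two discrete distributions p and q. The abstract does not state the reference distribution or the exact definitions used, so the formulas, the function names and the order parameter `alpha` here are assumptions for illustration and may differ from the paper's.

```python
import math


def _power_sum(p, q, alpha):
    """Compute sum_i p_i**alpha * q_i**(1 - alpha) after basic validation."""
    if len(p) != len(q):
        raise ValueError("p and q must have the same length")
    if alpha <= 0 or alpha == 1:
        raise ValueError("alpha must be positive and different from 1")
    if any(qi <= 0 for qi in q):
        # Assumption of this sketch: q has full support.
        raise ValueError("q must be strictly positive")
    return sum(pi ** alpha * qi ** (1 - alpha) for pi, qi in zip(p, q) if pi > 0)


def renyi_divergence(p, q, alpha):
    """Renyi divergence of order alpha: log(sum p^a q^(1-a)) / (a - 1)."""
    return math.log(_power_sum(p, q, alpha)) / (alpha - 1)


def tsallis_divergence(p, q, alpha):
    """Tsallis divergence of order alpha: (sum p^a q^(1-a) - 1) / (a - 1)."""
    return (_power_sum(p, q, alpha) - 1) / (alpha - 1)


if __name__ == "__main__":
    p = [0.5, 0.5]
    q = [0.25, 0.75]
    print(renyi_divergence(p, q, 2.0), tsallis_divergence(p, q, 2.0))
```

Both functions share the same power sum and differ only in whether a logarithm is applied, so under these definitions both are zero when p equals q.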

  9. Genetic divergence of tomato ringspot virus.

    PubMed

    Rivera, Lucia; Zamorano, Alan; Fiore, Nicola

    2016-05-01

    Tomato ringspot virus (ToRSV) has been detected in Chile, causing economically important diseases in a wide range of hosts. A ToRSV isolate was obtained from raspberry cv Heritage (Rasp-CL) showing leaf yellowing and stunting. The complete genome of Rasp-CL was sequenced by deep sequencing. The Rasp-CL RNA1 sequence shared 97.4 % nucleotide sequence identity with divergent RNA1 of isolate Rasp1-2014, while Rasp-CL RNA2 showed high divergence from all four isolates available in the database, sharing only 63.9-72.7 % nucleotide sequence identity. This difference was mainly based on the X4 coding region, which has been reported to be a high-variability region. Moreover, based on differences in the X4 region, three Rasp-CL RNA2 variants of different length were identified in the same host. One putative recombination event was identified between the Rasp-CL and GYV-2014 X4 genes. Phylogenetic analysis suggested that ToRSV isolates with currently available sequences form three distinct groups. Our results suggest that, for an accurate phylogenetic classification of ToRSV, it is necessary to obtain sequences of both RNAs. This is the first report of a complete ToRSV genome sequence from South America.
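
Figures such as "97.4 % nucleotide sequence identity" are typically derived from a pairwise alignment. The sketch below shows one simple way to compute such a percentage from two sequences that have already been aligned; it is an illustration only, not the method used in the study. The function name `percent_identity` and the decision to skip gapped columns are assumptions, and other tools handle gaps differently.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the ungapped columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, gaps as `gap`).
    Columns where either sequence has a gap are skipped, which is one
    of several conventions in use.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy alignment, not real ToRSV sequence data.
    print(percent_identity("ACGT-A", "ACCTTA"))
```

Because the denominator depends on how gaps are treated, identity values from different programs are only comparable when the same convention is used.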

  11. Sequence Comparison and Phylogeny of Nucleotide Sequence of Coat Protein and Nucleic Acid Binding Protein of a Distinct Isolate of Shallot virus X from India.

    PubMed

    Majumder, S; Baranwal, V K

    2011-06-01

    Shallot virus X (ShVX), a type species in the genus Allexivirus of the family Alphaflexiviridae, has been associated with shallot plants in India and other shallot-growing countries such as Russia, Germany, the Netherlands, and New Zealand. The coat protein (CP) and nucleic acid binding protein (NB) regions of the virus were obtained by reverse transcriptase polymerase chain reaction from scale leaves of shallot bulbs. The partial cDNA contained two open reading frames encoding proteins of molecular weights of 28.66 and 14.18 kDa belonging to the Flexi_CP super-family and the viral NB super-family, respectively. The percent identity and phylogenetic analysis of amino acid sequences of the CP and NB regions of the virus associated with shallot indicated that it was a distinct isolate of ShVX. PMID:23637504

  12. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    PubMed

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of the Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only the DM20 isoform is expressed in Latimeria. The structure of the proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that the Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  13. Amino acid sequence and structural properties of protein p12, an African swine fever virus attachment protein.

    PubMed Central

    Alcamí, A; Angulo, A; López-Otín, C; Muñoz, M; Freije, J M; Carrascosa, A L; Viñuela, E

    1992-01-01

    The gene encoding the African swine fever virus protein p12, which is involved in virus attachment to the host cell, has been mapped and sequenced in the genome of the Vero-adapted virus strain BA71V. The determination of the N-terminal amino acid sequence and the hybridization of oligonucleotide probes derived from this sequence to cloned restriction fragments allowed the mapping of the gene in fragment EcoRI-O, located in the central region of the viral genome. The DNA sequence of an EcoRI-XbaI fragment showed an open reading frame which is predicted to encode a polypeptide of 61 amino acids. The expression of this open reading frame in rabbit reticulocyte lysates and in Escherichia coli gave rise to a 12-kDa polypeptide that was immunoprecipitated with a monoclonal antibody specific for protein p12. The hydrophilicity profile indicated the existence of a stretch of 22 hydrophobic residues in the central part that may anchor the protein in the virus envelope. Three forms of the protein with apparent molecular masses of 17, 12, and 10 kDa in sodium dodecyl sulfate-polyacrylamide gel electrophoresis have been observed, depending on the presence of 2-mercaptoethanol and alkylation with 4-vinylpyridine, indicating that disulfide bonds are responsible for the multimerization of the protein. This result was in agreement with the existence of a cysteine-rich domain in the C-terminal region of the predicted amino acid sequence. The protein was synthesized at late times of infection, and no posttranslational modifications such as glycosylation, phosphorylation, or fatty acid acylation were detected. PMID:1583732

  14. Cloning and sequencing of the Bet v 1-homologous allergen Fra a 1 in strawberry (Fragaria ananassa) shows the presence of an intron and little variability in amino acid sequence.

    PubMed

    Musidlowska-Persson, Anna; Alm, Rikard; Emanuelsson, Cecilia

    2007-02-01

    The Fra a 1 allergen in strawberry (Fragaria ananassa) is homologous to the major birch pollen allergen Bet v 1, which has numerous isoforms differing in terms of amino acid sequence and immunological impact. To map the extent of sequence differences in the Fra a 1 allergen, PCR cloning and sequencing was applied. Several genomic sequences of Fra a 1, with a length of either 584, 591 or 594 nucleotides, were obtained from three different strawberry varieties. All contained one intron, with the length of either 101 or 110 nucleotides. By sequencing 30 different clones, eight different DNA sequences were obtained, giving in total five potential Fra a 1 protein isoforms, with high sequence similarity (>97% sequence identity) and only seven positions of amino acid variability, which were largely confirmed by mass spectrometry of expressed proteins. We conclude that the sequence variability in the strawberry allergen Fra a 1 is small, within and between strawberry varieties, and that multiple spots, previously detected in 2DE, are presumably due to differences in post-translational modification rather than differences in amino acid sequence. The most abundant Fra a 1 isoform sequence, recombinantly expressed in Escherichia coli after removal of the intron, was recognized by IgE from strawberry allergic patients. It cross-reacted with antibodies to Bet v 1 and the homologous apple allergen Mal d 1 (61 and 78% sequence identity, respectively), and will be used in further analyses of variation in Fra a 1-expression.

  15. Amino acid sequence of an intracellular, phosphate-starvation-induced ribonuclease from cultured tomato (Lycopersicon esculentum) cells.

    PubMed

    Löffler, A; Glund, K; Irie, M

    1993-06-15

    The primary structure of an intracellular ribonuclease (RNase LX) from cultured tomato (Lycopersicon esculentum) cells has been determined. Previous studies have shown that the protein is located inside the tomato cells but outside the vacuoles and that its synthesis is induced after depleting the cells of phosphate [Löffler, A., Abel, S., Jost, W., Beintema, J. J., Glund, K. (1992) Plant Physiol. 98, 1472-1478]. Sequence analysis was carried out by analysis of peptides isolated after enzymatic and chemical cleavage of the protein. RNase LX consists of 213 amino acids and has a molecular mass of 24300 Da and an isoelectric point of 5.33. The enzyme contains 10 half-cystines, and no potential N-glycosylation sites are detectable in the sequence. Compared with an extracellular tomato RNase (RNase LE), which is also phosphate-regulated and whose amino acid sequence was recently established [Jost, W., Bak, H., Glund, K., Terpstra, P. & Beintema, J. J. (1991) Eur. J. Biochem. 198, 1-6], RNase LX has 60% of its amino acids identical and in identical positions, revealing a high degree of similarity between the two proteins. In contrast to RNase LE, RNase LX has a C-terminal extension of nine amino acids. The C-terminal tetrapeptide HDEF may be a retention signal of the protein in the endoplasmic reticulum. PMID:8319673

  16. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    PubMed

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce L-lactic acid. The use of this strain is highly desirable for economical L-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that the mundticin-encoding genes on plasmid pQY024 are functional. For lactic acid fermentation, two gene clusters were identified: one involved in the initial metabolism of xylose and uptake of pentose, and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.

  17. Rational design of translational pausing without altering the amino acid sequence dramatically promotes soluble protein expression: a strategic demonstration.

    PubMed

    Chen, Wei; Jin, Jingjie; Gu, Wei; Wei, Bo; Lei, Yun; Xiong, Sheng; Zhang, Gong

    2014-11-10

    The production of many pharmaceutical and industrial proteins in prokaryotic hosts is hindered by the insolubility of industrial expression products resulting from misfolding. Even with a correct