Sample records for genes encoding predicted

  1. A deep auto-encoder model for gene expression prediction.

    PubMed

    Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua

    2017-11-17

    Gene expression is a key intermediate level through which genotypes lead to a particular trait. Gene expression is affected by various factors, including the genotypes of genetic variants. With the aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how well genetic variants contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic dataset with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models, including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely from genotypes align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models will play a bigger role in modeling and interpreting genomics.
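
    The denoising auto-encoder step named in this abstract can be illustrated with a minimal sketch. It is not the authors' MLP-SAE implementation: it trains a single hidden layer (not a stack), omits the multilayer perceptron and dropout, and runs on synthetic binary "genotypes". The function names, layer size, noise level and learning rate are all assumptions chosen for illustration.

```python
import numpy as np

def train_denoising_autoencoder(X, n_hidden=8, noise=0.2, lr=0.5, epochs=300, seed=0):
    """Train a one-hidden-layer denoising auto-encoder by gradient descent.

    Returns the parameters (W1, b1, W2, b2) and the per-epoch reconstruction loss.
    All hyperparameters are illustrative assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, n_hidden))
    b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.1, (n_hidden, d))
    b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        # Corrupt the input with masking noise: zero out a random fraction of entries.
        mask = rng.random(X.shape) > noise
        Xc = X * mask
        H = np.tanh(Xc @ W1 + b1)      # hidden representation of the corrupted input
        R = H @ W2 + b2                # linear reconstruction
        err = R - X                    # the target is the clean, uncorrupted input
        losses.append(float(np.mean(err ** 2)))
        # Backpropagate the mean squared reconstruction error.
        gR = 2.0 * err / err.size
        gW2 = H.T @ gR
        gb2 = gR.sum(axis=0)
        gH = gR @ W2.T
        gZ = gH * (1.0 - H ** 2)       # derivative of tanh
        gW1 = Xc.T @ gZ
        gb1 = gZ.sum(axis=0)
        W1 -= lr * gW1
        b1 -= lr * gb1
        W2 -= lr * gW2
        b2 -= lr * gb2
    return (W1, b1, W2, b2), losses

def encode(X, params):
    """Map clean inputs to the learned hidden features."""
    W1, b1, _, _ = params
    return np.tanh(X @ W1 + b1)

# Synthetic stand-in for a genotype matrix: 100 samples x 20 binary markers.
demo_rng = np.random.default_rng(1)
X_demo = demo_rng.integers(0, 2, size=(100, 20)).astype(float)
demo_params, demo_losses = train_denoising_autoencoder(X_demo)
print(demo_losses[0], demo_losses[-1])
```

    In a pipeline of the kind the abstract describes, the output of `encode` would feed a supervised regression model that predicts expression values; that stage is left out here.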

  2. Identifying metabolic enzymes with multiple types of association evidence

    PubMed Central

    Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M

    2006-01-01

    Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which cannot be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding a specific metabolic function based on the local structure of the metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate the predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as the top candidate in 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130
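
    The "within the top 10" and "top candidate" figures quoted above are top-k recovery rates. The sketch below shows how such a rate is computed from the rank of each true gene among its candidates. The function name and the example ranks are hypothetical and are not data from the study.

```python
def top_k_recovery(ranks, k=10):
    """Fraction of true genes whose rank (1 = best candidate) is within the top k."""
    if not ranks:
        raise ValueError("ranks must not be empty")
    return sum(1 for r in ranks if r <= k) / len(ranks)

# Hypothetical ranks of five known enzyme-encoding genes among all candidates.
example_ranks = [1, 3, 12, 1, 250]
print(top_k_recovery(example_ranks, k=10))  # share of genes ranked in the top 10
print(top_k_recovery(example_ranks, k=1))   # share of genes ranked first
```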

  3. Transcriptomic analysis of Arabidopsis developing stems: a close-up on cell wall genes

    PubMed Central

    Minic, Zoran; Jamet, Elisabeth; San-Clemente, Hélène; Pelletier, Sandra; Renou, Jean-Pierre; Rihouey, Christophe; Okinyo, Denis PO; Proux, Caroline; Lerouge, Patrice; Jouanin, Lise

    2009-01-01

    Background Different strategies (genetics, biochemistry, and proteomics) can be used to study proteins involved in cell biogenesis. The availability of the complete sequences of several plant genomes allowed the development of transcriptomic studies. Although the expression patterns of some Arabidopsis thaliana genes involved in cell wall biogenesis were identified at different physiological stages, detailed microarray analysis of plant cell wall genes has not been performed on any plant tissues. Using transcriptomic and bioinformatic tools, we studied the regulation of cell wall genes in Arabidopsis stems, i.e. genes encoding proteins involved in cell wall biogenesis and genes encoding secreted proteins. Results Transcriptomic analyses of stems were performed at three different developmental stages, i.e., young stems, intermediate stage, and mature stems. Many genes involved in the synthesis of cell wall components such as polysaccharides and monolignols were identified. A total of 345 genes encoding predicted secreted proteins with moderate or high levels of transcripts were analyzed in detail. The encoded proteins were distributed into 8 classes, based on the presence of predicted functional domains. Proteins acting on carbohydrates and proteins of unknown function constituted the two most abundant classes. Other proteins were proteases, oxido-reductases, proteins with interacting domains, proteins involved in signalling, and structural proteins. Particularly high levels of expression were established for genes encoding pectin methylesterases, germin-like proteins, arabinogalactan proteins, fasciclin-like arabinogalactan proteins, and structural proteins. Finally, the results of these transcriptomic analyses were compared with those obtained through a cell wall proteomic analysis from the same material. Only a small proportion of genes identified by previous proteomic analyses were identified by transcriptomics. Conversely, only a few proteins encoded by genes having moderate or high levels of transcripts were identified by proteomics. Conclusion Analysis of the genes predicted to encode cell wall proteins revealed that about 345 genes had moderate or high levels of transcripts. Among them, we identified many new genes possibly involved in cell wall biogenesis. The discrepancies observed between results of this transcriptomic study and a previous proteomic study on the same material revealed post-transcriptional mechanisms of regulation of expression of genes encoding cell wall proteins. PMID:19149885

  4. Molecular cloning and characterization of alpha-galactosidase gene from Glaciozyma antarctica

    NASA Astrophysics Data System (ADS)

    Moheer, Reyad Qaed Al; Bakar, Farah Diba Abu; Murad, Abdul Munir Abdul

    2015-09-01

    Psychrophilic enzymes are proteins produced by psychrophilic organisms and have recently come into the limelight for industrial applications. A gene encoding α-galactosidase from a psychrophilic yeast, Glaciozyma antarctica PI12, which belongs to glycoside hydrolase family 27, was isolated and analyzed using several bioinformatic tools. The cDNA of the gene, 1,404 bp in size, encodes a protein of 467 amino acid residues. The predicted molecular weight of the protein was 48.59 kDa, and hence we named the gene encoding α-galactosidase GAL48. We found that the predicted protein sequence possesses a signal peptide sequence and is highly conserved among fungal α-galactosidases.
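
    The two sizes quoted in this record agree if the 1,404-bp figure is read as an open reading frame that includes the stop codon. That reading is an assumption, since the abstract does not say so. The helper below is a hypothetical illustration of the arithmetic.

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an open reading frame of orf_bp base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("an open reading frame must be a multiple of 3 bp")
    codons = orf_bp // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if includes_stop else codons

# 1,404 bp -> 468 codons -> 467 residues once the stop codon is excluded.
print(protein_length_from_orf(1404))
```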

  5. Cloning and sequencing the genes encoding goldfish and carp ependymin.

    PubMed

    Adams, D S; Shashoua, V E

    1994-04-20

    Ependymins (EPNs) are brain glycoproteins thought to function in optic nerve regeneration and long-term memory consolidation. To date, epn genes have been characterized in two orders of teleost fish. In this study, polymerase chain reactions (PCR) were used to amplify the complete 1.6-kb epn genes, gf-I and cc-I, from genomic DNA of Cypriniformes, goldfish and carp, respectively. Amplified bands were cloned and sequenced. Each gene consists of six exons and five introns. The exon portion of gf-I encodes a predicted 215-amino-acid (aa) protein previously characterized as GF-I, while cc-I encodes a predicted 215-aa protein 95% homologous to GF-I.

  6. Complete genome sequence of Nitrosospira multiformis, an ammonia-oxidizing bacterium from the soil environment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Norton, Jeanette M.; Klotz, Martin G; Stein, Lisa Y

    2008-01-01

    The complete genome of the ammonia-oxidizing bacterium, Nitrosospira multiformis (ATCC 25196T), consists of a circular chromosome and three small plasmids totaling 3,234,309 bp and encoding 2827 putative proteins. Of these, 2026 proteins have predicted functions and 801 are without conserved functional domains, yet 747 of these have similarity to other predicted proteins in databases. Gene homologs from Nitrosomonas europaea and N. eutropha were the best match for 42% of the predicted genes in N. multiformis. The genome contains three nearly identical copies of amo and hao gene clusters as large repeats. Distinguishing features compared to N. europaea include: the presence of gene clusters encoding urease and hydrogenase, a RuBisCO-encoding operon of distinctive structure and phylogeny, and a relatively small complement of genes related to Fe acquisition. Systems for synthesis of a pyoverdine-like siderophore and for acyl-homoserine lactone were unique to N. multiformis among the sequenced AOB genomes. Gene clusters encoding proteins associated with outer membrane and cell envelope functions including transporters, porins, exopolysaccharide synthesis, capsule formation and protein sorting/export were abundant. Numerous sensory transduction and response regulator gene systems directed towards sensing of the extracellular environment are described. Gene clusters for glycogen, polyphosphate and cyanophycin storage and utilization were identified providing mechanisms for meeting energy requirements under substrate-limited conditions. The genome of N. multiformis encodes the core pathways for chemolithoautotrophy along with adaptations for surface growth and survival in soil environments.
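
    The annotation counts in this record can be cross-checked with simple arithmetic: the two categories should add up to the total, and the proteins with no database similarity follow by subtraction. The function and key names below are hypothetical; only the four input numbers come from the abstract.

```python
def annotation_breakdown(total, with_function, without_domains, similar_to_db):
    """Check that annotation categories sum to the total and derive the remainder."""
    if with_function + without_domains != total:
        raise ValueError("annotation categories do not sum to the total")
    return {
        "fraction_with_function": with_function / total,
        "fraction_without_domains": without_domains / total,
        # Proteins lacking conserved domains and lacking any database similarity.
        "no_database_match": without_domains - similar_to_db,
    }

summary = annotation_breakdown(total=2827, with_function=2026,
                               without_domains=801, similar_to_db=747)
print(summary)
```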

  7. Two pheromone precursor genes are transcriptionally expressed in the homothallic ascomycete Sordaria macrospora.

    PubMed

    Pöggeler, S

    2000-06-01

    In order to analyze the involvement of pheromones in cell recognition and mating in a homothallic fungus, two putative pheromone precursor genes, named ppg1 and ppg2, were isolated from a genomic library of Sordaria macrospora. The ppg1 gene is predicted to encode a precursor pheromone that is processed by a Kex2-like protease to yield a pheromone that is structurally similar to the alpha-factor of the yeast Saccharomyces cerevisiae. The ppg2 gene encodes a 24-amino-acid polypeptide that contains a putative farnesylated and carboxy methylated C-terminal cysteine residue. The sequences of the predicted pheromones display strong structural similarity to those encoded by putative pheromones of heterothallic filamentous ascomycetes. Both genes are expressed during the life cycle of S. macrospora. This is the first description of pheromone precursor genes encoded by a homothallic fungus. Southern-hybridization experiments indicated that ppg1 and ppg2 homologues are also present in other homothallic ascomycetes.

  8. Genes encoding giant danio and golden shiner ependymin.

    PubMed

    Adams, D S; Kiyokawa, M; Getman, M E; Shashoua, V E

    1996-03-01

    Ependymin (EPN) is a brain glycoprotein that functions as a neurotrophic factor in optic nerve regeneration and long-term memory consolidation in goldfish. To date, true epn genes have been characterized in one order of teleost fish, Cypriniformes. In the study presented here, polymerase chain reactions were used to analyze the complete epn genes, gd (1480 bp) and sh (2071 bp), from Cypriniformes giant danio and shiner, respectively. Southern hybridizations demonstrated the existence of one copy of each gene per corresponding haploid genome. Each gene was found to contain six exons and five introns. Gene gd encodes a predicted 218-amino acid (aa) protein GD 93 percent conserved to goldfish EPN, while sh encodes a predicted 214-aa protein SH 91 percent homologous to goldfish. Evidence is presented classifying proteins previously termed "EPNs" into two major categories: true EPNs and non-EPN cerebrospinal fluid glycoproteins. Proteins GD and SH contain all the hallmark features of true EPNs.

  9. Cloning and characterization of a mouse gene with homology to the human von Hippel-Lindau disease tumor suppressor gene: implications for the potential organization of the human von Hippel-Lindau disease gene.

    PubMed

    Gao, J; Naglich, J G; Laidlaw, J; Whaley, J M; Seizinger, B R; Kley, N

    1995-02-15

    The human von Hippel-Lindau disease (VHL) gene has recently been identified and, based on the nucleotide sequence of a partial cDNA clone, has been predicted to encode a novel protein with as yet unknown functions [F. Latif et al., Science (Washington DC), 260: 1317-1320, 1993]. The length of the encoded protein and the characteristics of the cellular expressed protein are as yet unclear. Here we report the cloning and characterization of a mouse gene (mVHLh1) that is widely expressed in different mouse tissues and shares high homology with the human VHL gene. It predicts a protein 181 residues long (and/or 162 amino acids, considering a potential alternative start codon), which across a core region of approximately 140 residues displays a high degree of sequence identity (98%) to the predicted human VHL protein. High stringency DNA and RNA hybridization experiments and protein expression analyses indicate that this gene is the most highly VHL-related mouse gene, suggesting that it represents the mouse VHL gene homologue rather than a related gene sharing a conserved functional domain. These findings provide new insights into the potential organization of the VHL gene and nature of its encoded protein.

  10. Heterologous production and characterization of two glyoxal oxidases from Pycnoporus cinnabarinus

    Treesearch

    Marianne Daou; François Piumi; Daniel Cullen; Eric Record; Craig B. Faulds

    2016-01-01

    The genome of the white rot fungus Pycnoporus cinnabarinus includes a large number of genes encoding enzymes implicated in lignin degradation. Among these, three genes are predicted to encode glyoxal oxidase, an enzyme previously isolated from Phanerochaete chrysosporium. The glyoxal oxidase of P. chrysosporium...

  11. Implications of Cognitive Load for Hypothesis Generation and Probability Judgment

    PubMed Central

    Sprenger, Amber M.; Dougherty, Michael R.; Atkins, Sharona M.; Franco-Watkins, Ana M.; Thomas, Rick P.; Lange, Nicholas; Abbs, Brandon

    2011-01-01

    We tested the predictions of HyGene (Thomas et al., 2008) that both divided attention at encoding and judgment should affect the degree to which participants’ probability judgments violate the principle of additivity. In two experiments, we showed that divided attention during judgment leads to an increase in subadditivity, suggesting that the comparison process for probability judgments is capacity limited. Contrary to the predictions of HyGene, a third experiment revealed that divided attention during encoding leads to an increase in later probability judgment made under full attention. The effect of divided attention during encoding on judgment was completely mediated by the number of hypotheses participants generated, indicating that limitations in both encoding and recall can cascade into biases in judgments. PMID:21734897

  12. Degradation of triglycerides by a pseudomonad isolated from milk: molecular analysis of a lipase-encoding gene and its expression in Escherichia coli.

    PubMed Central

    Johnson, L A; Beacham, I R; MacRae, I C; Free, M L

    1992-01-01

    Psychrotrophic lipolytic bacteria represent a significant problem in the storage of refrigerated dairy products. A lipase-encoding gene has been cloned and characterized from a highly lipolytic strain of Pseudomonas. The nucleotide sequence of the gene predicts a polypeptide of M(r) 49,905, which was identified when the gene was expressed in Escherichia coli. PMID:1622251

  13. Draft Genome Sequence of Ezakiella peruensis Strain M6.X2, a Human Gut Gram-Positive Anaerobic Coccus.

    PubMed

    Diop, Awa; Diop, Khoudia; Tomei, Enora; Raoult, Didier; Fenollar, Florence; Fournier, Pierre-Edouard

    2018-03-01

    We report here the draft genome sequence of Ezakiella peruensis strain M6.X2T. The draft genome is 1,672,788 bp long and harbors 1,589 predicted protein-encoding genes, including 26 antibiotic resistance genes with 1 gene encoding vancomycin resistance. The genome also exhibits 1 clustered regularly interspaced short palindromic repeat region and 333 genes acquired by horizontal gene transfer. Copyright © 2018 Diop et al.

  14. β-Lactamase Genes of the Penicillin-Susceptible Bacillus anthracis Sterne Strain

    PubMed Central

    Chen, Yahua; Succi, Janice; Tenover, Fred C.; Koehler, Theresa M.

    2003-01-01

    Susceptibility to penicillin and other β-lactam-containing compounds is a common trait of Bacillus anthracis. β-lactam agents, particularly penicillin, have been used worldwide to treat anthrax in humans. Nonetheless, surveys of clinical and soil-derived strains reveal penicillin G resistance in 2 to 16% of isolates tested. Bacterial resistance to β-lactam agents is often mediated by production of one or more types of β-lactamases that hydrolyze the β-lactam ring, inactivating the antimicrobial agent. Here, we report the presence of two β-lactamase (bla) genes in the penicillin-susceptible Sterne strain of B. anthracis. We identified bla1 by functional cloning with Escherichia coli. bla1 is a 927-nucleotide (nt) gene predicted to encode a protein with 93.8% identity to the type I β-lactamase gene of Bacillus cereus. A second gene, bla2, was identified by searching the unfinished B. anthracis chromosome sequence database of The Institute for Genome Research for open reading frames (ORFs) predicted to encode β-lactamases. We found a partial ORF predicted to encode a protein with significant similarity to the carboxy-terminal end of the type II β-lactamase of B. cereus. DNA adjacent to the 5′ end of the partial ORF was cloned using inverse PCR. bla2 is a 768-nt gene predicted to encode a protein with 92% identity to the B. cereus type II enzyme. The bla1 and bla2 genes confer ampicillin resistance to E. coli and Bacillus subtilis when cloned individually in these species. The MICs of various antimicrobial agents for the E. coli clones indicate that the two β-lactamase genes confer different susceptibility profiles to E. coli; bla1 is a penicillinase, while bla2 appears to be a cephalosporinase. The β-galactosidase activities of B. cereus group species harboring bla promoter-lacZ transcriptional fusions indicate that bla1 is poorly transcribed in B. anthracis, B. cereus, and B. thuringiensis. The bla2 gene is strongly expressed in B. cereus and B. thuringiensis and weakly expressed in B. anthracis. Taken together, these data indicate that the bla1 and bla2 genes of the B. anthracis Sterne strain encode functional β-lactamases of different types, but gene expression is usually not sufficient to confer resistance to β-lactam agents. PMID:12533457

  15. Isolation and characterization of polygalacturonase genes (pecA and pecB) from Aspergillus flavus.

    PubMed Central

    Whitehead, M P; Shieh, M T; Cleveland, T E; Cary, J W; Dean, R A

    1995-01-01

    Two genes, pecA and pecB, encoding endopolygalacturonases were cloned from a highly aggressive strain of Aspergillus flavus. The pecA gene consisted of 1,228 bp encoding a protein of 363 amino acids with a predicted molecular mass of 37.6 kDa, interrupted by two introns of 58 and 81 bp in length. Accumulation of pecA mRNA in both pectin- and glucose-grown mycelia in the highly aggressive strain matched the activity profile of a pectinase previously identified as P2c. Transformants of a weakly aggressive strain containing a functional copy of the pecA gene produced P2c in vitro, confirming that pecA encodes P2c. The coding region of pecB was determined to be 1,217 bp in length interrupted by two introns of 65 and 54 bp in length. The predicted protein of 366 amino acids had an estimated molecular mass of 38 kDa. Transcripts of this gene accumulated in mycelia grown in medium containing pectin alone, never in mycelia grown in glucose-containing medium, for both highly and weakly aggressive strains. Thus, pecB encodes the activity previously identified as P1 or P3. pecA and pecB share a high degree of sequence identity with polygalacturonase genes from Aspergillus parasiticus and Aspergillus oryzae, further establishing the close relationships between members of the A. flavus group. Conservation of intron positions in these genes also indicates that they share a common ancestor with genes encoding endopolygalacturonases of Aspergillus niger. PMID:7574642
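
    The gene, intron and protein sizes in this record are mutually consistent: subtracting the two intron lengths from each gene length leaves a coding sequence whose codon count equals the reported number of amino acids. This holds under the assumption that the stated lengths do not count the stop codon, which the abstract does not state. The function names are hypothetical.

```python
def coding_length(gene_bp, intron_bps):
    """Length of the coding sequence after removing introns, in base pairs."""
    return gene_bp - sum(intron_bps)

def codon_count(coding_bp):
    """Number of codons in a coding sequence; the length must be a multiple of 3."""
    if coding_bp % 3 != 0:
        raise ValueError("coding length is not a multiple of 3 bp")
    return coding_bp // 3

# pecA: 1,228 bp with introns of 58 and 81 bp; pecB: 1,217 bp with introns of 65 and 54 bp.
for name, gene_bp, introns in [("pecA", 1228, [58, 81]), ("pecB", 1217, [65, 54])]:
    cds = coding_length(gene_bp, introns)
    print(name, cds, codon_count(cds))
```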

  16. Role and Regulation of the Flp/Tad Pilus in the Virulence of Pectobacterium atrosepticum SCRI1043 and Pectobacterium wasabiae SCC3193

    PubMed Central

    Nykyri, Johanna; Mattinen, Laura; Niemi, Outi; Adhikari, Satish; Kõiv, Viia; Somervuo, Panu; Fang, Xin; Auvinen, Petri; Mäe, Andres; Palva, E. Tapio; Pirhonen, Minna

    2013-01-01

    In this study, we characterized a putative Flp/Tad pilus-encoding gene cluster, and we examined its regulation at the transcriptional level and its role in the virulence of potato pathogenic enterobacteria of the genus Pectobacterium. The Flp/Tad pilus-encoding gene clusters in Pectobacterium atrosepticum, Pectobacterium wasabiae and Pectobacterium aroidearum were compared to previously characterized flp/tad gene clusters, including that of the well-studied Flp/Tad pilus model organism Aggregatibacter actinomycetemcomitans, in which this pilus is a major virulence determinant. Comparative analyses revealed substantial protein sequence similarity and open reading frame synteny between the previously characterized flp/tad gene clusters and the cluster in Pectobacterium, suggesting that the predicted flp/tad gene cluster in Pectobacterium encodes a Flp/Tad pilus-like structure. We detected genes for a novel two-component system adjacent to the flp/tad gene cluster in Pectobacterium, and mutant analysis demonstrated that this system has a positive effect on the transcription of selected Flp/Tad pilus biogenesis genes, suggesting that this response regulator regulates the flp/tad gene cluster. Mutagenesis of either the predicted regulator gene or selected Flp/Tad pilus biogenesis genes had a significant impact on the maceration ability of the bacterial strains in potato tubers, indicating that the Flp/Tad pilus-encoding gene cluster represents a novel virulence determinant in Pectobacterium. Soft-rot enterobacteria in the genera Pectobacterium and Dickeya are of great agricultural importance, and an investigation of the virulence of these pathogens could facilitate improvements in agricultural practices, thus benefiting farmers, the potato industry and consumers. PMID:24040039

  17. Role and regulation of the Flp/Tad pilus in the virulence of Pectobacterium atrosepticum SCRI1043 and Pectobacterium wasabiae SCC3193.

    PubMed

    Nykyri, Johanna; Mattinen, Laura; Niemi, Outi; Adhikari, Satish; Kõiv, Viia; Somervuo, Panu; Fang, Xin; Auvinen, Petri; Mäe, Andres; Palva, E Tapio; Pirhonen, Minna

    2013-01-01

    In this study, we characterized a putative Flp/Tad pilus-encoding gene cluster, and we examined its regulation at the transcriptional level and its role in the virulence of potato pathogenic enterobacteria of the genus Pectobacterium. The Flp/Tad pilus-encoding gene clusters in Pectobacterium atrosepticum, Pectobacterium wasabiae and Pectobacterium aroidearum were compared to previously characterized flp/tad gene clusters, including that of the well-studied Flp/Tad pilus model organism Aggregatibacter actinomycetemcomitans, in which this pilus is a major virulence determinant. Comparative analyses revealed substantial protein sequence similarity and open reading frame synteny between the previously characterized flp/tad gene clusters and the cluster in Pectobacterium, suggesting that the predicted flp/tad gene cluster in Pectobacterium encodes a Flp/Tad pilus-like structure. We detected genes for a novel two-component system adjacent to the flp/tad gene cluster in Pectobacterium, and mutant analysis demonstrated that this system has a positive effect on the transcription of selected Flp/Tad pilus biogenesis genes, suggesting that this response regulator regulates the flp/tad gene cluster. Mutagenesis of either the predicted regulator gene or selected Flp/Tad pilus biogenesis genes had a significant impact on the maceration ability of the bacterial strains in potato tubers, indicating that the Flp/Tad pilus-encoding gene cluster represents a novel virulence determinant in Pectobacterium. Soft-rot enterobacteria in the genera Pectobacterium and Dickeya are of great agricultural importance, and an investigation of the virulence of these pathogens could facilitate improvements in agricultural practices, thus benefiting farmers, the potato industry and consumers.

  18. Identification of a Linked Set of Genes in Serpulina hyodysenteriae (B204) Predicted To Encode Closely Related 39-Kilodalton Extracytoplasmic Proteins

    PubMed Central

    Gabe, Jeffrey D.; Dragon, Elizabeth; Chang, Ray-Jen; McCaman, Michael T.

    1998-01-01

    A tandem pair of nearly identical genes from Serpulina hyodysenteriae (B204) were cloned and sequenced. The full open reading frame of one gene and the partial open reading frame of the neighboring gene appear to encode secreted proteins which are homologous to, yet distinct from, the 39-kDa extracytoplasmic protein purified from the membrane fraction of S. hyodysenteriae. We have designated these newly identified genes vspA and vspB (for variable surface protein). PMID:9440540

  19. Draft Genome Sequence of Enterohemorrhagic Escherichia coli O157:H7 Strain MC2 Isolated from Cattle in France

    PubMed Central

    Auffret, Pauline; Segura, Audrey; Klopp, Christophe; Bouchez, Olivier; Kérourédan, Monique; Bibbal, Delphine; Brugère, Hubert; Forano, Evelyne

    2017-01-01

    ABSTRACT Enterohemorrhagic Escherichia coli (EHEC) with serotype O157:H7 is a major foodborne pathogen. Here, we report the draft genome sequence of EHEC O157:H7 strain MC2 isolated from cattle in France. The assembly contains 5,400,376 bp that encoded 5,914 predicted genes (5,805 protein-encoding genes and 109 RNA genes). PMID:28983004
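
    The gene counts in this record can be tallied and turned into an average spacing between genes. The sketch below uses only the three numbers from the abstract; the function name is hypothetical, and "bp per gene" is assembly length divided by gene count, not a measured gene length.

```python
def assembly_summary(assembly_bp, protein_genes, rna_genes):
    """Total predicted genes and average assembly length per predicted gene."""
    total_genes = protein_genes + rna_genes
    return total_genes, assembly_bp / total_genes

total, bp_per_gene = assembly_summary(5_400_376, 5_805, 109)
print(total, round(bp_per_gene, 1))
```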

  20. The prediction of biogenic magnetic nanoparticles biomineralization in human tissues and organs

    NASA Astrophysics Data System (ADS)

    Medviediev, O.; Gorobets, O. Yu; Gorobets, S. V.; Yadrykhins'ky, V. S.

    2017-10-01

    In this study, human homologs of magnetosome island proteins were identified by pairwise and multiple alignment of amino acid sequences. The expression levels of genes encoding magnetosome island proteins of M. gryphiswaldense MSR-1, cultured under oxygen-deficient conditions and under microaerobic conditions, were compared with the expression levels of genes encoding the corresponding homologs in humans. The possibility of BMN biomineralization in human tissues and organs in which BMN had not previously been found experimentally was predicted.

  1. Genetic Insights Into Pyralomicin Biosynthesis in Nonomuraea spiralis IMC A-0156

    PubMed Central

    Flatt, Patricia M.; Wu, Xiumei; Perry, Steven; Mahmud, Taifo

    2013-01-01

    The biosynthetic gene cluster for the pyralomicin antibiotics has been cloned and sequenced from Nonomuraea spiralis IMC A-0156. The 41-kb gene cluster contains 27 ORFs predicted to encode all of the functions for pyralomicin biosynthesis. This includes non-ribosomal peptide synthetases (NRPS) and polyketide synthases (PKS) required for the formation of the benzopyranopyrrole core unit, as well as a suite of tailoring enzymes (e.g., four halogenases, an O-methyltransferase, and an N-glycosyltransferase) necessary for further modifications of the core structure. The N-glycosyltransferase is predicted to transfer either glucose or a pseudosugar (cyclitol) to the aglycone. A gene cassette encoding C7-cyclitol biosynthetic enzymes was identified upstream of the benzopyranopyrrole-specific ORFs. Targeted disruption of the gene encoding the N-glycosyltransferase, prlH, abolished pyralomicin production and recombinant expression of PrlA confirms the activity of this enzyme as a sugar phosphate cyclase (SPC) involved in the formation of the C7-cyclitol moiety. PMID:23607523

  2. Regulation of Sulfur Assimilation Pathways in Burkholderia cenocepacia through Control of Genes by the SsuR Transcription Factor

    PubMed Central

    Łochowska, Anna; Iwanicka-Nowicka, Roksana; Zielak, Agata; Modelewska, Anna; Thomas, Mark S.; Hryniewicz, Monika M.

    2011-01-01

    The genome of Burkholderia cenocepacia contains two genes encoding closely related LysR-type transcriptional regulators, CysB and SsuR, involved in control of sulfur assimilation processes. In this study we show that the function of SsuR is essential for the utilization of a number of organic sulfur sources of either environmental or human origin. Among the genes upregulated by SsuR identified here are the tauABC operon encoding a predicted taurine transporter, three tauD-type genes encoding putative taurine dioxygenases, and atsA encoding a putative arylsulfatase. The role of SsuR in expression of these genes/operons was characterized through (i) construction of transcriptional reporter fusions to candidate promoter regions and analysis of their expression in the presence/absence of SsuR and (ii) testing the ability of SsuR to bind SsuR-responsive promoter regions. We also demonstrate that expression of SsuR-activated genes is not repressed in the presence of inorganic sulfate. A more detailed analysis of four SsuR-responsive promoter regions indicated that ∼44 bp of the DNA sequence preceding and/or overlapping the predicted −35 element of such promoters is sufficient for SsuR binding. The DNA sequence homology among SsuR “recognition motifs” at different responsive promoters appears to be limited. PMID:21317335

  3. Molecular cloning and expression of the gene encoding the kinetoplast-associated type II DNA topoisomerase of Crithidia fasciculata.

    PubMed

    Pasion, S G; Hines, J C; Aebersold, R; Ray, D S

    1992-01-01

    A type II DNA topoisomerase, topoIImt, was shown previously to be associated with the kinetoplast DNA of the trypanosomatid Crithidia fasciculata. The gene encoding this kinetoplast-associated topoisomerase has been cloned by immunological screening of a Crithidia genomic expression library with monoclonal antibodies raised against the purified enzyme. The gene CfaTOP2 is a single copy gene and is expressed as a 4.8-kb polyadenylated transcript. The nucleotide sequence of CfaTOP2 has been determined and encodes a predicted polypeptide of 1239 amino acids with a molecular mass of 138,445. The identification of the cloned gene is supported by immunoblot analysis of the beta-galactosidase-CfaTOP2 fusion protein expressed in Escherichia coli and by analysis of tryptic peptide sequences derived from purified topoIImt. CfaTOP2 shares significant homology with nuclear type II DNA topoisomerases of other eukaryotes suggesting that in Crithidia both nuclear and mitochondrial forms of topoisomerase II are encoded by the same gene.

  4. Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster.

    PubMed

    Robertson, Hugh M; Warr, Coral G; Carlson, John R

    2003-11-25

    The insect chemoreceptor superfamily in Drosophila melanogaster is predicted to consist of 62 odorant receptor (Or) and 68 gustatory receptor (Gr) proteins, encoded by families of 60 Or and 60 Gr genes through alternative splicing. We include two previously undescribed Or genes and two previously undescribed Gr genes; two previously predicted Or genes are shown to be alternative splice forms. Three polymorphic pseudogenes and one highly defective pseudogene are recognized. Phylogenetic analysis reveals deep branches connecting multiple highly divergent clades within the Gr family, and the Or family appears to be a single highly expanded lineage within the superfamily. The genes are spread throughout the Drosophila genome, with some relatively recently diverged genes still clustered in the genome. The Gr5a gene on the X chromosome, which encodes a receptor for the sugar trehalose, has transposed from one such tandem cluster of six genes at cytological location 64, as has Gr61a, and all eight of these receptors might bind sugars. Analysis of intron evolution suggests that the common ancestor consisted of a long N-terminal exon encoding transmembrane domains 1-5 followed by three exons encoding transmembrane domains 6-7. As many as 57 additional introns have been acquired idiosyncratically during the evolution of the superfamily, whereas the ancestral introns and some of the older idiosyncratic introns have been lost at least 48 times independently. Altogether, these patterns of molecular evolution suggest that this is an ancient superfamily of chemoreceptors, probably dating back at least to the origin of the arthropods.

  5. Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster

    PubMed Central

    Robertson, Hugh M.; Warr, Coral G.; Carlson, John R.

    2003-01-01

    The insect chemoreceptor superfamily in Drosophila melanogaster is predicted to consist of 62 odorant receptor (Or) and 68 gustatory receptor (Gr) proteins, encoded by families of 60 Or and 60 Gr genes through alternative splicing. We include two previously undescribed Or genes and two previously undescribed Gr genes; two previously predicted Or genes are shown to be alternative splice forms. Three polymorphic pseudogenes and one highly defective pseudogene are recognized. Phylogenetic analysis reveals deep branches connecting multiple highly divergent clades within the Gr family, and the Or family appears to be a single highly expanded lineage within the superfamily. The genes are spread throughout the Drosophila genome, with some relatively recently diverged genes still clustered in the genome. The Gr5a gene on the X chromosome, which encodes a receptor for the sugar trehalose, has transposed from one such tandem cluster of six genes at cytological location 64, as has Gr61a, and all eight of these receptors might bind sugars. Analysis of intron evolution suggests that the common ancestor consisted of a long N-terminal exon encoding transmembrane domains 1-5 followed by three exons encoding transmembrane domains 6-7. As many as 57 additional introns have been acquired idiosyncratically during the evolution of the superfamily, whereas the ancestral introns and some of the older idiosyncratic introns have been lost at least 48 times independently. Altogether, these patterns of molecular evolution suggest that this is an ancient superfamily of chemoreceptors, probably dating back at least to the origin of the arthropods. PMID:14608037

  6. Identification of a melanosomal membrane protein encoded by the pink-eyed dilution (type II oculocutaneous albinism) gene.

    PubMed Central

    Rosemblat, S; Durham-Pierre, D; Gardner, J M; Nakatsu, Y; Brilliant, M H; Orlow, S J

    1994-01-01

    The pink-eyed dilution (p) locus in the mouse is critical to melanogenesis; mutations in the homologous locus in humans, P, are a cause of type II oculocutaneous albinism. Although a cDNA encoded by the p gene has recently been identified, nothing is known about the protein product of this gene. To characterize the protein encoded by the p gene, we performed immunoblot analysis of extracts of melanocytes cultured from wild-type mice with an antiserum from rabbits immunized with a peptide corresponding to amino acids 285-298 of the predicted protein product of the murine p gene. This antiserum recognized a 110-kDa protein. The protein was absent from extracts of melanocytes cultured from mice with two mutations (pcp and p) in which transcripts of the p gene are absent or greatly reduced. Introduction of the cDNA for the p gene into pcp melanocytes by electroporation resulted in expression of the 3.3-kb mRNA and the 110-kDa protein. Upon subcellular fractionation of cultured melanocytes, the 110-kDa protein was found to be present in melanosomes but absent from the vesicular fraction; phase separation performed with the nonionic detergent Triton X-114 confirmed the predicted hydrophobic nature of the protein. These results demonstrate that the p gene encodes a 110-kDa integral melanosomal membrane protein and establish a framework by which mutations at this locus, which diminish pigmentation, can be analyzed at the cellular and biochemical levels. PMID:7991586

  7. The restriction-modification genes of Escherichia coli K-12 may not be selfish: they do not resist loss and are readily replaced by alleles conferring different specificities.

    PubMed

    O'Neill, M; Chen, A; Murray, N E

    1997-12-23

    Type II restriction and modification (R-M) genes have been described as selfish because they have been shown to impose selection for the maintenance of the plasmid that encodes them. In our experiments, the type I R-M system EcoKI does not behave in the same way. The genes specifying EcoKI are, however, normally residents of the chromosome and therefore our analyses were extended to monitor the deletion of chromosomal genes rather than loss of plasmid vector. If EcoKI were to behave in the same way as the plasmid-encoded type II R-M systems, the loss of the relevant chromosomal genes by mutation or recombination should lead to cell death because the cell would become deficient in modification enzyme and the bacterial chromosome would be vulnerable to the restriction endonuclease. Our data contradict this prediction; they reveal that functional type I R-M genes in the chromosome are readily replaced by mutant alleles and by alleles encoding a type I R-M system of different specificity. The acquisition of allelic genes conferring a new sequence specificity, but not the loss of the resident genes, is dependent on the product of an unlinked gene, one predicted [Prakash-Cheng, A., Chung, S. S. & Ryu, J. (1993) Mol. Gen. Genet. 241, 491-496] to be relevant to control of expression of the genes that encode EcoKI. Our evidence suggests that not all R-M systems are evolving as "selfish" units; rather, the diversity and distribution of the family of type I enzymes we have investigated require an alternative selective pressure.

  8. A highly divergent gene cluster in honey bees encodes a novel silk family.

    PubMed

    Sutherland, Tara D; Campbell, Peter M; Weisman, Sarah; Trueman, Holly E; Sriskantha, Alagacone; Wanjura, Wolfgang J; Haritos, Victoria S

    2006-11-01

    The pupal cocoon of the domesticated silk moth Bombyx mori is the best known and most extensively studied insect silk. It is not widely known that Apis mellifera larvae also produce silk. We have used a combination of genomic and proteomic techniques to identify four honey bee fiber genes (AmelFibroin1-4) and two silk-associated genes (AmelSA1 and 2). The four fiber genes are small, comprise a single exon each, and are clustered on a short genomic region where the open reading frames are GC-rich amid low GC intergenic regions. The genes encode similar proteins that are highly helical and predicted to form unusually tight coiled coils. Despite the similarity in size, structure, and composition of the encoded proteins, the genes have low primary sequence identity. We propose that the four fiber genes have arisen from gene duplication events but have subsequently diverged significantly. The silk-associated genes encode proteins likely to act as a glue (AmelSA1) and involved in silk processing (AmelSA2). Although the silks of honey bees and silkmoths both originate in larval labial glands, the silk proteins are completely different in their primary, secondary, and tertiary structures as well as the genomic arrangement of the genes encoding them. This implies independent evolutionary origins for these functionally related proteins.

  9. Proteins of Unknown Biochemical Function: A Persistent Problem and a Roadmap to Help Overcome It.

    PubMed

    Niehaus, Thomas D; Thamm, Antje M K; de Crécy-Lagard, Valérie; Hanson, Andrew D

    2015-11-01

    The number of sequenced genomes is rapidly increasing, but functional annotation of the genes in these genomes lags far behind. Even in Arabidopsis (Arabidopsis thaliana), only approximately 40% of enzyme- and transporter-encoding genes have credible functional annotations, and this number is even lower in nonmodel plants. Functional characterization of unknown genes is a challenge, but various databases (e.g. for protein localization and coexpression) can be mined to provide clues. If homologous microbial genes exist (and about one-half of the genes encoding unknown enzymes and transporters in Arabidopsis have microbial homologs), cross-kingdom comparative genomics can powerfully complement plant-based data. Multiple lines of evidence can strengthen predictions and warrant experimental characterization. In some cases, relatively quick tests in genetically tractable microbes can determine whether a prediction merits biochemical validation, which is costly and demands specialized skills. © 2015 American Society of Plant Biologists. All Rights Reserved.

  10. [Expression changes of major outer membrane protein antigens in Leptospira interrogans during infection and its mechanism].

    PubMed

    Zheng, Linli; Ge, Yumei; Hu, Weilin; Yan, Jie

    2013-03-01

    The aim was to determine the expression changes of major outer membrane protein (OMP) antigens of Leptospira interrogans serogroup Icterohaemorrhagiae serovar Lai strain Lai during infection of human macrophages, and the underlying mechanism. The OmpR-encoding genes and the OmpR-related histidine kinase (HK)-encoding gene of L. interrogans strain Lai, together with their functional domains, were predicted using bioinformatic techniques. Changes in mRNA levels of the major leptospiral OMP-encoding genes before and after infection of human THP-1 macrophages were measured by real-time fluorescence quantitative RT-PCR. The effects of the OmpR-encoding genes and the HK-encoding gene on the expression of leptospiral OMPs during infection were determined by HK-peptide antiserum blocking and closantel inhibition assays. The bioinformatic analysis indicated that LB015 and LB333 are OmpR-encoding genes of the spirochete, while LB014 might act as an OmpR-related HK-encoding gene. After the spirochete infected THP-1 cells, mRNA levels of the leptospiral lipL21, lipL32 and lipL41 genes were rapidly and persistently down-regulated (P < 0.01), whereas mRNA levels of the leptospiral groEL, mce, loa22 and ligB genes were rapidly but transiently up-regulated (P < 0.01). Treatment with closantel and HK-peptide antiserum partly reversed the infection-induced down-regulation of lipL21 and lipL48 mRNA levels (P < 0.01). Moreover, closantel reduced the infection-induced up-regulation of groEL, mce, loa22 and ligB mRNA levels (P < 0.01). Expression levels of the major OMP antigens of L. interrogans strain Lai change notably during infection of human macrophages. A group of OmpR- and HK-encoding genes may play a major role in the down-regulation of some OMP antigens during infection.

  11. Genome-Wide Architecture of Disease Resistance Genes in Lettuce

    PubMed Central

    Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K.; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W.

    2015-01-01

    Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12, were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied, particularly for the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes. PMID:26449254

  12. Draft genome sequence of Xylaria sp., the causal agent of taproot decline of soybean in the southern United States.

    PubMed

    Sharma, Sandeep; Zaccaron, Alex Z; Ridenour, John B; Allen, Tom W; Conner, Kassie; Doyle, Vinson P; Price, Trey; Sikora, Edward; Singh, Raghuwinder; Spurlock, Terry; Tomaso-Peterson, Maria; Wilkerson, Tessie; Bluhm, Burton H

    2018-04-01

    The draft genome of Xylaria sp. isolate MSU_SB201401, causal agent of taproot decline of soybean in the southern U.S., is presented here. The genome assembly was 56.7 Mb in size with an L50 of 246. A total of 10,880 putative protein-encoding genes were predicted, including 647 genes encoding carbohydrate-active enzymes and 1053 genes encoding secreted proteins. This is the first draft genome of a plant-pathogenic Xylaria sp. associated with soybean. The draft genome of Xylaria sp. isolate MSU_SB201401 will provide an important resource for future experiments to determine the molecular basis of pathogenesis.

  13. CD8 T cell response and evolutionary pressure to HIV-1 cryptic epitopes derived from antisense transcription

    PubMed Central

    Carlson, Jonathan; Yan, Jiyu; Akinsiku, Olusimidele T.; Schaefer, Malinda; Sabbaj, Steffanie; Bet, Anne; Levy, David N.; Heath, Sonya; Tang, Jianming; Kaslow, Richard A.; Walker, Bruce D.; Ndung’u, Thumbi; Goulder, Philip J.; Heckerman, David; Hunter, Eric; Goepfert, Paul A.

    2010-01-01

    Retroviruses pack multiple genes into relatively small genomes by encoding several genes in the same genomic region with overlapping reading frames. Both sense and antisense HIV-1 transcripts contain open reading frames for known functional proteins as well as numerous alternative reading frames (ARFs). At least some ARFs have the potential to encode proteins of unknown function, and their antigenic properties can be considered as cryptic epitopes (CEs). To examine the extent of active immune response to virally encoded CEs, we analyzed human leukocyte antigen class I–associated polymorphisms in HIV-1 gag, pol, and nef genes from a large cohort of South Africans with chronic infection. In all, 391 CEs and 168 conventional epitopes were predicted, with the majority (307; 79%) of CEs derived from antisense transcripts. In further evaluation of CD8 T cell responses to a subset of the predicted CEs in patients with primary or chronic infection, both sense- and antisense-encoded CEs were immunogenic at both stages of infection. In addition, CEs often mutated during the first year of infection, which was consistent with immune selection for escape variants. These findings indicate that the HIV-1 genome might encode and deploy a large potential repertoire of unconventional epitopes to enhance vaccine-induced antiviral immunity. PMID:20065064

  14. In silico analysis of β-mannanases and β-mannosidase from Aspergillus flavus and Trichoderma virens UKM1

    NASA Astrophysics Data System (ADS)

    Yee, Chai Sin; Murad, Abdul Munir Abdul; Bakar, Farah Diba Abu

    2013-11-01

    Genes encoding endo-β-1,4-mannanases from Trichoderma virens UKM1 (manTV) and Aspergillus flavus UKM1 (manAF) were analysed with bioinformatic tools. In addition, the A. flavus NRRL 3357 genome database was screened for a β-mannosidase gene, which was also analysed (mndA-AF). These three genes were analysed to understand their gene properties. manTV and manAF consist of 1,332 bp and 1,386 bp, encoding 443 and 461 amino acid residues, respectively. Both endo-β-1,4-mannanases belong to glycosyl hydrolase family 5 and contain a carbohydrate-binding module family 1 (CBM1). In contrast, mndA-AF is a 2,745-bp gene that encodes a protein sequence of 914 amino acid residues. This β-mannosidase belongs to glycosyl hydrolase family 2. The predicted molecular weights of manTV, manAF and mndA-AF are 47.74 kDa, 49.71 kDa and 103 kDa, respectively. All three predicted protein sequences possess a signal peptide sequence and are highly conserved among other fungal β-mannanases and β-mannosidases.

  15. High-quality draft genome sequence of the Thermus amyloliquefaciens type strain YIM 77409 T with an incomplete denitrification pathway

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, En-Min; Murugapiran, Senthil K.; Mefferd, Chrisabelle C.

    Thermus amyloliquefaciens type strain YIM 77409 T is a thermophilic, Gram-negative, non-motile and rod-shaped bacterium isolated from Niujie Hot Spring in Eryuan County, Yunnan Province, southwest China. In the present study we describe the features of strain YIM 77409 T together with its genome sequence and annotation. The genome is 2,160,855 bp long and consists of 6 scaffolds with 67.4 % average GC content. A total of 2,313 genes were predicted, comprising 2,257 protein-coding and 56 RNA genes. The genome is predicted to encode a complete glycolysis, pentose phosphate pathway, and tricarboxylic acid cycle. Additionally, a large number of transporters and enzymes for heterotrophy highlight the broad heterotrophic lifestyle of this organism. Furthermore, a denitrification gene cluster included genes predicted to encode enzymes for the sequential reduction of nitrate to nitrous oxide, consistent with the incomplete denitrification phenotype of this strain.

  16. High-quality draft genome sequence of the Thermus amyloliquefaciens type strain YIM 77409 T with an incomplete denitrification pathway

    DOE PAGES

    Zhou, En-Min; Murugapiran, Senthil K.; Mefferd, Chrisabelle C.; ...

    2016-02-27

    Thermus amyloliquefaciens type strain YIM 77409 T is a thermophilic, Gram-negative, non-motile and rod-shaped bacterium isolated from Niujie Hot Spring in Eryuan County, Yunnan Province, southwest China. In the present study we describe the features of strain YIM 77409 T together with its genome sequence and annotation. The genome is 2,160,855 bp long and consists of 6 scaffolds with 67.4 % average GC content. A total of 2,313 genes were predicted, comprising 2,257 protein-coding and 56 RNA genes. The genome is predicted to encode a complete glycolysis, pentose phosphate pathway, and tricarboxylic acid cycle. Additionally, a large number of transporters and enzymes for heterotrophy highlight the broad heterotrophic lifestyle of this organism. Furthermore, a denitrification gene cluster included genes predicted to encode enzymes for the sequential reduction of nitrate to nitrous oxide, consistent with the incomplete denitrification phenotype of this strain.

  17. EGASP: the human ENCODE Genome Annotation Assessment Project

    PubMed Central

    Guigó, Roderic; Flicek, Paul; Abril, Josep F; Reymond, Alexandre; Lagarde, Julien; Denoeud, France; Antonarakis, Stylianos; Ashburner, Michael; Bajic, Vladimir B; Birney, Ewan; Castelo, Robert; Eyras, Eduardo; Ucla, Catherine; Gingeras, Thomas R; Harrow, Jennifer; Hubbard, Tim; Lewis, Suzanna E; Reese, Martin G

    2006-01-01

    Background We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. Results The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. Conclusion This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence. PMID:16925836

  18. Molecular characterization and expression of the M6 gene of grass carp hemorrhage virus (GCHV), an aquareovirus.

    PubMed

    Qiu, T; Lu, R H; Zhang, J; Zhu, Z Y

    2001-07-01

    The complete nucleotide sequence of the M6 gene of grass carp hemorrhage virus (GCHV) was determined. It is 2039 nucleotides in length and contains a single large open reading frame that could encode a protein of 648 amino acids with a predicted molecular mass of 68.7 kDa. Amino acid sequence comparison revealed that the protein encoded by GCHV M6 is closely related to the protein mu1 of mammalian reovirus. The M6 gene, encoding the major outer-capsid protein, was expressed using the pET fusion protein vector in Escherichia coli and detected by Western blotting using chicken anti-GCHV immunoglobulin (IgY). The result indicates that the protein encoded by M6 may share a putative Asn-42-Pro-43 proteolytic cleavage site with mu1.

  19. Identification and in vitro characterization of a Marek’s disease virus encoded ribonucleotide reductase

    USDA-ARS's Scientific Manuscript database

    Marek’s disease virus (MDV) encodes a ribonucleotide reductase (RR), a key regulatory enzyme in the DNA synthesis pathway. The gene coding for the RR of MDV is located in the unique long (UL) region of the genome. The large subunit is encoded by UL39 (RR1) and is predicted to comprise 860 amino acid...

  20. Genome analysis of Daldinia eschscholtzii strains UM 1400 and UM 1020, wood-decaying fungi isolated from human hosts

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chan, Chai Ling; Yew, Su Mei; Ngeow, Yun Fong

    Background: Daldinia eschscholtzii is a wood-inhabiting fungus that causes wood decay under certain conditions. It has a broad host range and produces a large repertoire of potentially bioactive compounds. However, there is no extensive genome analysis on this fungal species. Results: Two fungal isolates (UM 1400 and UM 1020) from human specimens were identified as Daldinia eschscholtzii by morphological features and ITS-based phylogenetic analysis. Both genomes were similar in size with 10,822 predicted genes in UM 1400 (35.8 Mb) and 11,120 predicted genes in UM 1020 (35.5 Mb). A total of 751 gene families were shared among both UM isolates, including gene families associated with fungus-host interactions. In the CAZyme comparative analysis, both genomes were found to contain arrays of CAZymes related to plant cell wall degradation. Genes encoding secreted peptidases involved in the degradation of structural proteins in the plant cell wall were found in the genomes. In addition, arrays of secondary metabolite backbone genes were identified in both genomes, indicating their potential to produce bioactive secondary metabolites. Both genomes also contained an abundance of genes encoding signaling components, with three proposed MAPK cascades involved in cell wall integrity, osmoregulation, and mating/filamentation. Besides genomic evidence for degrading capability, both isolates also harbored an array of genes encoding stress response proteins that are potentially significant for adaptation to living in hostile environments. Conclusions: Our genomic studies provide further information for the biological understanding of D. eschscholtzii and suggest that these wood-decaying fungi are also equipped for adaptation to adverse environments in the human host.

  1. Genome analysis of Daldinia eschscholtzii strains UM 1400 and UM 1020, wood-decaying fungi isolated from human hosts

    DOE PAGES

    Chan, Chai Ling; Yew, Su Mei; Ngeow, Yun Fong; ...

    2015-11-18

    Background: Daldinia eschscholtzii is a wood-inhabiting fungus that causes wood decay under certain conditions. It has a broad host range and produces a large repertoire of potentially bioactive compounds. However, there is no extensive genome analysis on this fungal species. Results: Two fungal isolates (UM 1400 and UM 1020) from human specimens were identified as Daldinia eschscholtzii by morphological features and ITS-based phylogenetic analysis. Both genomes were similar in size with 10,822 predicted genes in UM 1400 (35.8 Mb) and 11,120 predicted genes in UM 1020 (35.5 Mb). A total of 751 gene families were shared among both UM isolates, including gene families associated with fungus-host interactions. In the CAZyme comparative analysis, both genomes were found to contain arrays of CAZymes related to plant cell wall degradation. Genes encoding secreted peptidases involved in the degradation of structural proteins in the plant cell wall were found in the genomes. In addition, arrays of secondary metabolite backbone genes were identified in both genomes, indicating their potential to produce bioactive secondary metabolites. Both genomes also contained an abundance of genes encoding signaling components, with three proposed MAPK cascades involved in cell wall integrity, osmoregulation, and mating/filamentation. Besides genomic evidence for degrading capability, both isolates also harbored an array of genes encoding stress response proteins that are potentially significant for adaptation to living in hostile environments. Conclusions: Our genomic studies provide further information for the biological understanding of D. eschscholtzii and suggest that these wood-decaying fungi are also equipped for adaptation to adverse environments in the human host.

  2. Export of extracellular polysaccharides modulates adherence of the Cyanobacterium synechocystis.

    PubMed

    Fisher, Michael L; Allen, Rebecca; Luo, Yingqin; Curtiss, Roy

    2013-01-01

    The field of cyanobacterial biofuel production is advancing rapidly, yet we know little of the basic biology of these organisms outside of their photosynthetic pathways. We aimed to gain a greater understanding of how the cyanobacterium Synechocystis PCC 6803 (Synechocystis, hereafter) modulates its cell surface. Such understanding will allow for the creation of mutants that autoflocculate in a regulated way, thus avoiding energy-intensive centrifugation in the creation of biofuels. We constructed mutant strains lacking genes predicted to function in carbohydrate transport or synthesis. Strains with gene deletions of slr0977 (predicted to encode a permease component of an ABC transporter), slr0982 (predicted to encode an ATP binding component of an ABC transporter) and slr1610 (predicted to encode a methyltransferase) demonstrated flocculent phenotypes and increased adherence to glass. Upon bioinformatic inspection, the gene products of slr0977, slr0982, and slr1610 appear to function in O-antigen (OAg) transport and synthesis. However, the analysis provided here demonstrated no differences between OAg purified from wild-type and mutants. In contrast, exopolysaccharides (EPS) purified from mutants were altered in composition when compared to wild-type. Our data suggest that there are multiple means to modulate the cell surface of Synechocystis by disrupting different combinations of ABC transporters and/or glycosyl transferases. Further understanding of these mechanisms may allow for the development of industrially and ecologically useful strains of cyanobacteria. Additionally, these data imply that many cyanobacterial gene products may possess as-yet undiscovered functions and merit further study.

  3. Prioritization of candidate disease genes by combining topological similarity and semantic similarity.

    PubMed

    Liu, Bin; Jin, Min; Zeng, Pan

    2015-10-01

    The identification of gene-phenotype relationships is very important for the treatment of human diseases. Studies have shown that genes causing the same or similar phenotypes tend to interact with each other in a protein-protein interaction (PPI) network. Thus, many identification methods based on the PPI network model have achieved good results. However, in the PPI network, some interactions between the proteins encoded by a candidate gene and the proteins encoded by known disease genes are very weak. Therefore, some studies have combined the PPI network with other genomic information and reported good predictive performances. However, we believe that the results could be further improved. In this paper, we propose a new method that uses the semantic similarity between the candidate gene and known disease genes to set the initial probability vector of a random-walk-with-restart algorithm in a human PPI network. The effectiveness of our method was demonstrated by leave-one-out cross-validation, and the experimental results indicated that our method outperformed other methods. Additionally, our method can predict new causative genes of multifactor diseases, including Parkinson's disease, breast cancer and obesity. The top predictions were good and consistent with the findings in the literature, which further illustrates the effectiveness of our method. Copyright © 2015 Elsevier Inc. All rights reserved.
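
    The random walk with restart mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the four-node network, the similarity scores, the restart probability of 0.3 and the function name are all hypothetical, chosen only to show how a similarity-weighted initial vector feeds the iteration p <- (1 - r) W p + r p0, where W is the column-normalised adjacency matrix.

```python
def random_walk_with_restart(adj, p0, restart=0.3, tol=1e-10, max_iter=1000):
    """Iterate p <- (1 - restart) * W p + restart * p0 until the L1 change < tol.

    adj is a symmetric adjacency matrix (list of lists); W is its
    column-normalised form. All parameter values here are illustrative.
    """
    n = len(adj)
    col_sums = [sum(adj[i][j] for i in range(n)) for j in range(n)]
    # Column-normalise so that each column of W sums to 1 (or 0 if isolated).
    w = [[adj[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
         for i in range(n)]
    total = sum(p0)
    p0 = [x / total for x in p0]  # the initial vector must sum to 1
    p = list(p0)
    for _ in range(max_iter):
        nxt = [(1 - restart) * sum(w[i][j] * p[j] for j in range(n))
               + restart * p0[i] for i in range(n)]
        if sum(abs(a - b) for a, b in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    return p


# Hypothetical toy PPI network: a path 0 - 1 - 2 - 3.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
# Hypothetical semantic-similarity scores used as the initial vector,
# in place of a uniform weight on known disease genes.
similarity = [0.9, 0.1, 0.0, 0.0]

scores = random_walk_with_restart(adjacency, similarity)
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

    In this toy setting, nodes closer to the high-similarity seed receive higher steady-state scores, which is the property such methods use to rank candidate genes.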

  4. Molecular cloning and characterization of chitinase genes from Candida albicans.

    PubMed

    McCreath, K J; Specht, C A; Robbins, P W

    1995-03-28

    Chitinase (EC 3.2.1.14) is an important enzyme for the remodeling of chitin in the cell wall of fungi. We have cloned three chitinase genes (CHT1, CHT2, and CHT3) from the dimorphic human pathogen Candida albicans. CHT2 and CHT3 have been sequenced in full and their primary structures have been analyzed: CHT2 encodes a protein of 583 aa with a predicted size of 60.8 kDa; CHT3 encodes a protein of 567 aa with a predicted size of 60 kDa. All three genes show striking similarity to other chitinase genes in the literature, especially in the proposed catalytic domain. Transcription of CHT2 and CHT3 was greater when C. albicans was grown in a yeast phase as compared to a mycelial phase. A transcript of CHT1 could not be detected in either growth condition.

  5. Regulation Mechanism Mediated by Trans-Encoded sRNA Nc117 in Short Chain Alcohols Tolerance in Synechocystis sp. PCC 6803.

    PubMed

    Bi, Yanqi; Pei, Guangsheng; Sun, Tao; Chen, Zixi; Chen, Lei; Zhang, Weiwen

    2018-01-01

    Microbial small RNAs (sRNAs) play essential roles against many stress conditions in cyanobacteria. However, little is known about their regulatory mechanisms in biofuel tolerance. In our previous sRNA analysis, a trans-encoded sRNA Nc117 was found to be involved in the tolerance to ethanol and 1-butanol in Synechocystis sp. PCC 6803. However, its functional mechanism is yet to be determined. In this study, functional characterization of sRNA Nc117 was performed. Briefly, the exact length of the trans-encoded sRNA Nc117 was determined to be 102 nucleotides using 3' RACE, and the positive regulation of Nc117 on short chain alcohols tolerance was further confirmed. Then, computational target prediction and transcriptomic analysis were integrated to explore the potential targets of Nc117. A total of 119 up-regulated and 116 down-regulated genes were identified in the nc117 overexpression strain compared with the wild type by comparative transcriptomic analysis, among which the upstream regions of five genes overlapped with those predicted by the computational target approach. Based on the phenotype analysis of gene deletion and overexpression strains under short chain alcohols stress, one gene slr0007 encoding D-glycero-alpha-D-manno-heptose 1-phosphate guanylyltransferase was determined as a potential target of Nc117, suggesting that the synthesis of LPS or S-layer glycoprotein may be responsible for the tolerance enhancement. As the first reported trans-encoded sRNA positively regulating biofuels tolerance in cyanobacteria, this study not only provided evidence for a new regulatory mechanism of trans-encoded sRNA in cyanobacteria, but also valuable information for rational construction of high-tolerant cyanobacterial chassis.

  6. Characterization of a Gene Encoding Clathrin Heavy Chain in Maize Up-Regulated by Salicylic Acid, Abscisic Acid and High Boron Supply

    PubMed Central

    Zeng, Mu-Heng; Liu, Sheng-Hong; Yang, Miao-Xian; Zhang, Ya-Jun; Liang, Jia-Yong; Wan, Xiao-Rong; Liang, Hong

    2013-01-01

    Clathrin, a three-legged triskelion composed of three clathrin heavy chains (CHCs) and three light chains (CLCs), plays a critical role in clathrin-mediated endocytosis (CME) in eukaryotic cells. In this study, the genes ZmCHC1 and ZmCHC2 encoding clathrin heavy chain in maize were cloned and characterized for the first time in monocots. ZmCHC1 encodes a 1693-amino acid-protein including 29 exons and 28 introns, and ZmCHC2 encodes a 1746-amino acid-protein including 28 exons and 27 introns. The high similarities of gene structure, protein sequences and 3D models among ZmCHC1, and Arabidopsis AtCHC1 and AtCHC2 suggest their similar functions in CME. ZmCHC1 gene is predominantly expressed in maize roots instead of ubiquitous expression of ZmCHC2. Consistent with a typical predicted salicylic acid (SA)-responsive element and four predicted ABA-responsive elements (ABREs) in the promoter sequence of ZmCHC1, the expression of ZmCHC1 instead of ZmCHC2 in maize roots is significantly up-regulated by SA or ABA, suggesting that ZmCHC1 gene may be involved in the SA signaling pathway in maize defense responses. The expressions of ZmCHC1 and ZmCHC2 genes in maize are down-regulated by azide or cold treatment, further revealing the energy requirement of CME and suggesting that CME in plants is sensitive to low temperatures. PMID:23880865

  7. Gene encoding γ-carbonic anhydrase is cotranscribed with argC and induced in response to stationary phase and high CO2 in Azospirillum brasilense Sp7

    PubMed Central

    2010-01-01

    Background: Carbonic anhydrase (CA) is a ubiquitous enzyme catalyzing the reversible hydration of CO2 to bicarbonate, a reaction underlying diverse biochemical and physiological processes. Gamma class carbonic anhydrases (γ-CAs) are widespread in prokaryotes but their physiological roles remain elusive. At present, only γ-CA of Methanosarcina thermophila (Cam) has been shown to have CA activity. Genome analysis of a rhizobacterium Azospirillum brasilense, revealed occurrence of ORFs encoding one β-CA and two γ-CAs. Results: One of the putative γ-CA encoding genes of A. brasilense was cloned and overexpressed in E. coli. Electrometric assays for CA activity of the whole cell extracts overexpressing recombinant GCA1 did not show CO2 hydration activity. Reverse transcription-PCR analysis indicated that gca1 in A. brasilense is co-transcribed with its upstream gene annotated as argC, which encodes a putative N-acetyl-γ-glutamate-phosphate reductase. 5'-RACE also demonstrated that there was no transcription start site between argC and gca1, and the transcription start site located upstream of argC transcribed both the genes (argC-gca1). Using transcriptional fusions of argC-gca1 upstream region with promoterless lacZ, we further demonstrated that gca1 upstream region did not have any promoter and its transcription occurred from a promoter located in the argC upstream region. The transcription of argC-gca1 operon was upregulated in stationary phase and at elevated CO2 atmosphere. Conclusions: This study shows lack of CO2 hydration activity in a recombinant protein expressed from a gene predicted to encode a γ-carbonic anhydrase in A. brasilense although it cross reacts with anti-Cam antibody raised against a well characterized γ-CA. The organization and regulation of this gene along with the putative argC gene suggests its involvement in arginine biosynthetic pathway instead of the predicted CO2 hydration. PMID:20598158

  8. Gene encoding gamma-carbonic anhydrase is cotranscribed with argC and induced in response to stationary phase and high CO2 in Azospirillum brasilense Sp7.

    PubMed

    Kaur, Simarjot; Mishra, Mukti N; Tripathi, Anil K

    2010-07-04

    Carbonic anhydrase (CA) is a ubiquitous enzyme catalyzing the reversible hydration of CO2 to bicarbonate, a reaction underlying diverse biochemical and physiological processes. Gamma class carbonic anhydrases (gamma-CAs) are widespread in prokaryotes but their physiological roles remain elusive. At present, only gamma-CA of Methanosarcina thermophila (Cam) has been shown to have CA activity. Genome analysis of a rhizobacterium Azospirillum brasilense, revealed occurrence of ORFs encoding one beta-CA and two gamma-CAs. One of the putative gamma-CA encoding genes of A. brasilense was cloned and overexpressed in E. coli. Electrometric assays for CA activity of the whole cell extracts overexpressing recombinant GCA1 did not show CO2 hydration activity. Reverse transcription-PCR analysis indicated that gca1 in A. brasilense is co-transcribed with its upstream gene annotated as argC, which encodes a putative N-acetyl-gamma-glutamate-phosphate reductase. 5'-RACE also demonstrated that there was no transcription start site between argC and gca1, and the transcription start site located upstream of argC transcribed both the genes (argC-gca1). Using transcriptional fusions of argC-gca1 upstream region with promoterless lacZ, we further demonstrated that gca1 upstream region did not have any promoter and its transcription occurred from a promoter located in the argC upstream region. The transcription of argC-gca1 operon was upregulated in stationary phase and at elevated CO2 atmosphere. This study shows lack of CO2 hydration activity in a recombinant protein expressed from a gene predicted to encode a gamma-carbonic anhydrase in A. brasilense although it cross reacts with anti-Cam antibody raised against a well characterized gamma-CA. The organization and regulation of this gene along with the putative argC gene suggests its involvement in arginine biosynthetic pathway instead of the predicted CO2 hydration.

  9. Response of a rice paddy soil methanogen to syntrophic growth as revealed by transcriptional analyses.

    PubMed

    Liu, Pengfei; Yang, Yanxiang; Lü, Zhe; Lu, Yahai

    2014-08-01

    Members of Methanocellales are widespread in paddy field soils and play the key role in methane production. A prominent feature of these methanogens is their adaptation to low H2 and syntrophic growth with anaerobic fatty acid oxidizers. The adaptive mechanisms, however, remain unknown. In the present study, we determined the transcripts of 21 genes involved in the key steps of methanogenesis and acetate assimilation of Methanocella conradii HZ254, a strain recently isolated from paddy field soil. M. conradii was grown in monoculture and syntrophically with Pelotomaculum thermopropionicum (a propionate syntroph) or Syntrophothermus lipocalidus (a butyrate syntroph). Comparison of the relative transcript abundances showed that three hydrogenase-encoding genes and all methanogenesis-related genes tested were upregulated in cocultures relative to monoculture. The genes encoding formylmethanofuran dehydrogenase (Fwd), heterodisulfide reductase (Hdr), and the membrane-bound energy-converting hydrogenase (Ech) were the most upregulated among the evaluated genes. The expression of the formate dehydrogenase (Fdh)-encoding gene also was significantly upregulated. In contrast, an acetate assimilation gene was downregulated in cocultures. The genes coding for Fwd, Hdr, and the D subunit of F420-nonreducing hydrogenase (Mvh) form a large predicted transcription unit; therefore, the Mvh/Hdr/Fwd complex, capable of mediating the electron bifurcation and connecting the first and last steps of methanogenesis, was predicted to be formed in M. conradii. We propose that Methanocella methanogens cope with low H2 and syntrophic growth by (i) stabilizing the Mvh/Hdr/Fwd complex and (ii) activating formate-dependent methanogenesis.

  10. New FeFe-hydrogenase genes identified in a metagenomic fosmid library from a municipal wastewater treatment plant as revealed by high-throughput sequencing.

    PubMed

    Tomazetto, Geizecler; Wibberg, Daniel; Schlüter, Andreas; Oliveira, Valéria M

    2015-01-01

    A fosmid metagenomic library was constructed with total community DNA obtained from a municipal wastewater treatment plant (MWWTP), with the aim of identifying new FeFe-hydrogenase genes encoding the enzymes most important for hydrogen metabolism. The dataset generated by pyrosequencing of a fosmid library was mined to identify environmental gene tags (EGTs) assigned to FeFe-hydrogenase. The majority of EGTs representing FeFe-hydrogenase genes were affiliated with the class Clostridia, suggesting that this group is the main hydrogen producer in the MWWTP analyzed. From the assembled sequences, three FeFe-hydrogenase genes were predicted based on detection of the L2 motif (MPCxxKxxE) in the encoded gene product, confirming true FeFe-hydrogenase sequences. These sequences were used to design specific primers to detect fosmids encoding FeFe-hydrogenase genes predicted from the dataset. Three identified fosmids were completely sequenced. The cloned genomic fragments within these fosmids are closely related to members of the Spirochaetaceae, Bacteroidales and Firmicutes, and their FeFe-hydrogenase sequences are characterized by the structure type M3, which is common to clostridial enzymes. FeFe-hydrogenase sequences found in this study represent hitherto undetected sequences, indicating the high genetic diversity regarding these enzymes in MWWTP. Results suggest that MWWTP have to be considered as reservoirs for new FeFe-hydrogenase genes. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
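
    Screening predicted proteins for the L2 motif (MPCxxKxxE, where x is any residue) can be sketched with a regular expression. This is only an illustration of the motif check; the sequence names and sequences below are invented, and the abstract does not say which tool the authors used.

```python
import re
from typing import Dict, List

# L2 motif from the abstract: M-P-C-x-x-K-x-x-E, with x as any residue.
# The lookahead lets overlapping occurrences be reported as well.
L2_MOTIF = re.compile(r"(?=MPC..K..E)")


def find_l2_motif(proteins: Dict[str, str]) -> Dict[str, List[int]]:
    """Return 1-based start positions of the L2 motif for each protein that has it."""
    hits: Dict[str, List[int]] = {}
    for name, sequence in proteins.items():
        positions = [m.start() + 1 for m in L2_MOTIF.finditer(sequence.upper())]
        if positions:
            hits[name] = positions
    return hits


# Invented example sequences, for illustration only.
example_proteins = {
    "orf_with_motif": "MKTAMPCAAKGGEVL",
    "orf_without_motif": "MKTAMPCAAKGGDVL",
}
motif_hits = find_l2_motif(example_proteins)
```

    A motif hit alone is a coarse filter; candidates would still need the kind of sequence-level confirmation the abstract describes.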

  11. Meta-omic signatures of microbial metal and nitrogen cycling in marine oxygen minimum zones

    PubMed Central

    Glass, Jennifer B.; Kretz, Cecilia B.; Ganesh, Sangita; Ranjan, Piyush; Seston, Sherry L.; Buck, Kristen N.; Landing, William M.; Morton, Peter L.; Moffett, James W.; Giovannoni, Stephen J.; Vergin, Kevin L.; Stewart, Frank J.

    2015-01-01

    Iron (Fe) and copper (Cu) are essential cofactors for microbial metalloenzymes, but little is known about the metalloenzyme inventory of anaerobic marine microbial communities despite their importance to the nitrogen cycle. We compared dissolved O2, NO3−, NO2−, Fe and Cu concentrations with nucleic acid sequences encoding Fe and Cu-binding proteins in 21 metagenomes and 9 metatranscriptomes from Eastern Tropical North and South Pacific oxygen minimum zones and 7 metagenomes from the Bermuda Atlantic Time-series Station. Dissolved Fe concentrations increased sharply at upper oxic-anoxic transition zones, with the highest Fe:Cu molar ratio (1.8) occurring at the anoxic core of the Eastern Tropical North Pacific oxygen minimum zone and matching the predicted maximum ratio based on data from diverse ocean sites. The relative abundance of genes encoding Fe-binding proteins was negatively correlated with O2, driven by significant increases in genes encoding Fe-proteins involved in dissimilatory nitrogen metabolisms under anoxia. Transcripts encoding cytochrome c oxidase, the Fe- and Cu-containing terminal reductase in aerobic respiration, were positively correlated with O2 content. A comparison of the taxonomy of genes encoding Fe- and Cu-binding vs. bulk proteins in OMZs revealed that Planctomycetes represented a higher percentage of Fe genes while Thaumarchaeota represented a higher percentage of Cu genes, particularly at oxyclines. These results are broadly consistent with higher relative abundance of genes encoding Fe-proteins in the genome of a marine planctomycete vs. higher relative abundance of genes encoding Cu-proteins in the genome of a marine thaumarchaeote. These findings highlight the importance of metalloenzymes for microbial processes in oxygen minimum zones and suggest preferential Cu use in oxic habitats with Cu > Fe vs. preferential Fe use in anoxic niches with Fe > Cu. PMID:26441925

  12. Meta-omic signatures of microbial metal and nitrogen cycling in marine oxygen minimum zones.

    PubMed

    Glass, Jennifer B; Kretz, Cecilia B; Ganesh, Sangita; Ranjan, Piyush; Seston, Sherry L; Buck, Kristen N; Landing, William M; Morton, Peter L; Moffett, James W; Giovannoni, Stephen J; Vergin, Kevin L; Stewart, Frank J

    2015-01-01

    Iron (Fe) and copper (Cu) are essential cofactors for microbial metalloenzymes, but little is known about the metalloenzyme inventory of anaerobic marine microbial communities despite their importance to the nitrogen cycle. We compared dissolved O2, NO3−, NO2−, Fe and Cu concentrations with nucleic acid sequences encoding Fe and Cu-binding proteins in 21 metagenomes and 9 metatranscriptomes from Eastern Tropical North and South Pacific oxygen minimum zones and 7 metagenomes from the Bermuda Atlantic Time-series Station. Dissolved Fe concentrations increased sharply at upper oxic-anoxic transition zones, with the highest Fe:Cu molar ratio (1.8) occurring at the anoxic core of the Eastern Tropical North Pacific oxygen minimum zone and matching the predicted maximum ratio based on data from diverse ocean sites. The relative abundance of genes encoding Fe-binding proteins was negatively correlated with O2, driven by significant increases in genes encoding Fe-proteins involved in dissimilatory nitrogen metabolisms under anoxia. Transcripts encoding cytochrome c oxidase, the Fe- and Cu-containing terminal reductase in aerobic respiration, were positively correlated with O2 content. A comparison of the taxonomy of genes encoding Fe- and Cu-binding vs. bulk proteins in OMZs revealed that Planctomycetes represented a higher percentage of Fe genes while Thaumarchaeota represented a higher percentage of Cu genes, particularly at oxyclines. These results are broadly consistent with higher relative abundance of genes encoding Fe-proteins in the genome of a marine planctomycete vs. higher relative abundance of genes encoding Cu-proteins in the genome of a marine thaumarchaeote. These findings highlight the importance of metalloenzymes for microbial processes in oxygen minimum zones and suggest preferential Cu use in oxic habitats with Cu > Fe vs. preferential Fe use in anoxic niches with Fe > Cu.

  13. The genome of the insecticidal Chromobacterium subtsugae PRAA4-1 and its comparison with that of Chromobacterium violaceum ATCC 12472.

    PubMed

    Blackburn, Michael B; Sparks, Michael E; Gundersen-Rindal, Dawn E

    2016-12-01

    The genome of Chromobacterium subtsugae strain PRAA4-1, a betaproteobacterium producing insecticidal compounds, was sequenced and compared with the genome of C. violaceum ATCC 12472. The genome of C. subtsugae displayed a reduction in genes devoted to capsular and extracellular polysaccharide, possessed no genes encoding nitrate reductases, and exhibited many more phage-related sequences than were observed for C. violaceum. The genomes of both species possess a number of gene clusters predicted to encode biosynthetic complexes for secondary metabolites; these clusters suggest they produce overlapping, but distinct assortments of metabolites.

  14. CHLORELLA VIRUSES

    PubMed Central

    Yamada, Takashi; Onimatsu, Hideki; Van Etten, James L.

    2007-01-01

    Chlorella viruses or chloroviruses are large, icosahedral, plaque‐forming, double‐stranded‐DNA‐containing viruses that replicate in certain strains of the unicellular green alga Chlorella. DNA sequence analysis of the 330‐kbp genome of Paramecium bursaria chlorella virus 1 (PBCV‐1), the prototype of this virus family (Phycodnaviridae), predicts ∼366 protein‐encoding genes and 11 tRNA genes. The predicted gene products of ∼50% of these genes resemble proteins of known function, including many that are completely unexpected for a virus. In addition, the chlorella viruses have several features and encode many gene products that distinguish them from most viruses. These include: (1) multiple DNA methyltransferases and DNA site‐specific endonucleases, (2) the enzymes required to glycosylate their proteins and synthesize polysaccharides such as hyaluronan and chitin, (3) a virus‐encoded K+ channel (called Kcv) located in the internal membrane of the virions, (4) a SET domain containing protein (referred to as vSET) that dimethylates Lys27 in histone 3, and (5) three types of introns in PBCV‐1: a self‐splicing intron, a spliceosomal processed intron, and a small tRNA intron. Accumulating evidence indicates that the chlorella viruses have a very long evolutionary history. This review mainly deals with research on the virion structure, genome rearrangements, gene expression, cell wall degradation, polysaccharide synthesis, and evolution of PBCV‐1 as well as other related viruses. PMID:16877063

  15. Gene family encoding the major toxins of lethal Amanita mushrooms

    PubMed Central

    Hallen, Heather E.; Luo, Hong; Scott-Craig, John S.; Walton, Jonathan D.

    2007-01-01

    Amatoxins, the lethal constituents of poisonous mushrooms in the genus Amanita, are bicyclic octapeptides. Two genes in A. bisporigera, AMA1 and PHA1, directly encode α-amanitin, an amatoxin, and the related bicyclic heptapeptide phallacidin, a phallotoxin, indicating that these compounds are synthesized on ribosomes and not by nonribosomal peptide synthetases. α-Amanitin and phallacidin are synthesized as proproteins of 35 and 34 amino acids, respectively, from which they are predicted to be cleaved by a prolyl oligopeptidase. AMA1 and PHA1 are present in other toxic species of Amanita section Phalloidae but are absent from nontoxic species in other sections. The genomes of A. bisporigera and A. phalloides contain multiple sequences related to AMA1 and PHA1. The predicted protein products of this family of genes are characterized by a hypervariable “toxin” region capable of encoding a wide variety of peptides of 7–10 amino acids flanked by conserved sequences. Our results suggest that these fungi have a broad capacity to synthesize cyclic peptides on ribosomes. PMID:18025465

  16. RGAugury: a pipeline for genome-wide prediction of resistance gene analogs (RGAs) in plants.

    PubMed

    Li, Pingchuan; Quan, Xiande; Jia, Gaofeng; Xiao, Jin; Cloutier, Sylvie; You, Frank M

    2016-11-02

    Resistance gene analogs (RGAs), such as NBS-encoding proteins, receptor-like protein kinases (RLKs) and receptor-like proteins (RLPs), are potential R-genes that contain specific conserved domains and motifs. Thus, RGAs can be predicted based on their conserved structural features using bioinformatics tools. Computer programs have been developed for the identification of individual domains and motifs from the protein sequences of RGAs but none offer a systematic assessment of the different types of RGAs. A user-friendly and efficient pipeline is needed for large-scale genome-wide RGA predictions of the growing number of sequenced plant genomes. An integrative pipeline, named RGAugury, was developed to automate RGA prediction. The pipeline first identifies RGA-related protein domains and motifs, namely nucleotide binding site (NB-ARC), leucine rich repeat (LRR), transmembrane (TM), serine/threonine and tyrosine kinase (STTK), lysin motif (LysM), coiled-coil (CC) and Toll/Interleukin-1 receptor (TIR). RGA candidates are identified and classified into four major families based on the presence of combinations of these RGA domains and motifs: NBS-encoding, TM-CC, and membrane associated RLP and RLK. All time-consuming analyses of the pipeline are parallelized to improve performance. The pipeline was evaluated using the well-annotated Arabidopsis genome. A total of 98.5%, 85.2%, and 100% of the reported NBS-encoding genes, membrane associated RLPs and RLKs were validated, respectively. The pipeline was also successfully applied to predict RGAs for 50 sequenced plant genomes. A user-friendly web interface was implemented to ease command line operations, facilitate visualization and simplify result management for multiple datasets. RGAugury is an efficient integrative bioinformatics tool for large scale genome-wide identification of RGAs. It is freely available at Bitbucket: https://bitbucket.org/yaanlpc/rgaugury .
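
    Classifying candidates by domain combination can be illustrated with a small rule table. The sketch below is not RGAugury's code: the abstract names the domains and the four families but not the exact rules, so the combinations and their precedence used here are assumptions chosen for illustration.

```python
from typing import Set


def classify_rga(domains: Set[str]) -> str:
    """Assign a protein to one of four RGA families from its detected domains.

    Domain labels follow the abstract (NB-ARC, LRR, TM, STTK, LysM, CC, TIR).
    The rules and their order are illustrative assumptions, not the
    published pipeline's logic.
    """
    has_tm = "TM" in domains
    has_ectodomain = bool({"LRR", "LysM"} & domains)

    if "NB-ARC" in domains:
        return "NBS-encoding"
    if has_tm and has_ectodomain and "STTK" in domains:
        return "RLK"   # assumed: membrane receptor with a kinase domain
    if has_tm and has_ectodomain:
        return "RLP"   # assumed: membrane receptor without a kinase domain
    if has_tm and "CC" in domains:
        return "TM-CC"
    return "not an RGA candidate"


# Hypothetical domain calls for four proteins.
example_calls = {
    "protein_1": {"TIR", "NB-ARC", "LRR"},
    "protein_2": {"TM", "LRR", "STTK"},
    "protein_3": {"TM", "LysM"},
    "protein_4": {"TM", "CC"},
}
families = {name: classify_rga(found) for name, found in example_calls.items()}
```

    Ordering the checks from most to least specific keeps each protein in exactly one family, which is one simple way to resolve proteins that carry several of the listed domains.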

  17. Draft Map of Human Proteome Published | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    In a recently published article in the journal Nature, researchers have developed a draft map of the human proteome. Striving for the protein equivalent of the Human Genome Project, an international team of researchers has created an initial catalog of the human proteome. In total, using 30 different human tissues, the researchers identified proteins encoded by 17,294 genes, which is approximately 84 percent of all of the genes in the human genome predicted to encode proteins.

  18. Identification and Characterization of Putative Integron-Like Elements of the Heavy-Metal-Hypertolerant Strains of Pseudomonas spp.

    PubMed

    Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz

    2016-11-28

    Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas.

  19. Genome sequence of the model medicinal mushroom Ganoderma lucidum

    PubMed Central

    Chen, Shilin; Xu, Jiang; Liu, Chang; Zhu, Yingjie; Nelson, David R.; Zhou, Shiguo; Li, Chunfang; Wang, Lizhi; Guo, Xu; Sun, Yongzhen; Luo, Hongmei; Li, Ying; Song, Jingyuan; Henrissat, Bernard; Levasseur, Anthony; Qian, Jun; Li, Jianqin; Luo, Xiang; Shi, Linchun; He, Liu; Xiang, Li; Xu, Xiaolan; Niu, Yunyun; Li, Qiushi; Han, Mira V.; Yan, Haixia; Zhang, Jin; Chen, Haimei; Lv, Aiping; Wang, Zhen; Liu, Mingzhu; Schwartz, David C.; Sun, Chao

    2012-01-01

    Ganoderma lucidum is a widely used medicinal macrofungus in traditional Chinese medicine that creates a diverse set of bioactive compounds. Here we report its 43.3-Mb genome, encoding 16,113 predicted genes, obtained using next-generation sequencing and optical mapping approaches. The sequence analysis reveals an impressive array of genes encoding cytochrome P450s (CYPs), transporters and regulatory proteins that cooperate in secondary metabolism. The genome also encodes one of the richest sets of wood degradation enzymes among all of the sequenced basidiomycetes. In all, 24 physical CYP gene clusters are identified. Moreover, 78 CYP genes are coexpressed with lanosterol synthase, and 16 of these show high similarity to fungal CYPs that specifically hydroxylate testosterone, suggesting their possible roles in triterpenoid biosynthesis. The elucidation of the G. lucidum genome makes this organism a potential model system for the study of secondary metabolic pathways and their regulation in medicinal fungi. PMID:22735441

  20. Export of Extracellular Polysaccharides Modulates Adherence of the Cyanobacterium Synechocystis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fisher, ML; Allen, R; Luo, YQ

    2013-09-10

    The field of cyanobacterial biofuel production is advancing rapidly, yet we know little of the basic biology of these organisms outside of their photosynthetic pathways. We aimed to gain a greater understanding of how the cyanobacterium Synechocystis PCC 6803 (Synechocystis, hereafter) modulates its cell surface. Such understanding will allow for the creation of mutants that autoflocculate in a regulated way, thus avoiding energy intensive centrifugation in the creation of biofuels. We constructed mutant strains lacking genes predicted to function in carbohydrate transport or synthesis. Strains with gene deletions of slr0977 (predicted to encode a permease component of an ABC transporter), slr0982 (predicted to encode an ATP binding component of an ABC transporter) and slr1610 (predicted to encode a methyltransferase) demonstrated flocculent phenotypes and increased adherence to glass. Upon bioinformatic inspection, the gene products of slr0977, slr0982, and slr1610 appear to function in O-antigen (OAg) transport and synthesis. However, the analysis provided here demonstrated no differences between OAg purified from wild-type and mutants. However, exopolysaccharides (EPS) purified from mutants were altered in composition when compared to wild-type. Our data suggest that there are multiple means to modulate the cell surface of Synechocystis by disrupting different combinations of ABC transporters and/or glycosyl transferases. Further understanding of these mechanisms may allow for the development of industrially and ecologically useful strains of cyanobacteria. Additionally, these data imply that many cyanobacterial gene products may possess as-yet undiscovered functions, and are meritorious of further study.

  1. Bioinformatics analysis and characteristics of VP23 encoded by the newly identified UL18 gene of duck enteritis virus

    NASA Astrophysics Data System (ADS)

    Chen, Xiwen; Cheng, Anchun; Wang, Mingshu; Xiang, Jun

    2011-10-01

    In this study, the structures and functions of VP23 encoded by the newly identified DEV UL18 gene were predicted using bioinformatics software and tools. The DEV UL18 gene was predicted to encode a polypeptide of 322 amino acids, termed VP23, with a putative molecular mass of 35.250 kDa and a predicted isoelectric point (pI) of 8.37, and no signal peptide or transmembrane domain. The subcellular localization prediction placed DEV-VP23 in the endoplasmic reticulum (33.3%), mitochondria (22.2%), extracellular space including the cell wall (11.1%), vesicles of the secretory system (11.1%), Golgi (11.1%), and plasma membrane (11.1%). Analysis of the amino acid sequence showed that the potential antigenic epitopes are situated at amino acids 45-47, 53-60, 102-105, 173-180, 185-189, 260-265, 267-271, and 292-299. These results provide some insights for further research on DEV-VP23, provide a foundation for further study of new types of clinical diagnosis of DEV, and can be used for the development of new DEV vaccines.

  2. Isolation of pheromone precursor genes of Magnaporthe grisea.

    PubMed

    Shen, W C; Bobrowicz, P; Ebbole, D J

    1999-01-01

    In heterothallic ascomycetes one mating partner serves as the source of female tissue and is fertilized with spermatia from a partner of the opposite mating type. The role of pheromone signaling in mating is thought to involve recognition of cells of the opposite mating type. We have isolated two putative pheromone precursor genes of Magnaporthe grisea. The genes are present in both mating types of the fungus but they are expressed in a mating type-specific manner. The MF1-1 gene, expressed in Mat1-1 strains, is predicted to encode a 26-amino-acid polypeptide that is processed to produce a lipopeptide pheromone. The MF2-1 gene, expressed in Mat1-2 strains, is predicted to encode a precursor polypeptide that is processed by a Kex2-like protease to yield a pheromone with striking similarity to the predicted pheromone sequence of a close relative, Cryphonectria parasitica. Expression of the M. grisea putative pheromone precursor genes was observed under defined nutritional conditions and in field isolates. This suggests that the requirement for complex media for mating and the poor fertility of field isolates may not be due to limitation of pheromone precursor gene expression. Detection of putative pheromone precursor gene mRNA in conidia suggests that pheromones may be important for the fertility of conidia acting as spermatia. Copyright 1999 Academic Press.

  3. Comparative genomics of Ceriporiopsis subvermispora and Phanerochaete chrysosporium provide insight into selective ligninolysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fernandez-Fueyo, Elena; Ruiz-Duenas, Francisco J.; Ferreira, Patricia

    Efficient lignin depolymerization is unique to the wood decay basidiomycetes, collectively referred to as white rot fungi. Phanerochaete chrysosporium simultaneously degrades lignin and cellulose, whereas the closely related species, Ceriporiopsis subvermispora, also depolymerizes lignin but may do so with relatively little cellulose degradation. To investigate the basis for selective ligninolysis, we conducted comparative genome analysis of C. subvermispora and P. chrysosporium. Genes encoding manganese peroxidase numbered 13 and five in C. subvermispora and P. chrysosporium, respectively. In addition, the C. subvermispora genome contains at least seven genes predicted to encode laccases, whereas the P. chrysosporium genome contains none. We also observed expansion of the number of C. subvermispora desaturase-encoding genes putatively involved in lipid metabolism. Microarray-based transcriptome analysis showed substantial up-regulation of several desaturase and MnP genes in wood-containing medium. MS identified MnP proteins in C. subvermispora culture filtrates, but none in P. chrysosporium cultures. These results support the importance of MnP and a lignin degradation mechanism whereby cleavage of the dominant nonphenolic structures is mediated by lipid peroxidation products. Two C. subvermispora genes were predicted to encode peroxidases structurally similar to P. chrysosporium lignin peroxidase and, following heterologous expression in Escherichia coli, the enzymes were shown to oxidize high redox potential substrates, but not Mn2+. Apart from oxidative lignin degradation, we also examined cellulolytic and hemicellulolytic systems in both fungi. In summary, the C. subvermispora genetic inventory and expression patterns exhibit increased oxidoreductase potential and diminished cellulolytic capability relative to P. chrysosporium.

  4. Comparative genomics of Ceriporiopsis subvermispora and Phanerochaete chrysosporium provide insight into selective ligninolysis

    PubMed Central

    Fernandez-Fueyo, Elena; Ruiz-Dueñas, Francisco J.; Ferreira, Patricia; Floudas, Dimitrios; Hibbett, David S.; Canessa, Paulo; Larrondo, Luis F.; James, Tim Y.; Seelenfreund, Daniela; Lobos, Sergio; Polanco, Rubén; Tello, Mario; Honda, Yoichi; Watanabe, Takahito; Watanabe, Takashi; Ryu, Jae San; Kubicek, Christian P.; Schmoll, Monika; Gaskell, Jill; Hammel, Kenneth E.; St. John, Franz J.; Vanden Wymelenberg, Amber; Sabat, Grzegorz; Splinter BonDurant, Sandra; Syed, Khajamohiddin; Yadav, Jagjit S.; Doddapaneni, Harshavardhan; Subramanian, Venkataramanan; Lavín, José L.; Oguiza, José A.; Perez, Gumer; Pisabarro, Antonio G.; Ramirez, Lucia; Santoyo, Francisco; Master, Emma; Coutinho, Pedro M.; Henrissat, Bernard; Lombard, Vincent; Magnuson, Jon Karl; Kües, Ursula; Hori, Chiaki; Igarashi, Kiyohiko; Samejima, Masahiro; Held, Benjamin W.; Barry, Kerrie W.; LaButti, Kurt M.; Lapidus, Alla; Lindquist, Erika A.; Lucas, Susan M.; Riley, Robert; Salamov, Asaf A.; Hoffmeister, Dirk; Schwenk, Daniel; Hadar, Yitzhak; Yarden, Oded; de Vries, Ronald P.; Wiebenga, Ad; Stenlid, Jan; Eastwood, Daniel; Grigoriev, Igor V.; Berka, Randy M.; Blanchette, Robert A.; Kersten, Phil; Martinez, Angel T.; Vicuna, Rafael; Cullen, Dan

    2012-01-01

    Efficient lignin depolymerization is unique to the wood decay basidiomycetes, collectively referred to as white rot fungi. Phanerochaete chrysosporium simultaneously degrades lignin and cellulose, whereas the closely related species, Ceriporiopsis subvermispora, also depolymerizes lignin but may do so with relatively little cellulose degradation. To investigate the basis for selective ligninolysis, we conducted comparative genome analysis of C. subvermispora and P. chrysosporium. Genes encoding manganese peroxidase numbered 13 and five in C. subvermispora and P. chrysosporium, respectively. In addition, the C. subvermispora genome contains at least seven genes predicted to encode laccases, whereas the P. chrysosporium genome contains none. We also observed expansion of the number of C. subvermispora desaturase-encoding genes putatively involved in lipid metabolism. Microarray-based transcriptome analysis showed substantial up-regulation of several desaturase and MnP genes in wood-containing medium. MS identified MnP proteins in C. subvermispora culture filtrates, but none in P. chrysosporium cultures. These results support the importance of MnP and a lignin degradation mechanism whereby cleavage of the dominant nonphenolic structures is mediated by lipid peroxidation products. Two C. subvermispora genes were predicted to encode peroxidases structurally similar to P. chrysosporium lignin peroxidase and, following heterologous expression in Escherichia coli, the enzymes were shown to oxidize high redox potential substrates, but not Mn2+. Apart from oxidative lignin degradation, we also examined cellulolytic and hemicellulolytic systems in both fungi. In summary, the C. subvermispora genetic inventory and expression patterns exhibit increased oxidoreductase potential and diminished cellulolytic capability relative to P. chrysosporium. PMID:22434909

  5. Discovery of new enzymes and metabolic pathways by using structure and genome context.

    PubMed

    Zhao, Suwen; Kumar, Ritesh; Sakai, Ayano; Vetting, Matthew W; Wood, B McKay; Brown, Shoshana; Bonanno, Jeffery B; Hillerich, Brandan S; Seidel, Ronald D; Babbitt, Patricia C; Almo, Steven C; Sweedler, Jonathan V; Gerlt, John A; Cronan, John E; Jacobson, Matthew P

    2013-10-31

    Assigning valid functions to proteins identified in genome projects is challenging: overprediction and database annotation errors are the principal concerns. We and others are developing computation-guided strategies for functional discovery with 'metabolite docking' to experimentally derived or homology-based three-dimensional structures. Bacterial metabolic pathways often are encoded by 'genome neighbourhoods' (gene clusters and/or operons), which can provide important clues for functional assignment. We recently demonstrated the synergy of docking and pathway context by 'predicting' the intermediates in the glycolytic pathway in Escherichia coli. Metabolite docking to multiple binding proteins and enzymes in the same pathway increases the reliability of in silico predictions of substrate specificities because the pathway intermediates are structurally similar. Here we report that structure-guided approaches for predicting the substrate specificities of several enzymes encoded by a bacterial gene cluster allowed the correct prediction of the in vitro activity of a structurally characterized enzyme of unknown function (PDB 2PMQ), 2-epimerization of trans-4-hydroxy-L-proline betaine (tHyp-B) and cis-4-hydroxy-D-proline betaine (cHyp-B), and also the correct identification of the catabolic pathway in which Hyp-B 2-epimerase participates. The substrate-liganded pose predicted by virtual library screening (docking) was confirmed experimentally. The enzymatic activities in the predicted pathway were confirmed by in vitro assays and genetic analyses; the intermediates were identified by metabolomics; and repression of the genes encoding the pathway by high salt concentrations was established by transcriptomics, confirming the osmolyte role of tHyp-B. This study establishes the utility of structure-guided functional predictions to enable the discovery of new metabolic pathways.

  6. Identification of functional elements and regulatory circuits by Drosophila modENCODE

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.

    2010-12-22

    To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- and tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology. 
The functions of approximately 40% of the protein and nonprotein-coding genes [FlyBase 5.12 (4)] have been determined from cDNA collections (5, 6), manual curation of gene models (7), gene mutations and comprehensive genome-wide RNA interference screens (8-10), and comparative genomic analyses (11, 12). The Drosophila modENCODE project has generated more than 700 data sets that profile transcripts, histone modifications and physical nucleosome properties, general and specific transcription factors (TFs), and replication programs in cell lines, isolated tissues, and whole organisms across several developmental stages (Fig. 1). Here, we computationally integrate these data sets and report (i) improved and additional genome annotations, including full-length protein-coding genes and peptides as short as 21 amino acids; (ii) noncoding transcripts, including 132 candidate structural RNAs and 1608 nonstructural transcripts; (iii) additional Argonaute (Ago)-associated small RNA genes and pathways, including new microRNAs (miRNAs) encoded within protein-coding exons and endogenous small interfering RNAs (siRNAs) from 3′ untranslated regions; (iv) chromatin 'states' defined by combinatorial patterns of 18 chromatin marks that are associated with distinct functions and properties; (v) regions of high TF occupancy and replication activity with likely epigenetic regulation; (vi) mixed TF and miRNA regulatory networks with hierarchical structure and enriched feed-forward loops; (vii) coexpression- and co-regulation-based functional annotations for nearly 3000 genes; (viii) stage- and tissue-specific regulators; and (ix) predictive models of gene expression levels and regulator function.

  7. Compositional profile of α/β-hydrolase fold proteins in mangrove soil metagenomes: prevalence of epoxide hydrolases and haloalkane dehalogenases in oil-contaminated sites

    PubMed Central

    Jiménez, Diego Javier; Dini-Andreote, Francisco; Ottoni, Júlia Ronzella; de Oliveira, Valéria Maia; van Elsas, Jan Dirk; Andreote, Fernando Dini

    2015-01-01

    The occurrence of genes encoding biotechnologically relevant α/β-hydrolases in mangrove soil microbial communities was assessed using data obtained by whole-metagenome sequencing of four mangrove areas, denoted BrMgv01 to BrMgv04, in São Paulo, Brazil. The sequences (215 Mb in total) were filtered based on local amino acid alignments against the Lipase Engineering Database. In total, 5923 unassembled sequences were affiliated with 30 different α/β-hydrolase fold superfamilies. The most abundant predicted proteins encompassed cytosolic hydrolases (abH08; ∼ 23%), microsomal hydrolases (abH09; ∼ 12%) and Moraxella lipase-like proteins (abH04 and abH01; < 5%). Detailed analysis of the genes predicted to encode proteins of the abH08 superfamily revealed a high proportion related to epoxide hydrolases and haloalkane dehalogenases in polluted mangroves BrMgv01-02-03. This suggested selection and putative involvement in local degradation/detoxification of the pollutants. Seven sequences that were annotated as genes for putative epoxide hydrolases and five for putative haloalkane dehalogenases were found in a fosmid library generated from BrMgv02 DNA. The latter enzymes were predicted to belong to Actinobacteria, Deinococcus-Thermus, Planctomycetes and Proteobacteria. Our integrated approach thus identified 12 genes (complete and/or partial) that may encode hitherto undescribed enzymes. The low amino acid identity (< 60%) with already-described genes opens perspectives for both production in an expression host and genetic screening of metagenomes. PMID:25171437

  8. Identification of potential platelet alloantigens in the Equidae family by comparison of gene sequences encoding major platelet membrane glycoproteins.

    PubMed

    Boudreaux, Mary K; Humphries, Drew M

    2013-12-01

    Platelet alloantigens in horses may play an important role in the development of neonatal alloimmune thrombocytopenia (NAIT). The objective of this study was to evaluate genes encoding major platelet glycoproteins within the Equidae family in an effort to identify potential alloantigens. DNA was isolated from blood samples obtained from Equidae family members, including a Holsteiner-Oldenburg cross, a Quarter horse, a donkey, and a Plains zebra (Equus burchelli). Gene sequences encoding equine platelet membrane glycoproteins IIb, IIIa (integrin subunits αIIb and β3), Ia (integrin subunit α2), and Ibα were determined using PCR. Gene sequences were compared to the equine genome available on GenBank. Polymorphisms that would be predicted to result in amino acid changes on platelet surfaces were documented and compared with known alloantigenic sites documented on human platelets. Amino acid differences were predicted based on nucleotide sequences for all 4 genes. Nine differences were documented for αIIb, 5 differences were documented for β3, 7 differences were documented for α2, and 16 differences were documented for Ibα outside the macroglycopeptide region. This study represents the first effort at identifying potential platelet alloantigens in members of the Equidae Family based on evaluation of gene sequences. The data obtained form the groundwork for identifying potential platelet alloantigens involved in transfusion reactions and neonatal alloimmune thrombocytopenia (NAIT). More work is required to determine whether the predicted amino acid differences documented in this study play a role in alloimmunity, and whether other polymorphisms not detected in this study are present that may result in alloimmunity. © 2013 American Society for Veterinary Clinical Pathology.

  9. Molecular Cloning and Characterization of cDNA Encoding a Putative Stress-Induced Heat-Shock Protein from Camelus dromedarius

    PubMed Central

    Elrobh, Mohamed S.; Alanazi, Mohammad S.; Khan, Wajahatullah; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Bazzi, Mohammad D.

    2011-01-01

    Heat shock proteins are ubiquitous, induced under a number of environmental and metabolic stresses, with highly conserved DNA sequences among mammalian species. Camelus dromedarius (the Arabian camel), domesticated under semi-desert environments, is well adapted to tolerate and survive severe drought and high temperatures for extended periods. This is the first report of molecular cloning and characterization of a full-length cDNA encoding a putative stress-induced heat shock HSPA6 protein (also called HSP70B′) from the Arabian camel. A full-length cDNA (2417 bp) was obtained by rapid amplification of cDNA ends (RACE) and cloned in the pET-b expression vector. Sequence analysis of the HSPA6 gene showed a 1932-bp open reading frame encoding 643 amino acids. The complete cDNA sequence of the Arabian camel HSPA6 gene was submitted to NCBI GenBank (accession number HQ214118.1). BLAST analysis indicated that the C. dromedarius HSPA6 gene shared high nucleotide similarity (77–91%) with heat shock genes of other mammals. The deduced 643-amino-acid sequence (accession number ADO12067.1) showed that the predicted protein has an estimated molecular weight of 70.5 kDa with a predicted isoelectric point (pI) of 6.0. Comparative analyses of the camel HSPA6 protein sequence with other mammalian heat shock proteins (HSPs) showed high identity (80–94%). The camel HSPA6 protein structure predicted by protein 3D structural analysis showed high similarity to human and mouse HSPs. Taken together, this study indicates that the cDNA sequence of the HSPA6 gene and its amino acid sequence and protein structure from the Arabian camel are highly conserved and similar to those of other mammalian species. PMID:21845074
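
    The figures quoted in this abstract can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the study: it assumes the reported 1932-bp open reading frame includes the terminal stop codon, and the function and variable names are hypothetical.

```python
# Illustrative consistency check of the numbers quoted in the abstract.
# Assumption (not stated in the abstract): the ORF length includes the stop codon.
ORF_LENGTH_BP = 1932      # open reading frame length reported in the abstract
PROTEIN_MASS_KDA = 70.5   # estimated molecular weight reported in the abstract


def residues_from_orf(orf_bp: int) -> int:
    """Return the number of amino acids encoded by an ORF ending in a stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_bp // 3 - 1  # the last codon is the stop codon


n_residues = residues_from_orf(ORF_LENGTH_BP)
mean_residue_mass_da = PROTEIN_MASS_KDA * 1000 / n_residues

print(n_residues)                      # number of encoded amino acids
print(round(mean_residue_mass_da, 1))  # average mass per residue in Da
```

    Under that assumption the codon count agrees with the 643 amino acids reported, and the implied mean residue mass is close to the roughly 110 Da per residue commonly used as a rule of thumb for proteins.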

  10. How to kill the honey bee larva: genomic potential and virulence mechanisms of Paenibacillus larvae.

    PubMed

    Djukic, Marvin; Brzuszkiewicz, Elzbieta; Fünfhaus, Anne; Voss, Jörn; Gollnow, Kathleen; Poppinga, Lena; Liesegang, Heiko; Garcia-Gonzalez, Eva; Genersch, Elke; Daniel, Rolf

    2014-01-01

    Paenibacillus larvae, a Gram-positive bacterial pathogen, causes American Foulbrood (AFB), which is the most serious infectious disease of honey bees. In order to investigate the genomic potential of P. larvae, two strains belonging to two different genotypes were sequenced and used for comparative genome analysis. The complete genome sequence of P. larvae strain DSM 25430 (genotype ERIC II) consisted of 4,056,006 bp and harbored 3,928 predicted protein-encoding genes. The draft genome sequence of P. larvae strain DSM 25719 (genotype ERIC I) comprised 4,579,589 bp and contained 4,868 protein-encoding genes. Both strains harbored a 9.7 kb plasmid and encoded a large number of virulence-associated proteins such as toxins and collagenases. In addition, genes encoding large multimodular enzymes producing nonribosomal peptides or polyketides were identified. In the genome of strain DSM 25719, seven toxin-associated loci were identified and analyzed. Five of them encoded putatively functional toxins. The genome of strain DSM 25430 harbored several toxin loci that showed similarity to corresponding loci in the genome of strain DSM 25719, but were non-functional due to point mutations or disruption by transposases. Although both strains cause AFB, significant differences between the genomes were observed including genome size, number and composition of transposases, insertion elements, predicted phage regions, and strain-specific island-like regions. Transposases, integrases and recombinases are important drivers for genome plasticity. A total of 390 and 273 mobile elements were found in strain DSM 25430 and strain DSM 25719, respectively. Comparative genomics of both strains revealed acquisition of virulence factors by horizontal gene transfer and provided insights into evolution and pathogenicity.

  11. Transcriptome analysis of Aspergillus niger grown on sugarcane bagasse

    PubMed Central

    2011-01-01

    Background: Considering that the costs of cellulases and hemicellulases contribute substantially to the price of bioethanol, new studies aimed at understanding and improving cellulase efficiency and productivity are of paramount importance. Aspergillus niger has been shown to produce a wide spectrum of polysaccharide hydrolytic enzymes. To understand how to improve enzymatic cocktails that can hydrolyze pretreated sugarcane bagasse, we used a genomics approach to investigate which genes and pathways are transcriptionally modulated during growth of A. niger on steam-exploded sugarcane bagasse (SEB). Results: Herein we report the main cellulase- and hemicellulase-encoding genes with increased expression during growth on SEB. We also sought to determine whether the mRNA accumulation of several SEB-induced genes encoding putative transporters is induced by xylose and dependent on glucose. We identified 18 (58% of A. niger predicted cellulases) and 21 (58% of A. niger predicted hemicellulases) cellulase- and hemicellulase-encoding genes, respectively, that were highly expressed during growth on SEB. Conclusions: Degradation of sugarcane bagasse requires production of many different enzymes which are regulated by the type and complexity of the available substrate. Our presently reported work opens new possibilities for understanding sugarcane biomass saccharification by A. niger hydrolases and for the construction of more efficient enzymatic cocktails for second-generation bioethanol. PMID:22008461

  12. Transcriptional response of Leptospira interrogans to iron limitation and characterization of a PerR homolog.

    PubMed

    Lo, Miranda; Murray, Gerald L; Khoo, Chen Ai; Haake, David A; Zuerner, Richard L; Adler, Ben

    2010-11-01

    Leptospirosis is a globally significant zoonosis caused by Leptospira spp. Iron is essential for growth of most bacterial species. Since iron availability is low in the host, pathogens have evolved complex iron acquisition mechanisms to survive and establish infection. In many bacteria, expression of iron uptake and storage proteins is regulated by Fur. L. interrogans encodes four predicted Fur homologs; we have constructed a mutation in one of these, la1857. We conducted microarray analysis to identify iron-responsive genes and to study the effects of la1857 mutation on gene expression. Under iron-limiting conditions, 43 genes were upregulated and 49 genes were downregulated in the wild type. Genes encoding proteins with predicted involvement in inorganic ion transport and metabolism (including TonB-dependent proteins and outer membrane transport proteins) were overrepresented in the upregulated list, while 54% of differentially expressed genes had no known function. There were 16 upregulated genes of unknown function which are absent from the saprophyte L. biflexa and which therefore may encode virulence-associated factors. Expression of iron-responsive genes was not significantly affected by mutagenesis of la1857, indicating that LA1857 is not a global regulator of iron homeostasis. Upregulation of heme biosynthetic genes and a putative catalase in the mutant suggested that LA1857 is more similar to PerR, a regulator of the oxidative stress response. Indeed, the la1857 mutant was more resistant to peroxide stress than the wild type. Our results provide insights into the role of iron in leptospiral metabolism and regulation of the oxidative stress response, including genes likely to be important for virulence.

  13. Analysis of gene expression provides insights into the mechanism of cadmium tolerance in Acidithiobacillus ferrooxidans.

    PubMed

    Chen, Minjie; Li, Yanjun; Zhang, Li; Wang, Jianying; Zheng, Chunli; Zhang, Xuefeng

    2015-02-01

    Acidithiobacillus ferrooxidans plays a critical role in metal solubilization in the biomining industry, and occupies an ecological niche characterized by high acidity and high concentrations of toxic heavy metal ions. In order to investigate the possible metal resistance mechanism, the cellular distribution of cadmium was tested. The result indicated that Cd(2+) entered the cells upon initial exposure resulting in increased intracellular concentrations, followed by its excretion from the cells during subsequent growth and adaptation. Sequence homology analyses were used to identify 10 genes predicted to participate in heavy metal homeostasis, and the expression of these genes was investigated in cells cultured in the presence of increasing concentrations of toxic divalent cadmium (Cd(2+)). The results suggested that one gene (cmtR(A.f)) encoded a putative Cd(2+)/Pb(2+)-responsive transcriptional regulator; four genes (czcA1(A.f), czcA2(A.f), czcB1(A.f), and czcC1(A.f)) encoded heavy metal efflux proteins for Cd(2+); two genes (cadA1(A.f) and cadB1(A.f)) encoded putative cation channel proteins related to the transport of Cd(2+). No significant enhancement of gene expression was observed at low concentrations of Cd(2+) (5 mM) and most of the putative metal resistance genes were up-regulated except cmtR(A.f), cadB3(A.f), and czcB1(A.f) at higher concentrations (15 and 30 mM) according to real-time polymerase chain reaction. A model was developed for the mechanism of resistance to cadmium ions based on homology analyses of the predicted genes, the transcription of putative Cd(2+) resistance genes, and previous work.

  14. Identification and characterization of an early gene in the Lymantria dispar multinucleocapsid nuclear polyhedrosis virus

    Treesearch

    David S. Bischoff; James M. Slavicek

    1995-01-01

    The Lymantria dispar multinucleocapsid nuclear polyhedrosis virus (LdMNPV) gene encoding G22 was cloned and sequenced. The G22 gene codes for a 191 amino acid protein with a predicted Mr of 22000. Expression of G22 in a rabbit reticulocyte system generated a protein with an M...

  15. Organization of the hao gene cluster of Nitrosomonas europaea: genes for two tetraheme c cytochromes.

    PubMed

    Bergmann, D J; Arciero, D M; Hooper, A B

    1994-06-01

    The organization of genes for three proteins involved in ammonia oxidation in Nitrosomonas europaea has been investigated. The amino acid sequence of the N-terminal region and four heme-containing peptides produced by proteolysis of the tetraheme cytochrome c554 of N. europaea were determined by Edman degradation. The gene (cycA) encoding this cytochrome is present in three copies per genome (H. McTavish, F. LaQuier, D. Arciero, M. Logan, G. Mundfrom, J.A. Fuchs, and A. B. Hooper, J. Bacteriol. 175:2445-2447, 1993). Three clones, representing at least two copies of cycA, were isolated and sequenced by the dideoxy-chain termination procedure. In both copies, the sequences of 211 amino acids derived from the gene sequence are identical and include all amino acids predicted by the proteolytic peptides. In two copies, the cycA open reading frame (ORF) is followed closely (three bases in one copy) by a second ORF predicted to encode a 28-kDa tetraheme c cytochrome not previously characterized but similar to the nirT gene product of Pseudomonas stutzeri. In one copy of the cycA gene cluster, the second ORF is absent.

  16. A putative regulatory genetic locus modulates virulence in the pathogen Leptospira interrogans.

    PubMed

    Eshghi, Azad; Becam, Jérôme; Lambert, Ambroise; Sismeiro, Odile; Dillies, Marie-Agnès; Jagla, Bernd; Wunder, Elsio A; Ko, Albert I; Coppee, Jean-Yves; Goarant, Cyrille; Picardeau, Mathieu

    2014-06-01

    Limited research has been conducted on the role of transcriptional regulators in relation to virulence in Leptospira interrogans, the etiological agent of leptospirosis. Here, we identify an L. interrogans locus that encodes a sensor protein, an anti-sigma factor antagonist, and two genes encoding proteins of unknown function. Transposon insertion into the gene encoding the sensor protein led to dampened transcription of the other 3 genes in this locus. This lb139 insertion mutant (the lb139(-) mutant) displayed attenuated virulence in the hamster model of infection and reduced motility in vitro. Whole-transcriptome analyses using RNA sequencing revealed the downregulation of 115 genes and the upregulation of 28 genes, with an overrepresentation of gene products functioning in motility and signal transduction and numerous gene products with unknown functions, predicted to be localized to the extracellular space. Another significant finding encompassed suppressed expression of the majority of the genes previously demonstrated to be upregulated at physiological osmolarity, including the sphingomyelinase C precursor Sph2 and LigB. We provide insight into a possible requirement for transcriptional regulation as it relates to leptospiral virulence and suggest various biological processes that are affected due to the loss of native expression of this genetic locus.

  17. The genome of Brucella melitensis.

    PubMed

    DelVecchio, Vito G; Kapatral, Vinayak; Elzer, Philip; Patra, Guy; Mujer, Cesar V

    2002-12-20

    The genome of Brucella melitensis strain 16M was sequenced and contained 3,294,931 bp distributed over two circular chromosomes. Chromosome I was composed of 2,117,144 bp and chromosome II of 1,177,787 bp. A total of 3,198 ORFs were predicted. The origins of replication of the chromosomes are similar to each other and to those of other alpha-proteobacteria. Housekeeping genes such as those that encode DNA replication, protein synthesis, core metabolism, and cell-wall biosynthesis functions were found on both chromosomes. Genes encoding adhesins, invasins, and hemolysins were also identified.
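
    As a quick sanity check on the sizes quoted above, the two chromosome lengths can be summed and an average spacing of predicted ORFs derived. This is a minimal sketch using only the numbers in the abstract; the variable names are hypothetical, and the per-ORF figure is a crude average rather than a measure of actual gene length or coding density.

```python
# Numbers taken directly from the abstract; names are illustrative only.
CHROMOSOME_I_BP = 2_117_144
CHROMOSOME_II_BP = 1_177_787
PREDICTED_ORFS = 3_198

# Total genome size is the sum of the two circular chromosomes.
genome_bp = CHROMOSOME_I_BP + CHROMOSOME_II_BP

# Crude average: base pairs of genome per predicted ORF.
bp_per_orf = genome_bp / PREDICTED_ORFS

print(genome_bp)
print(round(bp_per_orf))
```

    The sum matches the reported total of 3,294,931 bp, and the average works out to roughly one predicted ORF per kilobase.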

  18. Cloning and bioinformatic analysis of lovastatin biosynthesis regulatory gene lovE.

    PubMed

    Huang, Xin; Li, Hao-ming

    2009-08-05

    Lovastatin is an effective drug for treatment of hyperlipidemia. This study aimed to clone the lovastatin biosynthesis regulatory gene lovE and analyze the structure and function of its encoded protein. According to the lovastatin synthase gene sequence from GenBank, primers were designed to amplify and clone the lovastatin biosynthesis regulatory gene lovE from Aspergillus terreus genomic DNA. Bioinformatic analysis of lovE and its encoded amino acid sequence was performed through internet resources and software like DNAMAN. The target fragment lovE, almost 1500 bp in length, was amplified from Aspergillus terreus genomic DNA and the secondary and three-dimensional structures of the LovE protein were predicted. In the lovastatin biosynthesis process lovE is a regulatory gene and LovE protein is a GAL4-like transcriptional factor.

  19. [Cloning, mutagenesis and symbiotic phenotype of three lipid transfer protein encoding genes from Mesorhizobium huakuii 7653R].

    PubMed

    Li, Yanan; Zeng, Xiaobo; Zhou, Xuejuan; Li, Youguo

    2016-12-04

    The lipid transfer protein superfamily is involved in lipid transport and metabolism. This study aimed to construct mutants of three lipid transfer protein encoding genes in Mesorhizobium huakuii 7653R, and to study the phenotypes and function of the mutations during symbiosis with Astragalus sinicus. We used bioinformatics to predict structural characteristics and biological functions of lipid transfer proteins, and conducted semi-quantitative and fluorescent quantitative real-time PCR to analyze the expression levels of target genes in free-living and symbiotic conditions. Using pK19mob insertion mutagenesis to construct mutants, we carried out pot plant experiments to observe symbiotic phenotypes. The proteins encoded by the MCHK-5577, MCHK-2172 and MCHK-2779 genes belonged to the START/RHO alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) superfamily, were involved in lipid transport or metabolism, and were 95% identical to those of M. loti. The relative transcription levels of all three genes increased under symbiotic conditions compared with free-living conditions. We obtained three mutants. Compared with wild-type 7653R, the above-ground biomass of plants and the nodule nitrogenase activity induced by the three mutants decreased significantly. Results indicated that lipid transfer protein encoding genes of Mesorhizobium huakuii 7653R may play important roles in symbiotic nitrogen fixation, and the mutations significantly affected the symbiotic phenotypes. The present work provides a basis for further study of the symbiotic function mechanism associated with lipid transfer proteins from rhizobia.

  20. Evidence for the bacterial origin of genes encoding fermentation enzymes of the amitochondriate protozoan parasite Entamoeba histolytica.

    PubMed

    Rosenthal, B; Mai, Z; Caplivski, D; Ghosh, S; de la Vega, H; Graf, T; Samuelson, J

    1997-06-01

    Entamoeba histolytica is an amitochondriate protozoan parasite with numerous bacterium-like fermentation enzymes including the pyruvate:ferredoxin oxidoreductase (POR), ferredoxin (FD), and alcohol dehydrogenase E (ADHE). The goal of this study was to determine whether the genes encoding these cytosolic E. histolytica fermentation enzymes might derive from a bacterium by horizontal transfer, as has previously been suggested for E. histolytica genes encoding heat shock protein 60, nicotinamide nucleotide transhydrogenase, and superoxide dismutase. In this study, the E. histolytica por gene and the adhE gene of a second amitochondriate protozoan parasite, Giardia lamblia, were sequenced, and their phylogenetic positions were estimated in relation to POR, ADHE, and FD cloned from eukaryotic and eubacterial organisms. The E. histolytica por gene encodes a 1,620-amino-acid peptide that contained conserved iron-sulfur- and thiamine pyrophosphate-binding sites. The predicted E. histolytica POR showed fewer positional identities to the POR of G. lamblia (34%) than to the POR of the enterobacterium Klebsiella pneumoniae (49%), the cyanobacterium Anabaena sp. (44%), and the protozoan Trichomonas vaginalis (46%), which targets its POR to anaerobic organelles called hydrogenosomes. Maximum-likelihood, neighbor-joining, and parsimony analyses also suggested as less likely E. histolytica POR sharing more recent common ancestry with G. lamblia POR than with POR of bacteria and the T. vaginalis hydrogenosome. The G. lamblia adhE encodes an 888-amino-acid fusion peptide with an aldehyde dehydrogenase at its amino half and an iron-dependent (class 3) ADH at its carboxy half. The predicted G. lamblia ADHE showed extensive positional identities to ADHE of Escherichia coli (49%), Clostridium acetobutylicum (44%), and E. histolytica (43%) and lesser identities to the class 3 ADH of eubacteria and yeast (19 to 36%). Phylogenetic analyses inferred a closer relationship of the E. 
histolytica ADHE to bacterial ADHE than to the G. lamblia ADHE. The 6-kDa FD of E. histolytica and G. lamblia were most similar to those of the archaebacterium Methanosarcina barkeri and the delta-purple bacterium Desulfovibrio desulfuricans, respectively, while the 12-kDa FD of the T. vaginalis hydrogenosome was most similar to the 12-kDa FD of gamma-purple bacterium Pseudomonas putida. E. histolytica genes (and probably G. lamblia genes) encoding fermentation enzymes therefore likely derive from bacteria by horizontal transfer, although it is not clear from which bacteria these amebic genes derive. These are the first nonorganellar fermentation enzymes of eukaryotes implicated to have derived from bacteria.

  1. Molecular and functional characterization of novel fructosyltransferases and invertases from Agave tequilana.

    PubMed

    Cortés-Romero, Celso; Martínez-Hernández, Aída; Mellado-Mojica, Erika; López, Mercedes G; Simpson, June

    2012-01-01

    Fructans are the main storage polysaccharides found in Agave species. The synthesis of these complex carbohydrates relies on the activities of specific fructosyltransferase enzymes closely related to the hydrolytic invertases. Analysis of Agave tequilana transcriptome data led to the identification of ESTs encoding putative fructosyltransferases and invertases. Based on sequence alignments and structure/function relationships, two different genes were predicted to encode 1-SST and 6G-FFT type fructosyltransferases; in addition, 4 genes encoding putative cell wall invertases and 4 genes encoding putative vacuolar invertases were identified. Probable functions for each gene were assigned based on conserved amino acid sequences and confirmed for 2 fructosyltransferases and one invertase by analyzing the enzymatic activity of recombinant Agave proteins expressed and purified from Pichia pastoris. The genome organization of the fructosyltransferase/invertase genes, for which the corresponding cDNA contained the complete open reading frame, was found to be well conserved since all genes were shown to carry a 9 bp mini-exon and all showed a similar structure of 8 exons/7 introns with the exception of a cell wall invertase gene which has 7 exons and 6 introns. Fructosyltransferase genes were strongly expressed in the storage organs of the plants, especially in vegetative stages of development and to lower levels in photosynthetic tissues, in contrast to the invertase genes where higher levels of expression were observed in leaf tissues and in mature plants.

  2. Molecular and Functional Characterization of Novel Fructosyltransferases and Invertases from Agave tequilana

    PubMed Central

    Cortés-Romero, Celso; Martínez-Hernández, Aída; Mellado-Mojica, Erika; López, Mercedes G.; Simpson, June

    2012-01-01

    Fructans are the main storage polysaccharides found in Agave species. The synthesis of these complex carbohydrates relies on the activities of specific fructosyltransferase enzymes closely related to the hydrolytic invertases. Analysis of Agave tequilana transcriptome data led to the identification of ESTs encoding putative fructosyltransferases and invertases. Based on sequence alignments and structure/function relationships, two different genes were predicted to encode 1-SST and 6G-FFT type fructosyltransferases; in addition, 4 genes encoding putative cell wall invertases and 4 genes encoding putative vacuolar invertases were identified. Probable functions for each gene were assigned based on conserved amino acid sequences and confirmed for 2 fructosyltransferases and one invertase by analyzing the enzymatic activity of recombinant Agave proteins expressed and purified from Pichia pastoris. The genome organization of the fructosyltransferase/invertase genes, for which the corresponding cDNA contained the complete open reading frame, was found to be well conserved since all genes were shown to carry a 9 bp mini-exon and all showed a similar structure of 8 exons/7 introns with the exception of a cell wall invertase gene which has 7 exons and 6 introns. Fructosyltransferase genes were strongly expressed in the storage organs of the plants, especially in vegetative stages of development and to lower levels in photosynthetic tissues, in contrast to the invertase genes where higher levels of expression were observed in leaf tissues and in mature plants. PMID:22558253

  3. Putative Nonribosomal Peptide Synthetase and Cytochrome P450 Genes Responsible for Tentoxin Biosynthesis in Alternaria alternata ZJ33.

    PubMed

    Li, You-Hai; Han, Wen-Jin; Gui, Xi-Wu; Wei, Tao; Tang, Shuang-Yan; Jin, Jian-Ming

    2016-08-02

    Tentoxin, a cyclic tetrapeptide produced by several Alternaria species, inhibits the F₁-ATPase activity of chloroplasts, resulting in chlorosis in sensitive plants. In this study, we report two clustered genes, encoding a putative non-ribosome peptide synthetase (NRPS) TES and a cytochrome P450 protein TES1, that are required for tentoxin biosynthesis in Alternaria alternata strain ZJ33, which was isolated from blighted leaves of Eupatorium adenophorum. Using a pair of primers designed according to the consensus sequences of the adenylation domain of NRPSs, two fragments containing putative adenylation domains were amplified from A. alternata ZJ33, and subsequent PCR analyses demonstrated that these fragments belonged to the same NRPS coding sequence. With no introns, TES consists of a single 15,486 base pair open reading frame encoding a predicted 5161 amino acid protein. Meanwhile, the TES1 gene is predicted to contain five introns and encode a 506 amino acid protein. The TES protein is predicted to be comprised of four peptide synthase modules with two additional N-methylation domains, and the number and arrangement of the modules in TES were consistent with the number and arrangement of the amino acid residues of tentoxin, respectively. Notably, both TES and TES1 null mutants generated via homologous recombination failed to produce tentoxin. This study provides the first evidence concerning the biosynthesis of tentoxin in A. alternata.
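    The ORF and protein lengths quoted in the record above can be cross-checked with simple codon arithmetic. The following is a minimal sketch, not part of the cited study; the function name is hypothetical, and it assumes the reported ORF length includes the terminal stop codon.

```python
def orf_to_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an intron-free ORF.

    Assumes one codon is three bases and, by default, that the reported
    ORF length counts the terminal stop codon (an assumption, not a
    statement from the abstract).
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# 15,486 bp TES open reading frame from the abstract
tes_residues = orf_to_protein_length(15486)
print(tes_residues)
```

    Under that assumption, a 15,486 bp ORF corresponds to 5162 codons, one of which is the stop codon, which agrees with the predicted 5161 amino acid protein.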

  4. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ganji, Rakesh; Murugapiran, Senthil K.; Ong, John C.

    The draft genome of Thermocrinis jamiesonii GBS1T is 1,315,625 bp in 10 contigs and encodes 1,463 predicted genes. The presence of sox genes and various glycoside hydrolases and the absence of uptake NiFe hydrogenases (hyaB) are consistent with a requirement for thiosulfate and suggest the ability to use carbohydrate polymers.

  5. Compositional profile of α/β-hydrolase fold proteins in mangrove soil metagenomes: prevalence of epoxide hydrolases and haloalkane dehalogenases in oil-contaminated sites.

    PubMed

    Jiménez, Diego Javier; Dini-Andreote, Francisco; Ottoni, Júlia Ronzella; de Oliveira, Valéria Maia; van Elsas, Jan Dirk; Andreote, Fernando Dini

    2015-05-01

    The occurrence of genes encoding biotechnologically relevant α/β-hydrolases in mangrove soil microbial communities was assessed using data obtained by whole-metagenome sequencing of four mangrove areas, denoted BrMgv01 to BrMgv04, in São Paulo, Brazil. The sequences (215 Mb in total) were filtered based on local amino acid alignments against the Lipase Engineering Database. In total, 5923 unassembled sequences were affiliated with 30 different α/β-hydrolase fold superfamilies. The most abundant predicted proteins encompassed cytosolic hydrolases (abH08; ∼23%), microsomal hydrolases (abH09; ∼12%) and Moraxella lipase-like proteins (abH04 and abH01; <5%). Detailed analysis of the genes predicted to encode proteins of the abH08 superfamily revealed a high proportion related to epoxide hydrolases and haloalkane dehalogenases in polluted mangroves BrMgv01-02-03. This suggested selection and putative involvement in local degradation/detoxification of the pollutants. Seven sequences that were annotated as genes for putative epoxide hydrolases and five for putative haloalkane dehalogenases were found in a fosmid library generated from BrMgv02 DNA. The latter enzymes were predicted to belong to Actinobacteria, Deinococcus-Thermus, Planctomycetes and Proteobacteria. Our integrated approach thus identified 12 genes (complete and/or partial) that may encode hitherto undescribed enzymes. The low amino acid identity (<60%) with already-described genes opens perspectives for both production in an expression host and genetic screening of metagenomes. © 2014 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.

  6. Characterization of the Genes Encoding the Cytosolic and Plastidial Forms of ADP-Glucose Pyrophosphorylase in Wheat Endosperm

    PubMed Central

    Burton, Rachel A.; Johnson, Philip E.; Beckles, Diane M.; Fincher, Geoffrey B.; Jenner, Helen L.; Naldrett, Mike J.; Denyer, Kay

    2002-01-01

    In most species, the synthesis of ADP-glucose (Glc) by the enzyme ADP-Glc pyrophosphorylase (AGPase) occurs entirely within the plastids in all tissues so far examined. However, in the endosperm of many, if not all grasses, a second form of AGPase synthesizes ADP-Glc outside the plastid, presumably in the cytosol. In this paper, we show that in the endosperm of wheat (Triticum aestivum), the cytosolic form accounts for most of the AGPase activity. Using a combination of molecular and biochemical approaches to identify the cytosolic and plastidial protein components of wheat endosperm AGPase we show that the large and small subunits of the cytosolic enzyme are encoded by genes previously thought to encode plastidial subunits, and that a gene, Ta.AGP.S.1, which encodes the small subunit of the cytosolic form of AGPase, also gives rise to a second transcript by the use of an alternate first exon. This second transcript encodes an AGPase small subunit with a transit peptide. However, we could not find a plastidial small subunit protein corresponding to this transcript. The protein sequence of the purified plastidial small subunit does not match precisely to that encoded by Ta.AGP.S.1 or to the predicted sequences of any other known gene from wheat or barley (Hordeum vulgare). Instead, the protein sequence is most similar to those of the plastidial small subunits from chickpea (Cicer arietinum) and maize (Zea mays) and rice (Oryza sativa) seeds. These data suggest that the gene encoding the major plastidial small subunit of AGPase in wheat endosperm has yet to be identified. PMID:12428011

  7. Complete Sequence of a 184-Kilobase Catabolic Plasmid from Sphingomonas aromaticivorans F199

    PubMed Central

    Romine, Margaret F.; Stillwell, Lisa C.; Wong, Kwong-Kwok; Thurston, Sarah J.; Sisk, Ellen C.; Sensen, Christoph; Gaasterland, Terry; Fredrickson, Jim K.; Saffer, Jeffrey D.

    1999-01-01

    The complete 184,457-bp sequence of the aromatic catabolic plasmid, pNL1, from Sphingomonas aromaticivorans F199 has been determined. A total of 186 open reading frames (ORFs) are predicted to encode proteins, of which 79 are likely directly associated with catabolism or transport of aromatic compounds. Genes that encode enzymes associated with the degradation of biphenyl, naphthalene, m-xylene, and p-cresol are predicted to be distributed among 15 gene clusters. The unusual coclustering of genes associated with different pathways appears to have evolved in response to similarities in biochemical mechanisms required for the degradation of intermediates in different pathways. A putative efflux pump and several hypothetical membrane-associated proteins were identified and predicted to be involved in the transport of aromatic compounds and/or intermediates in catabolism across the cell wall. Several genes associated with integration and recombination, including two group II intron-associated maturases, were identified in the replication region, suggesting that pNL1 is able to undergo integration and excision events with the chromosome and/or other portions of the plasmid. Conjugative transfer of pNL1 to another Sphingomonas sp. was demonstrated, and genes associated with this function were found in two large clusters. Approximately one-third of the ORFs (59 of them) have no obvious homology to known genes. PMID:10049392

  8. Identification and Analysis of the Biosynthetic Gene Cluster Encoding the Thiopeptide Antibiotic Cyclothiazomycin in Streptomyces hygroscopicus 10-22

    PubMed Central

    Wang, Jiang; Yu, Yi; Tang, Kexuan; Liu, Wen; He, Xinyi; Huang, Xi; Deng, Zixin

    2010-01-01

    Thiopeptide antibiotics are an important class of natural products resulting from posttranslational modifications of ribosomally synthesized peptides. Cyclothiazomycin is a typical thiopeptide antibiotic that has a unique bridged macrocyclic structure derived from an 18-amino-acid structural peptide. Here we report the cloning, sequencing, and heterologous expression of the cyclothiazomycin biosynthetic gene cluster from Streptomyces hygroscopicus 10-22. Remarkably, successful heterologous expression of a 22.7-kb gene cluster in Streptomyces lividans 1326 suggested that there is a minimum set of 15 open reading frames that includes all of the functional genes required for cyclothiazomycin production. Six of these genes, cltBCDEFG flanking the structural gene cltA, were predicted to encode the enzymes required for the main framework of cyclothiazomycin, and two enzymes encoded by a putative operon, cltMN, were hypothesized to participate in the tailoring step to generate the tertiary thioether, leading to the final cyclization of the bridged macrocyclic structure. This rigorous bioinformatics analysis based on heterologous expression of cyclothiazomycin resulted in an ideal biosynthetic model for us to understand the biosynthesis of thiopeptides. PMID:20154110

  9. Characterization of the Lymantria dispar nucleopolyhedrovirus 25K FP gene

    Treesearch

    David S. Bischoff; James M. Slavicek

    1996-01-01

    The Lymantria dispar nucleopolyhedrovirus (LdMNPV) gene encoding the 25K FP protein has been cloned and sequenced. The 25K FP gene codes for a 217 amino acid protein with a predicted molecular mass of 24870 Da. Expression of the 25K FP protein in a rabbit reticulocyte system generated a 27 kDa protein, in close agreement with the...

  10. The cysteine rich necrotrophic effector SnTox1 produced by Stagonospora nodorum triggers susceptibility of wheat lines harboring Snn1

    USDA-ARS's Scientific Manuscript database

    The gene encoding SnTox1, a necrotrophic effector from Stagonospora nodorum that causes necrosis of wheat lines expressing Snn1, has been verified by heterologous expression in Pichia pastoris. SnTox1 encodes a 117 amino acid cysteine rich protein with the first 17 amino acids predicted as a signal ...

  11. Comparative genome analysis reveals genetic adaptation to versatile environmental conditions and importance of biofilm lifestyle in Comamonas testosteroni.

    PubMed

    Wu, Yichao; Arumugam, Krithika; Tay, Martin Qi Xiang; Seshan, Hari; Mohanty, Anee; Cao, Bin

    2015-04-01

    Comamonas testosteroni is an important environmental bacterium capable of degrading a variety of toxic aromatic pollutants and has been demonstrated to be a promising biocatalyst for environmental decontamination. This organism is often found to be among the primary surface colonizers in various natural and engineered ecosystems, suggesting an extraordinary capability of this organism in environmental adaptation and biofilm formation. The goal of this study was to gain genetic insights into the adaption of C. testosteroni to versatile environments and the importance of a biofilm lifestyle. Specifically, a draft genome of C. testosteroni I2 was obtained. The draft genome is 5,778,710 bp in length and comprises 110 contigs. The average G+C content was 61.88 %. A total of 5365 genes with 5263 protein-coding genes were predicted, whereas 4324 (80.60 % of total genes) protein-encoding genes were associated with predicted functions. The catabolic genes responsible for biodegradation of steroid and other aromatic compounds on draft genome were identified. Plasmid pI2 was found to encode a complete pathway for aniline degradation and a partial catabolic pathway for chloroaniline. This organism was found to be equipped with a sophisticated signaling system which helps it find ideal niches and switch between planktonic and biofilm lifestyles. A large number of putative multi-drug-resistant genes coding for abundant outer membrane transporters, chaperones, and heat shock proteins for the protection of cellular function were identified in the genome of strain I2. In addition, the genome of strain I2 was predicted to encode several proteins involved in producing, secreting, and uptaking siderophores under iron-limiting conditions. The genome of strain I2 contains a number of genes responsible for the synthesis and secretion of exopolysaccharides, an extracellular component essential for biofilm formation. 
Overall, our results reveal the genomic features underlying the adaptation of C. testosteroni to versatile environments and highlight the importance of its biofilm lifestyle.
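    The annotation percentage quoted in the record above depends on which gene count is used as the denominator. The following is a minimal sketch for checking it from the counts given in the abstract; the function name is hypothetical and not taken from the study.

```python
def percent_annotated(annotated: int, total: int) -> float:
    """Return the percentage of genes that carry a predicted function."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * annotated / total


# Counts quoted in the abstract
total_genes = 5365            # all predicted genes
protein_coding_genes = 5263   # predicted protein-coding genes
with_function = 4324          # genes associated with predicted functions

print(round(percent_annotated(with_function, total_genes), 2))
print(round(percent_annotated(with_function, protein_coding_genes), 2))
```

    The first ratio uses all predicted genes as the denominator and reproduces the 80.60 % stated in the abstract; the second, relative to protein-coding genes only, would be somewhat higher.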

  12. Fine mapping of Restorer-of-fertility in pepper (Capsicum annuum L.) identified a candidate gene encoding a pentatricopeptide repeat (PPR)-containing protein.

    PubMed

    Jo, Yeong Deuk; Ha, Yeaseong; Lee, Joung-Ho; Park, Minkyu; Bergsma, Alex C; Choi, Hong-Il; Goritschnig, Sandra; Kloosterman, Bjorn; van Dijk, Peter J; Choi, Doil; Kang, Byoung-Cheorl

    2016-10-01

    Using fine mapping techniques, the genomic region co-segregating with Restorer-of-fertility (Rf) in pepper was delimited to a region of 821 kb in length. A PPR gene in this region, CaPPR6, was identified as a strong candidate for Rf based on expression pattern and characteristics of encoding sequence. Cytoplasmic-genic male sterility (CGMS) has been used for the efficient production of hybrid seeds in peppers (Capsicum annuum L.). Although the mitochondrial candidate genes that might be responsible for cytoplasmic male sterility (CMS) have been identified, the nuclear Restorer-of-fertility (Rf) gene has not been isolated. To identify the genomic region co-segregating with Rf in pepper, we performed fine mapping using an Rf-segregating population consisting of 1068 F2 individuals, based on BSA-AFLP and a comparative mapping approach. Through six cycles of chromosome walking, the co-segregating region harboring the Rf locus was delimited to be within 821 kb of sequence. Prediction of expressed genes in this region based on transcription analysis revealed four candidate genes. Among these, CaPPR6 encodes a pentatricopeptide repeat (PPR) protein with PPR motifs that are repeated 14 times. Characterization of the CaPPR6 protein sequence, based on alignment with other homologs, showed that CaPPR6 is a typical Rf-like (RFL) gene reported to have undergone diversifying selection during evolution. A marker developed from a sequence near CaPPR6 showed a higher prediction rate of the Rf phenotype than those of previously developed markers when applied to a panel of breeding lines of diverse origin. These results suggest that CaPPR6 is a strong candidate for the Rf gene in pepper.

  13. Cloning and expression analysis of FaPR-1 gene in strawberry

    NASA Astrophysics Data System (ADS)

    Mo, Fan; Luo, Ya; Ge, Cong; Mo, Qin; Ling, Yajie; Luo, Shu; Tang, Haoru

    2018-04-01

    The FaPR-1 gene was cloned by RT-PCR from 'Benihoppe' strawberry and subjected to bioinformatics analysis. The open reading frame was 483 bp, encoding 160 amino acids; the predicted protein had a molecular weight of 17854.17 and a theoretical isoelectric point of 8.72. Subcellular localization prediction indicated that the protein is located extracellularly. Homology comparison and phylogenetic tree construction of strawberry FaPR-1 with pathogenesis-related proteins from other plants showed that it is most closely related to those of grape and peach. Under treatment with ABA, sucrose, and a mixture of the two, expression of FaPR-1 in strawberry fruit increased significantly.

  14. A Putative Gene Cluster from a Lyngbya wollei Bloom that Encodes Paralytic Shellfish Toxin Biosynthesis

    PubMed Central

    Mihali, Troco K.; Carmichael, Wayne W.; Neilan, Brett A.

    2011-01-01

    Saxitoxin and its analogs cause the paralytic shellfish-poisoning syndrome, adversely affecting human health and coastal shellfish industries worldwide. Here we report the isolation, sequencing, annotation, and predicted pathway of the saxitoxin biosynthetic gene cluster in the cyanobacterium Lyngbya wollei. The gene cluster spans 36 kb and encodes enzymes for the biosynthesis and export of the toxins. The Lyngbya wollei saxitoxin gene cluster differs from previously identified saxitoxin clusters as it contains genes that are unique to this cluster, whereby the carbamoyltransferase is truncated and replaced by an acyltransferase, explaining the unique toxin profile presented by Lyngbya wollei. These findings will enable the creation of toxin probes, for water monitoring purposes, as well as proof-of-concept for the combinatorial biosynthesis of these natural occurring alkaloids for the production of novel, biologically active compounds. PMID:21347365

  15. Genome analysis of the foxtail millet pathogen Sclerospora graminicola reveals the complex effector repertoire of graminicolous downy mildews.

    PubMed

    Kobayashi, Michie; Hiraka, Yukie; Abe, Akira; Yaegashi, Hiroki; Natsume, Satoshi; Kikuchi, Hideko; Takagi, Hiroki; Saitoh, Hiromasa; Win, Joe; Kamoun, Sophien; Terauchi, Ryohei

    2017-11-22

    Downy mildew, caused by the oomycete pathogen Sclerospora graminicola, is an economically important disease of Gramineae crops including foxtail millet (Setaria italica). Plants infected with S. graminicola are generally stunted and often undergo a transformation of flower organs into leaves (phyllody or witches' broom), resulting in serious yield loss. To establish the molecular basis of downy mildew disease in foxtail millet, we carried out whole-genome sequencing and an RNA-seq analysis of S. graminicola. Sequence reads were generated from S. graminicola using an Illumina sequencing platform and assembled de novo into a draft genome sequence comprising approximately 360 Mbp. Of this sequence, 73% comprised repetitive elements, and a total of 16,736 genes were predicted from the RNA-seq data. The predicted genes included those encoding effector-like proteins with high sequence similarity to those previously identified in other oomycete pathogens. Genes encoding jacalin-like lectin-domain-containing secreted proteins were enriched in S. graminicola compared to other oomycetes. Of a total of 1220 genes encoding putative secreted proteins, 91 significantly changed their expression levels during the infection of plant tissues compared to the sporangia and zoospore stages of the S. graminicola lifecycle. We established the draft genome sequence of a downy mildew pathogen that infects Gramineae plants. Based on this sequence and our transcriptome analysis, we generated a catalog of in planta-induced candidate effector genes, providing a solid foundation from which to identify the effectors causing phyllody.

  16. The Draft Genome of the Non-Host-Associated Methanobrevibacter arboriphilus Strain DH1 Encodes a Large Repertoire of Adhesin-Like Proteins

    PubMed Central

    Poehlein, Anja; Daniel, Rolf

    2017-01-01

    Methanobrevibacter arboriphilus strain DH1 is an autotrophic methanogen that was isolated from the wetwood of methane-emitting trees. This species has been of considerable interest for its unusual oxygen tolerance and has been studied as a model organism for more than four decades. Strain DH1 is closely related to other host-associated Methanobrevibacter species from intestinal tracts of animals and the rumen, making this strain an interesting candidate for comparative analysis to identify factors important for colonizing intestinal environments. Here, the genome sequence of M. arboriphilus strain DH1 is reported. The draft genome is composed of 2,445,031 bp with an average GC content of 25.44% and is predicted to harbour 1964 protein-encoding genes. Among the predicted genes, there are also more than 50 putative genes for the so-called adhesin-like proteins (ALPs). The presence of ALP-encoding genes in the genome of this non-host-associated methanogen strongly suggests that target surfaces for ALPs other than host tissues also need to be considered as potential interaction partners. The high abundance of ALPs may also indicate that these types of proteins are more characteristic of specific phylogenetic groups of methanogens rather than being indicative of a particular environment the methanogens thrive in. PMID:28634433

  17. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    PubMed

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  18. A Hybrid Approach of Gene Sets and Single Genes for the Prediction of Survival Risks with Gene Expression Data

    PubMed Central

    Seok, Junhee; Davis, Ronald W.; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn’t been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge. PMID:25933378

  19. Human myosin VIIA responsible for the Usher 1B syndrome: a predicted membrane-associated motor protein expressed in developing sensory epithelia.

    PubMed

    Weil, D; Levy, G; Sahly, I; Levi-Acobas, F; Blanchard, S; El-Amraoui, A; Crozet, F; Philippe, H; Abitbol, M; Petit, C

    1996-04-16

    The gene encoding human myosin VIIA is responsible for Usher syndrome type IB (USH1B), a disease which associates profound congenital sensorineural deafness, vestibular dysfunction, and retinitis pigmentosa. The reconstituted cDNA sequence presented here predicts a 2215 amino acid protein with a typical unconventional myosin structure. This protein is expected to dimerize into a two-headed molecule. The C terminus of its tail shares homology with the membrane-binding domain of the band 4.1 protein superfamily. The gene consists of 48 coding exons. It encodes several alternatively spliced forms. In situ hybridization analysis in human embryos demonstrates that the myosin VIIA gene is expressed in the pigment epithelium and the photoreceptor cells of the retina, thus indicating that both cell types may be involved in the USH1B retinal degenerative process. In addition, the gene is expressed in the human embryonic cochlear and vestibular neuroepithelia. We suggest that deafness and vestibular dysfunction in USH1B patients result from a defect in the morphogenesis of the inner ear sensory cell stereocilia.

  20. Complete Genome Sequence of the Methanococcus maripaludis Type Strain JJ (DSM 2067), a Model for Selenoprotein Synthesis in Archaea.

    PubMed

    Poehlein, Anja; Heym, Daniel; Quitzke, Vivien; Fersch, Julia; Daniel, Rolf; Rother, Michael

    2018-04-05

    Methanococcus maripaludis type strain JJ (DSM 2067) is an important organism because it serves as a model for primary energy metabolism and hydrogenotrophic methanogenesis and is amenable to genetic manipulation. The complete genome (1.7 Mb) harbors 1,815 predicted protein-encoding genes, including 9 encoding selenoproteins. Copyright © 2018 Poehlein et al.

  1. A taxonomy of bacterial microcompartment loci constructed by a novel scoring method

    DOE PAGES

    Axen, Seth D.; Erbilgin, Onur; Kerfeld, Cheryl A.; ...

    2014-10-23

    Bacterial microcompartments (BMCs) are proteinaceous organelles involved in both autotrophic and heterotrophic metabolism. All BMCs share homologous shell proteins but differ in their complement of enzymes; these are typically encoded adjacent to shell protein genes in genetic loci, or operons. To enable the identification and prediction of functional (sub)types of BMCs, we developed LoClass, an algorithm that finds putative BMC loci and inventories, weights, and compares their constituent pfam domains to construct a locus similarity network and predict locus (sub)types. In addition to using LoClass to analyze sequences in the Non-redundant Protein Database, we compared predicted BMC loci found in seven candidate bacterial phyla (six from single-cell genomic studies) to the LoClass taxonomy. Together, these analyses resulted in the identification of 23 different types of BMCs encoded in 30 distinct locus (sub)types found in 23 bacterial phyla. These include the two carboxysome types and a divergent set of metabolosomes, BMCs that share a common catalytic core and process distinct substrates via specific signature enzymes. Furthermore, many Candidate BMCs were found that lack one or more core metabolosome components, including one that is predicted to represent an entirely new paradigm for BMC-associated metabolism, joining the carboxysome and metabolosome. By placing these results in a phylogenetic context, we provide a framework for understanding the horizontal transfer of these loci, a starting point for studies aimed at understanding the evolution of BMCs. This comprehensive taxonomy of BMC loci, based on their constituent protein domains, foregrounds the functional diversity of BMCs and provides a reference for interpreting the role of BMC gene clusters encoded in isolate, single cell, and metagenomic data. Many loci encode ancillary functions such as transporters or genes for cofactor assembly; this expanded vocabulary of BMC-related functions should be useful for design of genetic modules for introducing BMCs in bioengineering applications.

  2. A Taxonomy of Bacterial Microcompartment Loci Constructed by a Novel Scoring Method

    PubMed Central

    Kerfeld, Cheryl A.

    2014-01-01

    Bacterial microcompartments (BMCs) are proteinaceous organelles involved in both autotrophic and heterotrophic metabolism. All BMCs share homologous shell proteins but differ in their complement of enzymes; these are typically encoded adjacent to shell protein genes in genetic loci, or operons. To enable the identification and prediction of functional (sub)types of BMCs, we developed LoClass, an algorithm that finds putative BMC loci and inventories, weights, and compares their constituent pfam domains to construct a locus similarity network and predict locus (sub)types. In addition to using LoClass to analyze sequences in the Non-redundant Protein Database, we compared predicted BMC loci found in seven candidate bacterial phyla (six from single-cell genomic studies) to the LoClass taxonomy. Together, these analyses resulted in the identification of 23 different types of BMCs encoded in 30 distinct locus (sub)types found in 23 bacterial phyla. These include the two carboxysome types and a divergent set of metabolosomes, BMCs that share a common catalytic core and process distinct substrates via specific signature enzymes. Furthermore, many Candidate BMCs were found that lack one or more core metabolosome components, including one that is predicted to represent an entirely new paradigm for BMC-associated metabolism, joining the carboxysome and metabolosome. By placing these results in a phylogenetic context, we provide a framework for understanding the horizontal transfer of these loci, a starting point for studies aimed at understanding the evolution of BMCs. This comprehensive taxonomy of BMC loci, based on their constituent protein domains, foregrounds the functional diversity of BMCs and provides a reference for interpreting the role of BMC gene clusters encoded in isolate, single cell, and metagenomic data. 
Many loci encode ancillary functions such as transporters or genes for cofactor assembly; this expanded vocabulary of BMC-related functions should be useful for design of genetic modules for introducing BMCs in bioengineering applications. PMID:25340524

  3. Characterization of the Tupaia rhabdovirus genome reveals a long open reading frame overlapping with P and a novel gene encoding a small hydrophobic protein.

    PubMed

    Springfeld, Christoph; Darai, Gholamreza; Cattaneo, Roberto

    2005-06-01

    Rhabdoviruses are negative-stranded RNA viruses of the order Mononegavirales and have been isolated from vertebrates, insects, and plants. Members of the genus Lyssavirus cause the invariably fatal disease rabies, and a member of the genus Vesiculovirus, Chandipura virus, has recently been associated with acute encephalitis in children. We present here the complete genome sequence and transcription map of a rhabdovirus isolated from cultivated cells of hepatocellular carcinoma tissue from a moribund tree shrew. The negative-strand genome of tupaia rhabdovirus is composed of 11,440 nucleotides and encodes six genes that are separated by one or two intergenic nucleotides. In addition to the typical rhabdovirus genes in the order N-P-M-G-L, a gene encoding a small hydrophobic putative type I transmembrane protein of approximately 11 kDa was identified between the M and G genes, and the corresponding transcript was detected in infected cells. Similar to some Vesiculoviruses and many Paramyxovirinae, the P gene has a second overlapping reading frame that can be accessed by ribosomal choice and encodes a protein of 26 kDa, predicted to be the largest C protein of these virus families. Phylogenetic analyses of the tupaia rhabdovirus N and L genes show that the virus is distantly related to the Vesiculoviruses, Ephemeroviruses, and the recently characterized Flanders virus and Oita virus and further extends the sequence territory occupied by animal rhabdoviruses.

  4. Characterization of the Tupaia Rhabdovirus Genome Reveals a Long Open Reading Frame Overlapping with P and a Novel Gene Encoding a Small Hydrophobic Protein

    PubMed Central

    Springfeld, Christoph; Darai, Gholamreza; Cattaneo, Roberto

    2005-01-01

    Rhabdoviruses are negative-stranded RNA viruses of the order Mononegavirales and have been isolated from vertebrates, insects, and plants. Members of the genus Lyssavirus cause the invariably fatal disease rabies, and a member of the genus Vesiculovirus, Chandipura virus, has recently been associated with acute encephalitis in children. We present here the complete genome sequence and transcription map of a rhabdovirus isolated from cultivated cells of hepatocellular carcinoma tissue from a moribund tree shrew. The negative-strand genome of tupaia rhabdovirus is composed of 11,440 nucleotides and encodes six genes that are separated by one or two intergenic nucleotides. In addition to the typical rhabdovirus genes in the order N-P-M-G-L, a gene encoding a small hydrophobic putative type I transmembrane protein of approximately 11 kDa was identified between the M and G genes, and the corresponding transcript was detected in infected cells. Similar to some Vesiculoviruses and many Paramyxovirinae, the P gene has a second overlapping reading frame that can be accessed by ribosomal choice and encodes a protein of 26 kDa, predicted to be the largest C protein of these virus families. Phylogenetic analyses of the tupaia rhabdovirus N and L genes show that the virus is distantly related to the Vesiculoviruses, Ephemeroviruses, and the recently characterized Flanders virus and Oita virus and further extends the sequence territory occupied by animal rhabdoviruses. PMID:15890917

  5. Identification and characterisation of the angiotensin converting enzyme-3 (ACE3) gene: a novel mammalian homologue of ACE

    PubMed Central

    Rella, Monika; Elliot, Joann L; Revett, Timothy J; Lanfear, Jerry; Phelan, Anne; Jackson, Richard M; Turner, Anthony J; Hooper, Nigel M

    2007-01-01

    Background Mammalian angiotensin converting enzyme (ACE) plays a key role in blood pressure regulation. Although multiple ACE-like proteins exist in non-mammalian organisms, to date only one other ACE homologue, ACE2, has been identified in mammals. Results Here we report the identification and characterisation of the gene encoding a third homologue of ACE, termed ACE3, in several mammalian genomes. The ACE3 gene is located on the same chromosome downstream of the ACE gene. Multiple sequence alignment and molecular modelling have been employed to characterise the predicted ACE3 protein. In mouse, rat, cow and dog, the predicted protein has mutations in some of the critical residues involved in catalysis, including the catalytic Glu in the HEXXH zinc binding motif which is Gln, and ESTs or reverse-transcription PCR indicate that the gene is expressed. In humans, the predicted ACE3 protein has an intact HEXXH motif, but there are other deletions and insertions in the gene and no ESTs have been identified. Conclusion In the genomes of several mammalian species there is a gene that encodes a novel, single domain ACE-like protein, ACE3. In mouse, rat, cow and dog ACE3, the catalytic Glu is replaced by Gln in the putative zinc binding motif, indicating that in these species ACE3 would lack catalytic activity as a zinc metalloprotease. In humans, no evidence was found that the ACE3 gene is expressed and the presence of deletions and insertions in the sequence indicate that ACE3 is a pseudogene. PMID:17597519

  6. Avirulence gene mapping in the Hessian fly (Mayetiola destructor) reveals a protein phosphatase 2C effector gene family.

    PubMed

    Zhao, Chaoyang; Shukle, Richard; Navarro-Escalante, Lucio; Chen, Mingshun; Richards, Stephen; Stuart, Jeffrey J

    2016-01-01

    The genetic tractability of the Hessian fly (HF, Mayetiola destructor) provides an opportunity to investigate the mechanisms insects use to induce plant gall formation. Here we demonstrate that capacity using the newly sequenced HF genome by identifying the gene (vH24) that elicits effector-triggered immunity in wheat (Triticum spp.) seedlings carrying HF resistance gene H24. vH24 was mapped within a 230-kb genomic fragment near the telomere of HF chromosome X1. That fragment contains only 21 putative genes. The best candidate vH24 gene in this region encodes a protein containing a secretion signal and a type-2 serine/threonine protein phosphatase (PP2C) domain. This gene has an H24-virulence associated insertion in its promoter that appears to silence transcription of the gene in H24-virulent larvae. Candidate vH24 is a member of a small family of genes that encode secretion signals and PP2C domains. It belongs to the fraction of genes in the HF genome previously predicted to encode effector proteins. Because PP2C proteins are not normally secreted, our results suggest that these are PP2C effectors that HF larvae inject into wheat cells to redirect, or interfere, with wheat signal transduction pathways. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Putative Nonribosomal Peptide Synthetase and Cytochrome P450 Genes Responsible for Tentoxin Biosynthesis in Alternaria alternata ZJ33

    PubMed Central

    Li, You-Hai; Han, Wen-Jin; Gui, Xi-Wu; Wei, Tao; Tang, Shuang-Yan; Jin, Jian-Ming

    2016-01-01

    Tentoxin, a cyclic tetrapeptide produced by several Alternaria species, inhibits the F1-ATPase activity of chloroplasts, resulting in chlorosis in sensitive plants. In this study, we report two clustered genes, encoding a putative non-ribosome peptide synthetase (NRPS) TES and a cytochrome P450 protein TES1, that are required for tentoxin biosynthesis in Alternaria alternata strain ZJ33, which was isolated from blighted leaves of Eupatorium adenophorum. Using a pair of primers designed according to the consensus sequences of the adenylation domain of NRPSs, two fragments containing putative adenylation domains were amplified from A. alternata ZJ33, and subsequent PCR analyses demonstrated that these fragments belonged to the same NRPS coding sequence. With no introns, TES consists of a single 15,486 base pair open reading frame encoding a predicted 5161 amino acid protein. Meanwhile, the TES1 gene is predicted to contain five introns and encode a 506 amino acid protein. The TES protein is predicted to be comprised of four peptide synthase modules with two additional N-methylation domains, and the number and arrangement of the modules in TES were consistent with the number and arrangement of the amino acid residues of tentoxin, respectively. Notably, both TES and TES1 null mutants generated via homologous recombination failed to produce tentoxin. This study provides the first evidence concerning the biosynthesis of tentoxin in A. alternata. PMID:27490569
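
    The ORF and protein lengths quoted for TES in the record above can be cross-checked with simple codon arithmetic. The sketch below is only an illustration: the function name is hypothetical, and it assumes the reported 15,486 bp ORF length includes the terminating stop codon, which the abstract does not state explicitly.

```python
def protein_length_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an intron-free ORF.

    Assumption (not stated in the abstract): the ORF length counts the
    stop codon, so one codon is subtracted when includes_stop is True.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# TES: 15,486 bp ORF reported to encode a 5161 amino acid protein.
tes_residues = protein_length_from_orf(15486)
print(tes_residues)
```

    Under that assumption the two reported figures agree: 15,486 bp is 5162 codons, one of which is the stop codon.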

  8. Variety of genotypes in males diagnosed as dichromatic on a conventional clinical anomaloscope

    PubMed Central

    CARROLL, JOSEPH; RENNER, AGNES; KNAU, HOLGER; WERNER, JOHN S.; NEITZ, JAY

    2008-01-01

    The hypothesis that dichromatic behavior on a clinical anomaloscope can be explained by the complement and arrangement of the long- (L) and middle-wavelength (M) pigment genes was tested. It was predicted that dichromacy is associated with an X-chromosome pigment gene array capable of producing only a single functional pigment type. The simplest case of this is when deletion has left only a single X-chromosome pigment gene. The production of a single L or M pigment type can also result from rearrangements in which multiple genes remain. Often, only the two genes at the 5′ end of the array are expressed; thus, dichromacy is also predicted to occur if one of these is defective or encodes a defective pigment, or if both of them encode pigments with identical spectral sensitivities. Subjects were 128 males who accepted the full range of admixtures of the two primary lights as matching the comparison light on a Neitz or Nagel anomaloscope. Strikingly, examination of the L and M pigment genes revealed a potential cause for a color-vision defect in all 128 dichromats. This indicates that the major component of color-vision deficiency could be attributed to alterations of the pigment genes or their regulatory regions in all cases, and the variety of gene arrangements associated with dichromacy is cataloged here. However, a fraction of the dichromats (17 out of 128; 13%) had genes predicted to encode pigments that would result in two populations of cones with different spectral sensitivities. Nine of the 17 were predicted to have two pigments with slightly different spectral peaks (usually≤2.5 nm) and eight had genes which specified pigments identical in peak absorption, but different in amino acid positions previously associated with optical density differences. In other subjects, reported previously, the same small spectral differences were associated with anomalous trichromacy rather than dichromacy. 
It appears that when the spectral difference specified by the genes is very small, the amount of residual red-green color vision measured varies; some individuals test as dichromats, others test as anomalous trichromats. The discrepancy is probably partly attributable to testing method differences and partly to a difference in performance not perception, but it seems there must also be cases in which other factors, for example, cone ratio, contribute to a person's ability to extract a color signal from a small spectral difference. PMID:15518190

  9. Evaluating the pathogenic potential of environmental Escherichia coli by using the Caenorhabditis elegans infection model.

    PubMed

    Merkx-Jacques, Alexandra; Coors, Anja; Brousseau, Roland; Masson, Luke; Mazza, Alberto; Tien, Yuan-Ching; Topp, Edward

    2013-04-01

    The detection and abundance of Escherichia coli in water is used to monitor and mandate the quality of drinking and recreational water. Distinguishing commensal waterborne E. coli isolates from those that cause diarrhea or extraintestinal disease in humans is important for quantifying human health risk. A DNA microarray was used to evaluate the distribution of virulence genes in 148 E. coli environmental isolates from a watershed in eastern Ontario, Canada, and in eight clinical isolates. Their pathogenic potential was evaluated with Caenorhabditis elegans, and the concordance between the bioassay result and the pathotype deduced by genotyping was explored. Isolates identified as potentially pathogenic on the basis of their complement of virulence genes were significantly more likely to be pathogenic to C. elegans than those determined to be potentially nonpathogenic. A number of isolates that were identified as nonpathogenic on the basis of genotyping were pathogenic in the infection assay, suggesting that genotyping did not capture all potentially pathogenic types. The detection of the adhesin-encoding genes sfaD, focA, and focG, which encode adhesins; of iroN2, which encodes a siderophore receptor; of pic, which encodes an autotransporter protein; and of b1432, which encodes a putative transposase, was significantly associated with pathogenicity in the infection assay. Overall, E. coli isolates predicted to be pathogenic on the basis of genotyping were indeed so in the C. elegans infection assay. Furthermore, the detection of C. elegans-infective environmental isolates predicted to be nonpathogenic on the basis of genotyping suggests that there are hitherto-unrecognized virulence factors or combinations thereof that are important in the establishment of infection.

  10. Massively Convergent Evolution for Ribosomal Protein Gene Content in Plastid and Mitochondrial Genomes

    PubMed Central

    Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F.; Martin, William F.

    2013-01-01

    Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force. PMID:24259312

  11. Gene duplications are extensive and contribute significantly to the toxic proteome of nematocysts isolated from Acropora digitifera (Cnidaria: Anthozoa: Scleractinia).

    PubMed

    Gacesa, Ranko; Chung, Ray; Dunn, Simon R; Weston, Andrew J; Jaimes-Becerra, Adrian; Marques, Antonio C; Morandini, André C; Hranueli, Daslav; Starcevic, Antonio; Ward, Malcolm; Long, Paul F

    2015-10-13

    Gene duplication followed by adaptive selection is a well-accepted process leading to toxin diversification in venoms. However, emergent genomic, transcriptomic and proteomic evidence now challenges this role to be at best equivocal to other processes. Cnidaria are arguably the most ancient phylum of the extant metazoa that are venomous and as such provide a definitive ancestral anchor to examine the evolution of this trait. Here we compare predicted toxins from the translated genome of the coral Acropora digitifera to putative toxins revealed by proteomic analysis of soluble proteins discharged from nematocysts, to determine the extent to which gene duplications contribute to venom innovation in this reef-building coral species. A new bioinformatics tool called HHCompare was developed to detect potential gene duplications in the genomic data, which is made freely available (https://github.com/rgacesa/HHCompare). A total of 55 potential toxin encoding genes could be predicted from the A. digitifera genome, of which 36 (65%) had likely arisen by gene duplication as evinced using the HHCompare tool and verified using two standard phylogeny methods. Surprisingly, only 22% (12/55) of the potential toxin repertoire could be detected following rigorous proteomic analysis, for which only half (6/12) of the toxin proteome could be accounted for as peptides encoded by the gene duplicates. Biological activities of these toxins are dominated by putative phospholipases and toxic peptidases. Gene expansions in A. digitifera venom are the most extensive yet described in any venomous animal, and gene duplication plays a significant role leading to toxin diversification in this coral species. Since such low numbers of toxins were detected in the proteome, it is unlikely that the venom is evolving rapidly by prey-driven positive natural selection. Rather we contend that the venom has a defensive role deterring predation or harm from interspecific competition and overgrowth by fouling organisms. Factors influencing translation of toxin encoding genes perhaps warrant more profound experimental consideration.

  12. Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

    PubMed Central

    Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H

    2005-01-01

    Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476

  13. Genomes to natural products PRediction Informatics for Secondary Metabolomes (PRISM)

    PubMed Central

    Skinnider, Michael A.; Dejong, Chris A.; Rees, Philip N.; Johnston, Chad W.; Li, Haoxin; Webster, Andrew L. H.; Wyatt, Morgan A.; Magarvey, Nathan A.

    2015-01-01

    Microbial natural products are an invaluable source of evolved bioactive small molecules and pharmaceutical agents. Next-generation and metagenomic sequencing indicates untapped genomic potential, yet high rediscovery rates of known metabolites increasingly frustrate conventional natural product screening programs. New methods to connect biosynthetic gene clusters to novel chemical scaffolds are therefore critical to enable the targeted discovery of genetically encoded natural products. Here, we present PRISM, a computational resource for the identification of biosynthetic gene clusters, prediction of genetically encoded nonribosomal peptides and type I and II polyketides, and bio- and cheminformatic dereplication of known natural products. PRISM implements novel algorithms which render it uniquely capable of predicting type II polyketides, deoxygenated sugars, and starter units, making it a comprehensive genome-guided chemical structure prediction engine. A library of 57 tailoring reactions is leveraged for combinatorial scaffold library generation when multiple potential substrates are consistent with biosynthetic logic. We compare the accuracy of PRISM to existing genomic analysis platforms. PRISM is an open-source, user-friendly web application available at http://magarveylab.ca/prism/. PMID:26442528

  14. Characterization and Heterologous Expression of the Genes Encoding Enterocin A Production, Immunity, and Regulation in Enterococcus faecium DPC1146

    PubMed Central

    O’Keeffe, Triona; Hill, Colin; Ross, R. Paul

    1999-01-01

    Enterocin A is a small, heat-stable, antilisterial bacteriocin produced by Enterococcus faecium DPC1146. The sequence of a 10,879-bp chromosomal region containing at least 12 open reading frames (ORFs), 7 of which are predicted to play a role in enterocin biosynthesis, is presented. The genes entA, entI, and entF encode the enterocin A prepeptide, the putative immunity protein, and the induction factor prepeptide, respectively. The deduced proteins EntK and EntR resemble the histidine kinase and response regulator proteins of two-component signal transducing systems of the AgrC-AgrA type. The predicted proteins EntT and EntD are homologous to ABC (ATP-binding cassette) transporters and accessory factors, respectively, of several other bacteriocin systems and to proteins implicated in the signal-sequence-independent export of Escherichia coli hemolysin A. Immediately downstream of the entT and entD genes are two ORFs, the product of one of which, ORF4, is very similar to the product of the yteI gene of Bacillus subtilis and to E. coli protease IV, a signal peptide peptidase known to be involved in outer membrane lipoprotein export. Another potential bacteriocin is encoded in the opposite direction to the other genes in the enterocin cluster. This putative bacteriocin-like peptide is similar to LafX, one of the components of the lactacin F complex. A deletion which included one of two direct repeats upstream of the entA gene abolished enterocin A activity, immunity, and ability to induce bacteriocin production. Transposon insertion upstream of the entF gene also had the same effect, but this mutant could be complemented by exogenously supplied induction factor. The putative EntI peptide was shown to be involved in the immunity to enterocin A. 
Cloning of a 10.5-kb amplicon comprising all predicted ORFs and regulatory regions resulted in heterologous production of enterocin A and induction factor in Enterococcus faecalis, while a four-gene construct (entAITD) under the control of a constitutive promoter resulted in heterologous enterocin A production in both E. faecalis and Lactococcus lactis. PMID:10103244

  15. The RpfCG two-component system negatively regulates the colonization of sugar cane stalks by Xanthomonas albilineans.

    PubMed

    Rott, Philippe; Fleites, Laura A; Mensi, Imène; Sheppard, Lauren; Daugrois, Jean-Heinrich; Dow, J Maxwell; Gabriel, Dean W

    2013-06-01

    The genome of Xanthomonas albilineans, the causal agent of sugar cane leaf scald, carries a gene cluster encoding a predicted quorum sensing system that is highly related to the diffusible signalling factor (DSF) systems of the plant pathogens Xylella fastidiosa and Xanthomonas campestris. In these latter pathogens, a cluster of regulation of pathogenicity factors (rpf) genes encodes the DSF system and is involved in control of various cellular processes. Mutation of Xanthomonas albilineans rpfF, encoding a predicted DSF synthase, in Florida strain XaFL07-1 resulted in a small reduction of disease severity (DS). Single-knockout mutations of rpfC and rpfG (encoding a predicted DSF sensor and regulator, respectively) had no effect on DS or swimming motility of the pathogen. However, capacity of the pathogen to cause disease was slightly reduced and swimming motility was severely affected when rpfG and rpfC were both deleted. Similar results were obtained when the entire rpfGCF region was deleted. Surprisingly, when the pathogen was mutated in rpfG or rpfC (single or double mutations) it was able to colonize sugar cane spatially more efficiently than the wild-type. Mutation in rpfF alone did not affect the degree of spatial invasion. We conclude that the DSF signal contributes to symptom expression but not to invasion of sugar cane stalks by Xanthomonas albilineans strain XaFL07-1, which is mainly controlled by the RpfCG two-component system.

  16. Pharmacogenetically driven treatments for alcoholism: are we there yet?

    PubMed

    Arias, Albert J; Sewell, R Andrew

    2012-06-01

    Pharmacogenetic analyses of treatments for alcohol dependence attempt to predict treatment response and side-effect risk for specific medications. We review the literature on pharmacogenetics relevant to alcohol dependence treatment, and describe state-of-the-art methods of pharmacogenetic research in this area. Two main pharmacogenetic study designs predominate: challenge studies and treatment-trial analyses. Medications studied include US FDA-approved naltrexone and acamprosate, both indicated for treating alcohol dependence, as well as several investigational (and off-label) treatments such as sertraline, olanzapine and ondansetron. The best-studied functional genetic variant relevant to alcoholism treatment is rs1799971, a single-nucleotide polymorphism in exon 1 of the OPRM1 gene that encodes the μ-opioid receptor. Evidence from clinical trials suggests that the presence of the variant G allele of rs1799971 may predict better treatment response to opioid receptor antagonists such as naltrexone. Evidence from clinical trials also suggests that several medications interact pharmacogenetically with variation in genes that encode proteins involved in dopaminergic and serotonergic neurotransmission. Variation in the DRD4 gene, which encodes the dopamine D(4) receptor, may predict better response to naltrexone and olanzapine. A polymorphism in the serotonin transporter gene SLC6A4 promoter region appears related to differential treatment response to sertraline depending on the subject's age of onset of alcoholism. Genetic variation in SLC6A4 may also be associated with better treatment response to ondansetron. Initial pharmacogenetic efforts in alcohol research have identified functional variants with potential clinical utility, but more research is needed to further elucidate the mechanism of these pharmacogenetic interactions and their moderators in order to translate them into clinical practice.

  17. The 2p21 deletion syndrome: characterization of the transcription content.

    PubMed

    Parvari, Ruti; Gonen, Yael; Alshafee, Ismael; Buriakovsky, Sophia; Regev, Kfir; Hershkovitz, Eli

    2005-08-01

    The vast majority of small-deletion syndromes are caused by haploinsufficiency of one or several genes and are transmitted as dominant traits. We have previously identified a homozygous deletion of 179,311 bp on chromosome 2p21 as the cause of a unique syndrome, inherited in a recessive mode, consisting of cystinuria, neonatal seizures, hypotonia, severe somatic and developmental delay, facial dysmorphism, and reduced activity of all the respiratory chain enzymatic complexes that are encoded in the mitochondria. We now present the transcription content of this region: Multiple splicing variants of the genes protein phosphatase 1B (formerly 2C) magnesium-dependent, beta isoform (PPM1B), SLC3A1, and KIAA0436 (approved gene symbol PREPL) were identified and their patterns of expression analyzed. The spliced variants are predicted to have additional functions compared to the known variants and their patterns of expression fit the tissues affected by the syndrome. The first exon of an additional gene (C2orf34) is encoded in the deleted region and the gene is not expressed in the patients. In addition several transcripts with very short open reading frames are also encoded in the deletion. The identification of all transcripts encoded in the region deleted in the patients is the first step in the study of the genotype-phenotype correlation of the 2p21 patients.

  18. The genome-wide identification and transcriptional levels of DNA methyltransferases and demethylases in globe artichoke.

    PubMed

    Gianoglio, Silvia; Moglia, Andrea; Acquadro, Alberto; Comino, Cinzia; Portis, Ezio

    2017-01-01

    Changes to the cytosine methylation status of DNA, driven by the activity of C5 methyltransferases (C5-MTases) and demethylases, exert an important influence over development, transposon movement, gene expression and imprinting. Three groups of C5-MTase enzymes have been identified in plants, namely MET (methyltransferase 1), CMT (chromomethyltransferases) and DRM (domains rearranged methyltransferases). Here the repertoire of genes encoding C5-MTase and demethylase by the globe artichoke (Cynara cardunculus var. scolymus) is described, based on sequence homology, a phylogenetic analysis and a characterization of their functional domains. A total of ten genes encoding C5-MTase (one MET, five CMTs and four DRMs) and five demethylases was identified. An analysis of their predicted product's protein structure suggested an extensive level of conservation has been retained by the C5-MTases. Transcriptional profiling based on quantitative real time PCR revealed a number of differences between the genes encoding maintenance and de novo methyltransferases, sometimes in a tissue- or development-dependent manner, which implied a degree of functional specialization.

  19. FUBT, a putative MFS transporter, promotes secretion of fusaric acid in the cotton pathogen Fusarium oxysporum f.sp. vasinfectum

    USDA-ARS's Scientific Manuscript database

    Fusaric acid (FA), a phytotoxic polyketide produced by Fusarium oxysporum f. sp. vasinfectum (FOV), has been shown to be associated with disease symptoms on cotton. A gene located upstream of the polyketide synthase gene responsible for the biosynthesis of FA is predicted to encode a member of the ...

  20. Genes regulated by AoXlnR, the xylanolytic and cellulolytic transcriptional regulator, in Aspergillus oryzae.

    PubMed

    Noguchi, Yuji; Sano, Motoaki; Kanamaru, Kyoko; Ko, Taro; Takeuchi, Michio; Kato, Masashi; Kobayashi, Tetsuo

    2009-11-01

    XlnR is a Zn(II)2Cys6 transcriptional activator of xylanolytic and cellulolytic genes in Aspergillus. Overexpression of the aoxlnR gene in Aspergillus oryzae (A. oryzae xlnR gene) resulted in elevated xylanolytic and cellulolytic activities in the culture supernatant, in which nearly 40 secreted proteins were detected by two-dimensional electrophoresis. DNA microarray analysis to identify the transcriptional targets of AoXlnR led to the identification of 75 genes whose expression was more than fivefold higher in the AoXlnR overproducer than in the disruptant. Of these, 32 genes were predicted to encode a glycoside hydrolase, highlighting the biotechnological importance of AoXlnR in biomass degradation. The 75 genes included the genes previously identified as AoXlnR targets (xynF1, xynF3, xynG2, xylA, celA, celB, celC, and celD). Thirty-six genes were predicted to be extracellular, which was consistent with the number of proteins secreted, and 61 genes possessed putative XlnR-binding sites (5'-GGCTAA-3', 5'-GGCTAG-3', and 5'-GGCTGA-3') in their promoter regions. Functional annotation of the genes revealed that AoXlnR regulated the expression of hydrolytic genes for degradation of beta-1,4-xylan, arabinoxylan, cellulose, and xyloglucan and of catabolic genes for the conversion of D-xylose to xylulose-5-phosphate. In addition, genes encoding glucose-6-phosphate 1-dehydrogenase and L-arabinitol-4-dehydrogenase involved in D-glucose and L-arabinose catabolism also appeared to be targets of AoXlnR.
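
    The search for putative XlnR-binding sites in promoter regions can be illustrated with a minimal sketch. It assumes a plain exact-match scan of both strands for the three motifs quoted in the abstract; the function names and the example promoter fragment are hypothetical, and the abstract does not say which tool or promoter length the authors actually used.

```python
import re

# The three putative XlnR-binding motifs quoted in the abstract (5'->3').
XLNR_MOTIFS = ("GGCTAA", "GGCTAG", "GGCTGA")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_xlnr_sites(promoter):
    """Return sorted (position, strand, motif) hits.

    Positions are 0-based offsets on the forward strand. A '-' hit means the
    motif was found as its reverse complement, i.e. on the opposite strand.
    """
    promoter = promoter.upper()
    hits = []
    for motif in XLNR_MOTIFS:
        for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
            # A lookahead reports overlapping occurrences as well.
            for match in re.finditer("(?=" + pattern + ")", promoter):
                hits.append((match.start(), strand, motif))
    return sorted(hits)


# Hypothetical promoter fragment, made up for illustration only.
example_promoter = "TTGGCTAAACGTTTAGCCAT"
print(find_xlnr_sites(example_promoter))
```

    Scanning both strands is a design choice of this sketch: the abstract lists the motifs in one orientation only and does not state whether orientation matters.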

  1. Organization of the Escherichia coli K-12 gene cluster responsible for production of the extracellular polysaccharide colanic acid.

    PubMed Central

    Stevenson, G; Andrianopoulos, K; Hobbs, M; Reeves, P R

    1996-01-01

    Colanic acid (CA) is an extracellular polysaccharide produced by most Escherichia coli strains as well as by other species of the family Enterobacteriaceae. We have determined the sequence of a 23-kb segment of the E. coli K-12 chromosome which includes the cluster of genes necessary for production of CA. The CA cluster comprises 19 genes. Two other sequenced genes (orf1.3 and galF), which are situated between the CA cluster and the O-antigen cluster, were shown to be unnecessary for CA production. The CA cluster includes genes for synthesis of GDP-L-fucose, one of the precursors of CA, and the gene for one of the enzymes in this pathway (GDP-D-mannose 4,6-dehydratase) was identified by biochemical assay. Six of the inferred proteins show sequence similarity to glycosyl transferases, and two others have sequence similarity to acetyl transferases. Another gene (wzx) is predicted to encode a protein with multiple transmembrane segments and may function in export of the CA repeat unit from the cytoplasm into the periplasm in a process analogous to O-unit export. The first three genes of the cluster are predicted to encode an outer membrane lipoprotein, a phosphatase, and an inner membrane protein with an ATP-binding domain. Since homologs of these genes are found in other extracellular polysaccharide gene clusters, they may have a common function, such as export of polysaccharide from the cell. PMID:8759852

  2. Gene Expression Changes in Phosphorus Deficient Potato (Solanum tuberosum L.) Leaves and the Potential for Diagnostic Gene Expression Markers

    PubMed Central

    Hammond, John P.; Broadley, Martin R.; Bowen, Helen C.; Spracklen, William P.; Hayden, Rory M.; White, Philip J.

    2011-01-01

    Background: There are compelling economic and environmental reasons to reduce our reliance on inorganic phosphate (Pi) fertilisers. Better management of Pi fertiliser applications is one option to improve the efficiency of Pi fertiliser use, whilst maintaining crop yields. Application rates of Pi fertilisers are traditionally determined from analyses of soil or plant tissues. Alternatively, diagnostic genes with altered expression under Pi limiting conditions that suggest a physiological requirement for Pi fertilisation, could be used to manage Pi fertiliser applications, and might be more precise than indirect measurements of soil or tissue samples. Results: We grew potato (Solanum tuberosum L.) plants hydroponically, under glasshouse conditions, to control their nutrient status accurately. Samples of total leaf RNA taken periodically after Pi was removed from the nutrient solution were labelled and hybridised to potato oligonucleotide arrays. A total of 1,659 genes were significantly differentially expressed following Pi withdrawal. These included genes that encode proteins involved in lipid, protein, and carbohydrate metabolism, characteristic of Pi deficient leaves and included potential novel roles for genes encoding patatin like proteins in potatoes. The array data were analysed using a support vector machine algorithm to identify groups of genes that could predict the Pi status of the crop. These groups of diagnostic genes were tested using field grown potatoes that had either been fertilised or unfertilised. A group of 200 genes could correctly predict the Pi status of field grown potatoes. Conclusions: This paper provides a proof-of-concept demonstration for using microarrays and class prediction tools to predict the Pi status of a field grown potato crop. There is potential to develop this technology for other biotic and abiotic stresses in field grown crops. 
Ultimately, a better understanding of crop stresses may improve our management of the crop, improving the sustainability of agriculture. PMID:21935429

  3. The VPH1 gene encodes a 95-kDa integral membrane polypeptide required for in vivo assembly and activity of the yeast vacuolar H(+)-ATPase.

    PubMed

    Manolson, M F; Proteau, D; Preston, R A; Stenbit, A; Roberts, B T; Hoyt, M A; Preuss, D; Mulholland, J; Botstein, D; Jones, E W

    1992-07-15

    Yeast vacuolar acidification-defective (vph) mutants were identified using the pH-sensitive fluorescence of 6-carboxyfluorescein diacetate (Preston, R. A., Murphy, R. F., and Jones, E. W. (1989) Proc. Natl. Acad. Sci. U.S.A. 86, 7027-7031). Vacuoles purified from yeast bearing the vph1-1 mutation had no detectable bafilomycin-sensitive ATPase activity or ATP-dependent proton pumping. The peripherally bound nucleotide-binding subunits of the vacuolar H(+)-ATPase (60 and 69 kDa) were no longer associated with vacuolar membranes yet were present in wild type levels in yeast whole cell extracts. The VPH1 gene was cloned by complementation of the vph1-1 mutation and independently cloned by screening a lambda gt11 expression library with antibodies directed against a 95-kDa vacuolar integral membrane protein. Deletion disruption of the VPH1 gene revealed that the VPH1 gene is not essential for viability but is required for vacuolar H(+)-ATPase assembly and vacuolar acidification. VPH1 encodes a predicted polypeptide of 840 amino acid residues (molecular mass 95.6 kDa) and contains six putative membrane-spanning regions. Cell fractionation and immunodetection demonstrate that Vph1p is a vacuolar integral membrane protein that co-purifies with vacuolar H(+)-ATPase activity. Multiple sequence alignments show extensive homology over the entire lengths of the following four polypeptides: Vph1p, the 116-kDa polypeptide of the rat clathrin-coated vesicles/synaptic vesicle proton pump, the predicted polypeptide encoded by the yeast gene STV1 (Similar To VPH1, identified as an open reading frame next to the BUB2 gene), and the TJ6 mouse immune suppressor factor.

  4. Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

    PubMed

    Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

    2018-02-23

    Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate that the given gene products carry out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from the massive set of terms defined by GO is a difficult challenge. To address this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash first measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and that it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. In these experiments, HPHash again significantly improves the prediction performance. The code for HPHash is available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.

  5. Cloning and sequencing of a gene encoding a 21-kilodalton outer membrane protein from Bordetella avium and expression of the gene in Salmonella typhimurium.

    PubMed Central

    Gentry-Weeks, C R; Hultsch, A L; Kelly, S M; Keith, J M; Curtiss, R

    1992-01-01

    Three gene libraries of Bordetella avium 197 DNA were prepared in Escherichia coli LE392 by using the cosmid vectors pCP13 and pYA2329, a derivative of pCP13 specifying spectinomycin resistance. The cosmid libraries were screened with convalescent-phase anti-B. avium turkey sera and polyclonal rabbit antisera against B. avium 197 outer membrane proteins. One E. coli recombinant clone produced a 56-kDa protein which reacted with convalescent-phase serum from a turkey infected with B. avium 197. In addition, five E. coli recombinant clones were identified which produced B. avium outer membrane proteins with molecular masses of 21, 38, 40, 43, and 48 kDa. At least one of these E. coli clones, which encoded the 21-kDa protein, reacted with both convalescent-phase turkey sera and antibody against B. avium 197 outer membrane proteins. The gene for the 21-kDa outer membrane protein was localized by Tn5seq1 mutagenesis, and the nucleotide sequence was determined by dideoxy sequencing. DNA sequence analysis of the 21-kDa protein revealed an open reading frame of 582 bases that resulted in a predicted protein of 194 amino acids. Comparison of the predicted amino acid sequence of the gene encoding the 21-kDa outer membrane protein with protein sequences in the National Biomedical Research Foundation protein sequence data base indicated significant homology to the OmpA proteins of Shigella dysenteriae, Enterobacter aerogenes, E. coli, and Salmonella typhimurium and to Neisseria gonorrhoeae outer membrane protein III, Haemophilus influenzae protein P6, and Pseudomonas aeruginosa porin protein F. The gene (ompA) encoding the B. avium 21-kDa protein hybridized with 4.1-kb DNA fragments from EcoRI-digested, chromosomal DNA of Bordetella pertussis and Bordetella bronchiseptica and with 6.0- and 3.2-kb DNA fragments from EcoRI-digested, chromosomal DNA of B. avium and B. avium-like DNA, respectively. A 6.75-kb DNA fragment encoding the B. 
avium 21-kDa protein was subcloned into the Asd+ vector pYA292, and the construct was introduced into the avirulent delta cya delta crp delta asd S. typhimurium chi 3987 for oral immunization of birds. The gene encoding the 21-kDa protein was expressed equivalently in B. avium 197, delta asd E. coli chi 6097, and S. typhimurium chi 3987 and was localized primarily in the cytoplasmic membrane and outer membrane. In preliminary studies on oral inoculation of turkey poults with S. typhimurium chi 3987 expressing the gene encoding the B. avium 21-kDa protein, it was determined that a single dose of the recombinant Salmonella vaccine failed to elicit serum antibodies against the 21-kDa protein and challenge with wild-type B. avium 197 resulted in colonization of the trachea and thymus with B. avium 197. PMID:1447140
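
    The relationship between the 582-base open reading frame and the 194-residue predicted protein in this record is simple codon arithmetic, sketched below. The helper name is hypothetical, and the abstract does not state whether the 582 bases include a stop codon, so the sketch only counts codons.

```python
def codons_in_orf(orf_length_bases):
    """Number of codons in an open reading frame of the given length.

    Raises ValueError if the length is not a positive multiple of three,
    since a complete reading frame consists of whole codons.
    """
    if orf_length_bases <= 0 or orf_length_bases % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    return orf_length_bases // 3


# 582 bases, as reported in the abstract.
print(codons_in_orf(582))  # 194
```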

  6. Satellite remote sensing data can be used to model marine microbial metabolite turnover

    PubMed Central

    Larsen, Peter E; Scott, Nicole; Post, Anton F; Field, Dawn; Knight, Rob; Hamada, Yuki; Gilbert, Jack A

    2015-01-01

    Sampling ecosystems, even at a local scale, at the temporal and spatial resolution necessary to capture natural variability in microbial communities is prohibitively expensive. We extrapolated marine surface microbial community structure and metabolic potential from 72 16S rRNA amplicon and 8 metagenomic observations using remotely sensed environmental parameters to create a system-scale model of marine microbial metabolism for 5904 grid cells (49 km²) in the Western English Channel, across 3 years of weekly averages. Thirteen environmental variables predicted the relative abundance of 24 bacterial Orders and 1715 unique enzyme-encoding genes that encode turnover of 2893 metabolites. The genes' predicted relative abundance was highly correlated (Pearson Correlation 0.72, P-value < 10^-6) with their observed relative abundance in sequenced metagenomes. Predictions of the relative turnover (synthesis or consumption) of CO2 were significantly correlated with observed surface CO2 fugacity. The spatial and temporal variation in the predicted relative abundances of genes coding for cyanase, carbon monoxide and malate dehydrogenase were investigated along with the predicted inter-annual variation in relative consumption or production of ∼3000 metabolites forming six significant temporal clusters. These spatiotemporal distributions could possibly be explained by the co-occurrence of anaerobic and aerobic metabolisms associated with localized plankton blooms or sediment resuspension, which facilitate the presence of anaerobic micro-niches. This predictive model provides a general framework for focusing future sampling and experimental design to relate biogeochemical turnover to microbial ecology. PMID:25072414
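
    The agreement between predicted and observed relative abundances in this record is summarized by a Pearson correlation. A minimal sketch of that statistic follows; the function name and the toy input values are assumptions made for illustration and are not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares of the centered values.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Toy values (hypothetical): predicted vs. observed relative abundance.
predicted = [0.10, 0.20, 0.15, 0.40, 0.05]
observed = [0.12, 0.18, 0.20, 0.35, 0.07]
print(round(pearson_r(predicted, observed), 3))
```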

  7. Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

    PubMed

    Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2004-02-01

    To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
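
    The counts reported in this record are internally consistent, as the short check below illustrates. The variable names are hypothetical; the figures are those quoted in the abstract.

```python
# Figures quoted in the abstract.
contigs = 8503
singletons = 11954
non_redundant = contigs + singletons  # reported as 20457

genes_hit = 1093
genes_predicted = 1889
# Share of predicted protein-encoding genes hit by at least one EST.
coverage_pct = round(100 * genes_hit / genes_predicted, 1)

print(non_redundant, coverage_pct)  # 20457 57.9
```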

  8. Expansion and diversification of the MSDIN family of cyclic peptide genes in the poisonous agarics Amanita phalloides and A. bisporigera

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pulman, Jane A.; Childs, Kevin L.; Sgambelluri, R. Michael

    Here, the cyclic peptide toxins of Amanita mushrooms, such as α-amanitin and phalloidin, are encoded by the “MSDIN” gene family and ribosomally biosynthesized. Based on partial genome sequence and PCR analysis, some members of the MSDIN family were previously identified in Amanita bisporigera, and several other members are known from other species of Amanita. However, the complete complement in any one species, and hence the genetic capacity for these fungi to make cyclic peptides, remains unknown. As a result, draft genome sequences of two cyclic peptide-producing mushrooms, the “Death Cap” A. phalloides and the “Destroying Angel” A. bisporigera, were obtained. Each species has ~30 MSDIN genes, most of which are predicted to encode unknown cyclic peptides. Some MSDIN genes were duplicated in one or the other species, but only three were common to both species. A gene encoding cycloamanide B, a previously described nontoxic cyclic heptapeptide, was also present in A. phalloides, but genes for antamanide and cycloamanides A, C, and D were not. In A. bisporigera, RNA expression was observed for 20 of the MSDIN family members. Based on their predicted sequences, novel cyclic peptides were searched for by LC/MS/MS in extracts of A. phalloides. The presence of two cyclic peptides, named cycloamanides E and F with structures cyclo(SFFFPVP) and cyclo(IVGILGLP), was thereby demonstrated. Of the MSDIN genes reported earlier from another specimen of A. bisporigera, 9 of 14 were not found in the current genome assembly. Differences between previous and current results for the complement of MSDIN genes and cyclic peptides in the two fungi probably represent natural variation among geographically dispersed isolates of A. phalloides and among the members of the poorly defined A. bisporigera species complex. Both A. phalloides and A. 
bisporigera contain two prolyl oligopeptidase genes, one of which (POPB) is probably dedicated to cyclic peptide biosynthesis as it is in Galerina marginata. Finally, the MSDIN gene family has expanded and diverged rapidly in Amanita section Phalloideae. Together, A. bisporigera and A. phalloides are predicted to have the capacity to make more than 50 cyclic hexa-, hepta-, octa-, nona- and decapeptides.

  9. Expansion and diversification of the MSDIN family of cyclic peptide genes in the poisonous agarics Amanita phalloides and A. bisporigera

    DOE PAGES

    Pulman, Jane A.; Childs, Kevin L.; Sgambelluri, R. Michael; ...

    2016-12-15

    Here, the cyclic peptide toxins of Amanita mushrooms, such as α-amanitin and phalloidin, are encoded by the “MSDIN” gene family and ribosomally biosynthesized. Based on partial genome sequence and PCR analysis, some members of the MSDIN family were previously identified in Amanita bisporigera, and several other members are known from other species of Amanita. However, the complete complement in any one species, and hence the genetic capacity for these fungi to make cyclic peptides, remains unknown. As a result, draft genome sequences of two cyclic peptide-producing mushrooms, the “Death Cap” A. phalloides and the “Destroying Angel” A. bisporigera, were obtained. Each species has ~30 MSDIN genes, most of which are predicted to encode unknown cyclic peptides. Some MSDIN genes were duplicated in one or the other species, but only three were common to both species. A gene encoding cycloamanide B, a previously described nontoxic cyclic heptapeptide, was also present in A. phalloides, but genes for antamanide and cycloamanides A, C, and D were not. In A. bisporigera, RNA expression was observed for 20 of the MSDIN family members. Based on their predicted sequences, novel cyclic peptides were searched for by LC/MS/MS in extracts of A. phalloides. The presence of two cyclic peptides, named cycloamanides E and F with structures cyclo(SFFFPVP) and cyclo(IVGILGLP), was thereby demonstrated. Of the MSDIN genes reported earlier from another specimen of A. bisporigera, 9 of 14 were not found in the current genome assembly. Differences between previous and current results for the complement of MSDIN genes and cyclic peptides in the two fungi probably represent natural variation among geographically dispersed isolates of A. phalloides and among the members of the poorly defined A. bisporigera species complex. Both A. phalloides and A. 
bisporigera contain two prolyl oligopeptidase genes, one of which (POPB) is probably dedicated to cyclic peptide biosynthesis as it is in Galerina marginata. Finally, the MSDIN gene family has expanded and diverged rapidly in Amanita section Phalloideae. Together, A. bisporigera and A. phalloides are predicted to have the capacity to make more than 50 cyclic hexa-, hepta-, octa-, nona- and decapeptides.

  10. Screening of Metagenomic and Genomic Libraries Reveals Three Classes of Bacterial Enzymes That Overcome the Toxicity of Acrylate

    PubMed Central

    Curson, Andrew R. J.; Burns, Oliver J.; Voget, Sonja; Daniel, Rolf; Todd, Jonathan D.; McInnis, Kathryn; Wexler, Margaret; Johnston, Andrew W. B.

    2014-01-01

    Acrylate is produced in significant quantities through the microbial cleavage of the highly abundant marine osmoprotectant dimethylsulfoniopropionate, an important process in the marine sulfur cycle. Acrylate can inhibit bacterial growth, likely through its conversion to the highly toxic molecule acrylyl-CoA. Previous work identified an acrylyl-CoA reductase, encoded by the gene acuI, as being important for conferring on bacteria the ability to grow in the presence of acrylate. However, some bacteria lack acuI, and, conversely, many bacteria that may not encounter acrylate in their regular environments do contain this gene. We therefore sought to identify new genes that might confer tolerance to acrylate. To do this, we used functional screening of metagenomic and genomic libraries to identify novel genes that corrected an E. coli mutant that was defective in acuI, and was therefore hyper-sensitive to acrylate. The metagenomic libraries yielded two types of genes that overcame this toxicity. The majority encoded enzymes resembling AcuI, but with significant sequence divergence among each other and previously ratified AcuI enzymes. One other metagenomic gene, arkA, had very close relatives in Bacillus and related bacteria, and is predicted to encode an enoyl-acyl carrier protein reductase, in the same family as FabK, which catalyses the final step in fatty-acid biosynthesis in some pathogenic Firmicute bacteria. A genomic library of Novosphingobium, a metabolically versatile alphaproteobacterium that lacks both acuI and arkA, yielded vutD and vutE, two genes that, together, conferred acrylate resistance. These encode sequential steps in the oxidative catabolism of valine in a pathway in which, significantly, methacrylyl-CoA is a toxic intermediate. 
These findings expand the range of bacteria for which the acuI gene encodes a functional acrylyl-CoA reductase, and also identify novel enzymes that can similarly function in conferring acrylate resistance, likely, again, through the removal of the toxic product acrylyl-CoA. PMID:24848004

  11. Rice Ribosomal Protein Large Subunit Genes and Their Spatio-temporal and Stress Regulation

    PubMed Central

    Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Madhav, Sheshu M.; Kirti, P. B.

    2016-01-01

    Ribosomal proteins (RPs) are well-known for their role in mediating protein synthesis and maintaining the stability of the ribosomal complex, which includes small and large subunits. In the present investigation, in a genome-wide survey, we predicted that the large subunit of rice ribosomes is encoded by at least 123 genes including individual gene copies, distributed throughout the 12 chromosomes. We selected 34 candidate genes, each having 2–3 identical copies, for a detailed characterization of their gene structures, protein properties, cis-regulatory elements and comprehensive expression analysis. RPL proteins appear to be involved in interactions with other RP and non-RP proteins and their encoded RNAs have a higher content of alpha-helices in their predicted secondary structures. The majority of RPs have binding sites for metal and non-metal ligands. Native expression profiling of 34 ribosomal protein large (RPL) subunit genes in tissues covering the major stages of rice growth shows that they are predominantly expressed in vegetative tissues and seedlings followed by meiotically active tissues like flowers. The putative promoter regions of these genes also carry cis-elements that respond specifically to stress and signaling molecules. All the 34 genes responded differentially to the abiotic stress treatments. Phytohormone and cold treatments induced significant up-regulation of several RPL genes, while heat and H2O2 treatments down-regulated a majority of them. Furthermore, infection with a bacterial pathogen, Xanthomonas oryzae, which causes leaf blight also induced the expression of 80% of the RPL genes in leaves. Although the expression of RPL genes was detected in all the tissues studied, they are highly responsive to stress and signaling molecules indicating that their encoded proteins appear to have roles in stress amelioration besides house-keeping. 
This shows that the RPL gene family is a valuable resource for manipulation of stress tolerance in rice and other crops, which may be achieved by overexpressing and raising independent transgenic plants carrying the genes that became up-regulated significantly and instantaneously. PMID:27605933

  12. Developmental Regulation of Genes Encoding Universal Stress Proteins in Schistosoma mansoni

    PubMed Central

    Isokpehi, Raphael D.; Mahmud, Ousman; Mbah, Andreas N.; Simmons, Shaneka S.; Avelar, Lívia; Rajnarayanan, Rajendram V.; Udensi, Udensi K.; Ayensu, Wellington K.; Cohly, Hari H.; Brown, Shyretha D.; Dates, Centdrika R.; Hentz, Sonya D.; Hughes, Shawntae J.; Smith-McInnis, Dominique R.; Patterson, Carvey O.; Sims, Jennifer N.; Turner, Kelisha T.; Williams, Baraka S.; Johnson, Matilda O.; Adubi, Taiwo; Mbuh, Judith V.; Anumudu, Chiaka I.; Adeoye, Grace O.; Thomas, Bolaji N.; Nashiru, Oyekanmi; Oliveira, Guilherme

    2011-01-01

    The draft nuclear genome sequence of the snail-transmitted, dimorphic, parasitic, platyhelminth Schistosoma mansoni revealed eight genes encoding proteins that contain the Universal Stress Protein (USP) domain. Schistosoma mansoni is a causative agent of human schistosomiasis, a severe and debilitating Neglected Tropical Disease (NTD) of poverty, which is endemic in at least 76 countries. The availability of the genome sequences of Schistosoma species presents opportunities for bioinformatics and genomics analyses of associated gene families that could be targets for understanding schistosomiasis ecology, intervention, prevention and control. Proteins with the USP domain are known to provide bacteria, archaea, fungi, protists and plants with the ability to respond to diverse environmental stresses. In this research investigation, the functional annotations of the USP genes and predicted nucleotide and protein sequences were initially verified. Subsequently, sequence clusters and distinctive features of the sequences were determined. A total of twelve ligand binding sites were predicted based on alignment to the ATP-binding universal stress protein from Methanocaldococcus jannaschii. In addition, six USP sequences showed the presence of ATP-binding motif residues indicating that they may be regulated by ATP. Public domain gene expression data and RT-PCR assays confirmed that all the S. mansoni USP genes were transcribed in at least one of the developmental life cycle stages of the helminth. Six of these genes were up-regulated in the miracidium, a free-swimming stage that is critical for transmission to the snail intermediate host. It is possible that during the intra-snail stages, S. mansoni gene transcripts for universal stress proteins are low abundant and are induced to perform specialized functions triggered by environmental stressors such as oxidative stress due to hydrogen peroxide that is present in the snail hemocytes. 
This report serves to catalyze the formation of a network of researchers to understand the function and regulation of the universal stress proteins encoded in genomes of schistosomes and their snail intermediate hosts. PMID:22084571

  13. Transposon Insertions of magellan-4 That Impair Social Gliding Motility in Myxococcus xanthus

    PubMed Central

    Youderian, Philip; Hartzell, Patricia L.

    2006-01-01

    Myxococcus xanthus has two different mechanisms of motility, adventurous (A) motility, which permits individual cells to glide over solid surfaces, and social (S) motility, which permits groups of cells to glide. To identify the genes involved in S-gliding motility, we mutagenized a ΔaglU (A−) strain with the defective transposon, magellan-4, and screened for S− mutants that form nonmotile colonies. Sequence analysis of the sites of the magellan-4 insertions in these mutants and the alignment of these sites with the M. xanthus genome sequence show that two-thirds of these insertions lie within 27 of the 37 nonessential genes known to be required for social motility, including those necessary for the biogenesis of type IV pili, exopolysaccharide, and lipopolysaccharide. The remaining insertions also identify 31 new, nonessential genes predicted to encode both structural and regulatory determinants of S motility. These include three tetratricopeptide repeat proteins, several regulators of transcription that may control the expression of genes involved in pilus extension and retraction, and additional enzymes involved in polysaccharide metabolism. Three insertions that abolish S motility lie within genes predicted to encode glycolytic enzymes, suggesting that the signal for pilus retraction may be a simple product of exopolysaccharide catabolism. PMID:16299386

  14. Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

    PubMed

    Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

    1998-10-20

    Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92.2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.

  15. Regulation of Oil Biosynthesis in Algae

    DTIC Science & Technology

    2008-06-25

    for future engineering purposes 3. Biochemical analysis of diacylglycerol acyltransferases (DGATs). These are key enzymes of oil biosynthesis...catalyzing the assembly of triacylglycerol in many organisms. 5 Genes predicted to encode DGATs and their role in triacylglycerol biosynthesis were identified

  16. MicroRNA biogenesis and function in plants.

    PubMed

    Chen, Xuemei

    2005-10-31

    A microRNA (miRNA) is a 21-24 nucleotide RNA product of a non-protein-coding gene. Plants, like animals, have a large number of miRNA-encoding genes in their genomes. The biogenesis of miRNAs in Arabidopsis is similar to that in animals in that miRNAs are processed from primary precursors by at least two steps mediated by RNase III-like enzymes and that the miRNAs are incorporated into a protein complex named RISC. However, the biogenesis of plant miRNAs consists of an additional step, i.e., the miRNAs are methylated on the ribose of the last nucleotide by the miRNA methyltransferase HEN1. The high degree of sequence complementarity between plant miRNAs and their target mRNAs has facilitated the bioinformatic prediction of miRNA targets, many of which have been subsequently validated. Plant miRNAs have been predicted or confirmed to regulate a variety of processes, such as development, metabolism, and stress responses. A large category of miRNA targets consists of genes encoding transcription factors that play important roles in patterning the plant form.

  17. Genomic analysis of the type VI secretion systems in Pseudomonas spp.: novel clusters and putative effectors uncovered.

    PubMed

    Barret, Matthieu; Egan, Frank; Fargier, Emilie; Morrissey, John P; O'Gara, Fergal

    2011-06-01

    Bacteria encode multiple protein secretion systems that are crucial for interaction with the environment and with hosts. In recent years, attention has focused on type VI secretion systems (T6SSs), which are specialized transporters widely encoded in Proteobacteria. The myriad of processes associated with these secretion systems could be explained by subclasses of T6SS, each involved in specialized functions. To assess diversity and predict function associated with different T6SSs, comparative genomic analysis of 34 Pseudomonas genomes was performed. This identified 70 T6SSs, with at least one locus in every strain, except for Pseudomonas stutzeri A1501. By comparing 11 core genes of the T6SS, it was possible to identify five main Pseudomonas phylogenetic clusters, with strains typically carrying T6SSs from more than one clade. In addition, most strains encode additional vgrG and hcp genes, which encode extracellular structural components of the secretion apparatus. Using a combination of phylogenetic and meta-analysis of transcriptome datasets it was possible to associate specific subsets of VgrG and Hcp proteins with each Pseudomonas T6SS clade. Moreover, a closer examination of the genomic context of vgrG genes in multiple strains highlights a number of additional genes associated with these regions. It is proposed that these genes may play a role in secretion or alternatively could be new T6S effectors.

  18. Whole exome sequencing with genomic triangulation implicates CDH2-encoded N-cadherin as a novel pathogenic substrate for arrhythmogenic cardiomyopathy.

    PubMed

    Turkowski, Kari L; Tester, David J; Bos, J Martijn; Haugaa, Kristina H; Ackerman, Michael J

    2017-03-01

    Arrhythmogenic cardiomyopathy (ACM) is a heritable disease characterized by fibrofatty replacement of cardiomyocytes, has a prevalence of approximately 1 in 5000 individuals, and accounts for approximately 20% of sudden cardiac death in the young (≤35 years). ACM is most often inherited as an autosomal dominant trait with incomplete penetrance and variable expression. While mutations in several genes that encode key desmosomal proteins underlie about half of all ACM, the remainder is elusive genetically. Here, whole exome sequencing (WES) was performed with genomic triangulation in an effort to identify a novel explanation for a phenotype-positive, genotype-negative multi-generational pedigree with a presumed autosomal dominant, maternal inheritance of ACM. WES and genomic triangulation was performed on a symptomatic 14-year-old female proband, her affected mother and affected sister, and her unaffected father to elucidate a novel ACM-susceptibility gene for this pedigree. Following variant filtering using Ingenuity® Variant Analysis, gene priority ranking was performed on the candidate genes using ToppGene and Endeavour. The phylogenetic and physiochemical properties of candidate mutations were assessed further by 6 in silico prediction tools. Species alignment and amino acid conservation analysis was performed using the Uniprot Consortium. Tissue expression data was abstracted from Expression Atlas. Following WES and genomic triangulation, CDH2 emerged as a novel, autosomal dominant, ACM-susceptibility gene. The CDH2-encoded N-cadherin is a cell-cell adhesion protein predominately expressed in the heart. Cardiac dysfunction has been demonstrated in prior CDH2 knockout and over-expression animal studies. Further in silico mutation prediction, species conservation, and protein expression analysis supported the ultra-rare (minor allele frequency <0.005%) p.Asp407Asn-CDH2 variant as a likely pathogenic variant. 
Herein, it is demonstrated that genetic mutations in CDH2-encoded N-cadherin may represent a novel pathogenetic basis for ACM in humans. The prevalence of CDH2-mediated ACM in heretofore genetically elusive ACM remains to be determined. © 2017 Wiley Periodicals, Inc.

  19. Experimental verification of a predicted novel microRNA located in human PIK3CA gene with a potential oncogenic function in colorectal cancer.

    PubMed

    Saleh, Ali Jason; Soltani, Bahram M; Dokanehiifard, Sadat; Medlej, Abdallah; Tavalaei, Mahmoud; Mowla, Seyed Javad

    2016-10-01

    PI3K/AKT signaling is involved in cell survival, proliferation, and migration. In this pathway, the PI3Kα enzyme is composed of a regulatory protein encoded by the p85 gene and a catalytic protein encoded by the PIK3CA gene. The human PIK3CA locus is amplified in several cancers including lung and colorectal cancer (CRC). Therefore, microRNAs (miRNAs) that are encoded within the PIK3CA gene might have a role in cancer development. Here, we report a novel microRNA named PIK3CA-miR1 (EBI accession no. LN626315), which is located within the PIK3CA gene. A DNA segment corresponding to the PIK3CA-premir1 sequence was transfected into human cell lines, which resulted in generation of mature exogenous PIK3CA-miR1. Following the overexpression of PIK3CA-miR1, its predicted target genes (APPL1 and TrkC) were significantly downregulated in the CRC-originated HCT116 and SW480 cell lines, as detected by qRT-PCR. A dual luciferase assay then supported the interaction of PIK3CA-miR1 with APPL1 and TrkC transcripts. Endogenous PIK3CA-miR1 expression was also detected in several cell lines (highly in HCT116 and SW480) and highly in CRC specimens. Consistently, overexpression of PIK3CA-premir1 in HCT116 and SW480 cells resulted in significant reduction of the sub-G1 cell distribution and apoptotic cell rate, as detected by flow cytometry, and resulted in increased cell proliferation, as detected by 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide (MTT) assay. PIK3CA-miR1 overexpression also resulted in Wnt signaling upregulation detected by Top/Fop assay. Overall, the accumulated evidence indicates the presence of a bona fide novel onco-miRNA encoded within the PIK3CA oncogene, which is highly expressed in colorectal cancer and has a survival effect in CRC-originated cells.

  20. The Dynein Gene Family in Chlamydomonas Reinhardtii

    PubMed Central

    Porter, M. E.; Knott, J. A.; Myster, S. H.; Farlow, S. J.

    1996-01-01

    To correlate dynein heavy chain (Dhc) genes with flagellar mutations and gain insight into the function of specific dynein isoforms, we placed eight members of the Dhc gene family on the genetic map of Chlamydomonas. Using a PCR-based strategy, we cloned 11 Dhc genes from Chlamydomonas. Comparisons with other Dhc genes indicate that two clones correspond to genes encoding the alpha and beta heavy chains of the outer dynein arm. Alignment of the predicted amino acid sequences spanning the nucleotide binding site indicates that the remaining nine clones can be subdivided into three groups that are likely to include representatives of the inner-arm Dhc isoforms. Gene-specific probes reveal that each clone represents a single-copy gene that is expressed as a transcript of the appropriate size (>13 kb) sufficient to encode a high molecular weight Dhc polypeptide. The expression of all nine genes is upregulated in response to deflagellation, suggesting a role in axoneme assembly or motility. Restriction fragment length polymorphisms between divergent C. reinhardtii strains have been used to place each Dhc gene on the genetic map of Chlamydomonas. These studies lay the groundwork for correlating defects in different Dhc genes with specific flagellar mutations. PMID:8889521

  1. Chimeric Amino Acid Rearrangements as Immune Targets in Prostate Cancer

    DTIC Science & Technology

    2016-05-01

    plot showing gene fusions between exon boundaries Figure 3. Lum (PC141070) A B Figure 4. Recurrent fusion genes present in the TCGA intermediate and...class I restricted epitopes in 6 out of 50 patient tumors. One recurrent gene fusion encoded by the TMPRSS2:ERG type VI fusion was detected in 3...found to have high-affinity (IEDB score អ nM) MHC class I predicted epitopes. Recurrent fusions In a comparative analysis across the patient

  2. Molecular and Mutational Analysis of a Gelsolin-Family Member Encoded by the Flightless I Gene of Drosophila Melanogaster

    PubMed Central

    de-Couet, H. G.; Fong, KSK.; Weeds, A. G.; McLaughlin, P. J.; Miklos, GLG.

    1995-01-01

    The flightless locus of Drosophila melanogaster has been analyzed at the genetic, molecular, ultrastructural and comparative crystallographic levels. The gene encodes a single transcript encoding a protein consisting of a leucine-rich amino-terminal half and a carboxy-terminal half with high sequence similarity to gelsolin. We determined the genomic sequence of the flightless landscape, the breakpoints of four chromosomal rearrangements, and the molecular lesions in two lethal and two viable alleles of the gene. The two alleles that lead to flight muscle abnormalities encode mutant proteins exhibiting amino acid replacements within the S1-like domain of their gelsolin-like region. Furthermore, the deduced intron-exon structure of the D. melanogaster gene has been compared with that of the Caenorhabditis elegans homologue. In addition, the sequence similarities of the flightless protein with gelsolin allow it to be evaluated in the context of the published crystallographic structure of the S1 domain of gelsolin. Amino acids considered essential for the structural integrity of the core are found to be highly conserved in the predicted flightless protein. Some of the residues considered essential for actin and calcium binding in gelsolin S1 and villin V1 are also well conserved. These data are discussed in light of the phenotypic characteristics of the mutants and the putative functions of the protein. PMID:8582612

  3. Proteiniphilum saccharofermentans str. M3/6T isolated from a laboratory biogas reactor is versatile in polysaccharide and oligopeptide utilization as deduced from genome-based metabolic reconstructions.

    PubMed

    Tomazetto, Geizecler; Hahnke, Sarah; Wibberg, Daniel; Pühler, Alfred; Klocke, Michael; Schlüter, Andreas

    2018-06-01

    Proteiniphilum saccharofermentans str. M3/6T is a recently described species within the family Porphyromonadaceae (phylum Bacteroidetes), which was isolated from a mesophilic laboratory-scale biogas reactor. The genome of the strain was completely sequenced and manually annotated to reconstruct its metabolic potential regarding biomass degradation and fermentation pathways. The P. saccharofermentans str. M3/6T genome consists of a 4,414,963 bp chromosome featuring an average GC-content of 43.63%. Genome analyses revealed that the strain possesses 3396 protein-coding sequences. Among them are 158 genes assigned to the carbohydrate-active-enzyme families as defined by the CAZy database, including 116 genes encoding glycosyl hydrolases (GHs) involved in pectin, arabinogalactan, hemicellulose (arabinan, xylan, mannan, β-glucans), starch, fructan and chitin degradation. The strain also features several transporter genes, some of which are located in polysaccharide utilization loci (PUL). PUL gene products are involved in glycan binding, transport and utilization at the cell surface. In the genome of strain M3/6T, 64 PUL are present and most of them in association with genes encoding carbohydrate-active enzymes. Accordingly, the strain was predicted to metabolize several sugars yielding carbon dioxide, hydrogen, acetate, formate, propionate and isovalerate as end-products of the fermentation process. Moreover, P. saccharofermentans str. M3/6T encodes extracellular and intracellular proteases and transporters predicted to be involved in protein and oligopeptide degradation. Comparative analyses between P. saccharofermentans str. M3/6T and its closest described relative P. acetatigenes str. DSM 18083T indicate that both strains share a similar metabolism regarding decomposition of complex carbohydrates and fermentation of sugars.

  4. Identification of two novel mammalian genes establishes a subfamily of KH-domain RNA-binding proteins.

    PubMed

    Makeyev, A V; Liebhaber, S A

    2000-08-01

    We have identified two novel human genes encoding proteins with a high level of sequence identity to two previously characterized RNA-binding proteins, alphaCP-1 and alphaCP-2. Both of these novel genes, alphaCP-3 and alphaCP-4, are predicted to encode proteins with triplicated KH domains. The number and organization of the KH domains, their sequences, and the sequences of the contiguous regions are conserved among all four alphaCP proteins. The common evolutionary origin of these proteins is substantiated by conservation of exon-intron organization in the corresponding genes. The map positions of alphaCP-1 and alphaCP-2 (previously reported) and those of alphaCP-3 and alphaCP-4 (present report) reveal that the four alphaCP loci are dispersed in the human genome; alphaCP-3 and alphaCP-4 mapped to 21q22.3 and 3p21, and the respective mouse orthologues mapped to syntenic regions of the mouse genome, 10B5 and 9F1-F2, respectively. Two additional loci in the human genome were identified as alphaCP-2 processed pseudogenes (PCBP2P1, 21q22.3, and PCBP2P2, 8q21-q22). Although the overall levels of alphaCP-3 and alphaCP-4 mRNAs are substantially lower than those of alphaCP-1 and alphaCP-2, transcripts of alphaCP-3 and alphaCP-4 were found in all mouse tissues tested. These data establish a new subfamily of genes predicted to encode closely related KH-containing RNA-binding proteins with potential functions in posttranscriptional controls. Copyright 2000 Academic Press.

  5. Genomes to natural products PRediction Informatics for Secondary Metabolomes (PRISM).

    PubMed

    Skinnider, Michael A; Dejong, Chris A; Rees, Philip N; Johnston, Chad W; Li, Haoxin; Webster, Andrew L H; Wyatt, Morgan A; Magarvey, Nathan A

    2015-11-16

    Microbial natural products are an invaluable source of evolved bioactive small molecules and pharmaceutical agents. Next-generation and metagenomic sequencing indicates untapped genomic potential, yet high rediscovery rates of known metabolites increasingly frustrate conventional natural product screening programs. New methods to connect biosynthetic gene clusters to novel chemical scaffolds are therefore critical to enable the targeted discovery of genetically encoded natural products. Here, we present PRISM, a computational resource for the identification of biosynthetic gene clusters, prediction of genetically encoded nonribosomal peptides and type I and II polyketides, and bio- and cheminformatic dereplication of known natural products. PRISM implements novel algorithms which render it uniquely capable of predicting type II polyketides, deoxygenated sugars, and starter units, making it a comprehensive genome-guided chemical structure prediction engine. A library of 57 tailoring reactions is leveraged for combinatorial scaffold library generation when multiple potential substrates are consistent with biosynthetic logic. We compare the accuracy of PRISM to existing genomic analysis platforms. PRISM is an open-source, user-friendly web application available at http://magarveylab.ca/prism/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

    PubMed

    Seligmann, Hervé

    2013-05-07

    GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
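
    The exchange rules quoted in this abstract can be read as cyclic substitutions: under A→G→C→U/T→A, every A is replaced by G, every G by C, every C by T, and every T by A. The sketch below is only an illustration of that reading on a DNA string. The ASCII rule notation ("A>G>C>T>A"), the function names, and the example sequence are assumptions made here, not the author's software.

```python
# Illustrative sketch only: rule strings and function names are assumptions,
# not taken from the cited study.

def exchange_map(rule: str) -> dict:
    """Turn a cyclic rule such as 'A>G>C>T>A' into a base-to-base mapping."""
    bases = rule.split(">")
    # Pair each base with the one that follows it in the rule.
    return dict(zip(bases, bases[1:]))


def apply_exchange(seq: str, rule: str) -> str:
    """Apply a systematic nucleotide exchange; bases absent from the rule are kept."""
    mapping = exchange_map(rule)
    return "".join(mapping.get(base, base) for base in seq.upper())


# The four rules listed in the abstract, written with T in place of U/T.
RULES = ["A>G>C>T>A", "A>T>C>G>A", "C>G>T>C", "A>C>G>T>A"]

if __name__ == "__main__":
    example = "ATGGCC"  # hypothetical input sequence
    for rule in RULES:
        print(rule, apply_exchange(example, rule))
```

    Note that the three-base rule C→G→U/T→C leaves A unchanged, which is why the mapping falls back to the original base.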

  7. Global differential gene expression in response to growth temperature alteration in group A Streptococcus.

    PubMed

    Smoot, L M; Smoot, J C; Graham, M R; Somerville, G A; Sturdevant, D E; Migliaccio, C A; Sylva, G L; Musser, J M

    2001-08-28

    Pathogens are exposed to different temperatures during an infection cycle and must regulate gene expression accordingly. However, the extent to which virulent bacteria alter gene expression in response to temperatures encountered in the host is unknown. Group A Streptococcus (GAS) is a human-specific pathogen that is responsible for illnesses ranging from superficial skin infections and pharyngitis to severe invasive infections such as necrotizing fasciitis and streptococcal toxic shock syndrome. GAS survives and multiplies at different temperatures during human infection. DNA microarray analysis was used to investigate the influence of temperature on global gene expression in a serotype M1 strain grown to exponential phase at 29 degrees C and 37 degrees C. Approximately 9% of genes were differentially expressed by at least 1.5-fold at 29 degrees C relative to 37 degrees C, including genes encoding transporter proteins, proteins involved in iron homeostasis, transcriptional regulators, phage-associated proteins, and proteins with no known homologue. Relatively few known virulence genes were differentially expressed at this threshold. However, transcription of 28 genes encoding proteins with predicted secretion signal sequences was altered, indicating that growth temperature substantially influences the extracellular proteome. TaqMan real-time reverse transcription-PCR assays confirmed the microarray data. We also discovered that transcription of genes encoding hemolysins, and proteins with inferred roles in iron regulation, transport, and homeostasis, was influenced by growth at 40 degrees C. Thus, GAS profoundly alters gene expression in response to temperature. The data delineate the spectrum of temperature-regulated gene expression in an important human pathogen and provide many unforeseen lines of pathogenesis investigation.

  8. Differential Expression of Three α-Galactosidase Genes and a Single β-Galactosidase Gene from Aspergillus niger

    PubMed Central

    de Vries, Ronald P.; van den Broeck, Hetty C.; Dekkers, Ester; Manzanares, Paloma; de Graaff, Leo H.; Visser, Jaap

    1999-01-01

    A gene encoding a third α-galactosidase (AglB) from Aspergillus niger has been cloned and sequenced. The gene consists of an open reading frame of 1,750 bp containing six introns. The gene encodes a protein of 443 amino acids which contains a eukaryotic signal sequence of 16 amino acids and seven putative N-glycosylation sites. The mature protein has a calculated molecular mass of 48,835 Da and a predicted pI of 4.6. An alignment of the AglB amino acid sequence with those of other α-galactosidases revealed that it belongs to a subfamily of α-galactosidases that also includes A. niger AglA. A. niger AglC belongs to a different subfamily that consists mainly of prokaryotic α-galactosidases. The expression of aglA, aglB, aglC, and lacA, the latter of which encodes an A. niger β-galactosidase, has been studied by using a number of monomeric, oligomeric, and polymeric compounds as growth substrates. Expression of aglA is only detected on galactose and galactose-containing oligomers and polymers. The aglB gene is expressed on all of the carbon sources tested, including glucose. Elevated expression was observed on xylan, which could be assigned to regulation via XlnR, the xylanolytic transcriptional activator. Expression of aglC was only observed on glucose, fructose, and combinations of glucose with xylose and galactose. High expression of lacA was detected on arabinose, xylose, xylan, and pectin. Similar to aglB, the expression on xylose and xylan can be assigned to regulation via XlnR. All four genes have distinct expression patterns which seem to mirror the natural substrates of the encoded proteins. PMID:10347026

  9. vanC Cluster of Vancomycin-Resistant Enterococcus gallinarum BM4174

    PubMed Central

    Arias, Cesar A.; Courvalin, Patrice; Reynolds, Peter E.

    2000-01-01

    Glycopeptide-resistant enterococci of the VanC type synthesize UDP-muramyl-pentapeptide[d-Ser] for cell wall assembly and prevent synthesis of peptidoglycan precursors ending in d-Ala. The vanC cluster of Enterococcus gallinarum BM4174 consists of five genes: vanC-1, vanXYC, vanT, vanRC, and vanSC. Three genes are sufficient for resistance: vanC-1 encodes a ligase that synthesizes the dipeptide d-Ala-d-Ser for addition to UDP-MurNAc-tripeptide, vanXYC encodes a d,d-dipeptidase–carboxypeptidase that hydrolyzes d-Ala-d-Ala and removes d-Ala from UDP-MurNAc-pentapeptide[d-Ala], and vanT encodes a membrane-bound serine racemase that provides d-Ser for the synthetic pathway. The three genes are clustered: the start codons of vanXYC and vanT overlap the termination codons of vanC-1 and vanXYC, respectively. Two genes which encode proteins with homology to the VanS-VanR two-component regulatory system were present downstream from the resistance genes. The predicted amino acid sequence of VanRC exhibited 50% identity to VanR and 33% identity to VanRB. VanSC had 40% identity to VanS over a region of 308 amino acids and 24% identity to VanSB over a region of 285 amino acids. All residues with important functions in response regulators and histidine kinases were conserved in VanRC and VanSC, respectively. Induction experiments based on the determination of d,d-carboxypeptidase activity in cytoplasmic extracts confirmed that the genes were expressed constitutively. Using a promoter-probing vector, regions upstream from the resistance and regulatory genes were identified that have promoter activity. PMID:10817725

  10. High-Quality Draft Genomes from Thermus caliditerrae YIM 77777 and T. tengchongensis YIM 77401, Isolates from Tengchong, China

    DOE PAGES

    Mefferd, Chrisabelle C.; Zhou, En-Min; Yu, Tian-Tian; ...

    2016-04-28

    The draft genomes of Thermus tengchongensis YIM 77401 and T. caliditerrae YIM 77777 are 2,562,314 and 2,218,114 bp and encode 2,726 and 2,305 predicted genes, respectively. Gene content and growth experiments demonstrate broad metabolic capacity, including starch hydrolysis, thiosulfate oxidation, arsenite oxidation, incomplete denitrification, and polysulfide reduction.
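
    As a rough aid to reading these figures, the genome sizes and predicted gene counts quoted above translate into gene densities of roughly one predicted gene per kilobase. The short sketch below does that arithmetic; the function name and dictionary layout are illustrative assumptions, and only the four numbers come from the record.

```python
# Illustrative arithmetic on the figures quoted in the record above.
# Names and structure are assumptions for this sketch.

GENOMES = {
    # strain: (draft genome size in bp, predicted genes)
    "T. tengchongensis YIM 77401": (2_562_314, 2_726),
    "T. caliditerrae YIM 77777": (2_218_114, 2_305),
}


def genes_per_kb(genome_bp: int, n_genes: int) -> float:
    """Predicted genes per kilobase of draft genome sequence."""
    return n_genes / (genome_bp / 1000)


if __name__ == "__main__":
    for strain, (size_bp, n_genes) in GENOMES.items():
        print(f"{strain}: {genes_per_kb(size_bp, n_genes):.2f} genes per kb")
```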

  11. High-Quality Draft Genome Sequence of Thermocrinis jamiesonii GBS1 T Isolated from Great Boiling Spring, Nevada

    DOE PAGES

    Ganji, Rakesh; Murugapiran, Senthil K.; Ong, John C.; ...

    2016-10-20

    The draft genome of Thermocrinis jamiesonii GBS1 T is 1,315,625 bp in 10 contigs and encodes 1,463 predicted genes. The presence of sox genes and various glycoside hydrolases and the absence of uptake NiFe hydrogenases (hyaB) are consistent with a requirement for thiosulfate and suggest the ability to use carbohydrate polymers.

  12. Comparative genome analysis of non-toxigenic non-O1 versus toxigenic O1 Vibrio cholerae

    PubMed Central

    Mukherjee, Munmun; Kakarla, Prathusha; Kumar, Sanath; Gonzalez, Esmeralda; Floyd, Jared T.; Inupakutika, Madhuri; Devireddy, Amith Reddy; Tirrell, Selena R.; Bruns, Merissa; He, Guixin; Lindquist, Ingrid E.; Sundararajan, Anitha; Schilkey, Faye D.; Mudge, Joann; Varela, Manuel F.

    2015-01-01

    Pathogenic strains of Vibrio cholerae are responsible for endemic and pandemic outbreaks of the disease cholera. The complete toxigenic mechanisms underlying virulence in Vibrio strains are poorly understood. The hypothesis of this work was that virulent versus non-virulent strains of V. cholerae harbor distinctive genomic elements that encode virulence. The purpose of this study was to elucidate genomic differences between the O1 serotypes and non-O1 V. cholerae PS15, a non-toxigenic strain, in order to identify novel genes potentially responsible for virulence. In this study, we compared the whole genome of the non-O1 PS15 strain to the whole genomes of toxigenic serotypes at the phylogenetic level, and found that the PS15 genome was distantly related to those of toxigenic V. cholerae. Thus we focused on a detailed gene comparison between PS15 and the distantly related O1 V. cholerae N16961. Based on sequence alignment we tentatively assigned chromosome numbers 1 and 2 to elements within the genome of non-O1 V. cholerae PS15. Further, we found that PS15 and O1 V. cholerae N16961 shared 98% identity and 766 genes, but of the genes present in N16961 that were missing in the non-O1 V. cholerae PS15 genome, 56 were predicted to encode not only for virulence-related genes (colonization, antimicrobial resistance, and regulation of persister cells) but also genes involved in the metabolic biosynthesis of lipids, nucleosides and sulfur compounds. Additionally, we found 113 genes unique to PS15 that were predicted to encode other properties related to virulence, disease, defense, membrane transport, and DNA metabolism. Here, we identified distinctive and novel genomic elements between O1 and non-O1 V. cholerae genomes as potential virulence factors and, thus, targets for future therapeutics. Modulation of such novel targets may eventually enhance eradication efforts of endemic and pandemic disease cholera in afflicted nations. PMID:25722857

  13. Comparative genome analysis of non-toxigenic non-O1 versus toxigenic O1 Vibrio cholerae.

    PubMed

    Mukherjee, Munmun; Kakarla, Prathusha; Kumar, Sanath; Gonzalez, Esmeralda; Floyd, Jared T; Inupakutika, Madhuri; Devireddy, Amith Reddy; Tirrell, Selena R; Bruns, Merissa; He, Guixin; Lindquist, Ingrid E; Sundararajan, Anitha; Schilkey, Faye D; Mudge, Joann; Varela, Manuel F

    Pathogenic strains of Vibrio cholerae are responsible for endemic and pandemic outbreaks of the disease cholera. The complete toxigenic mechanisms underlying virulence in Vibrio strains are poorly understood. The hypothesis of this work was that virulent versus non-virulent strains of V. cholerae harbor distinctive genomic elements that encode virulence. The purpose of this study was to elucidate genomic differences between the O1 serotypes and non-O1 V. cholerae PS15, a non-toxigenic strain, in order to identify novel genes potentially responsible for virulence. In this study, we compared the whole genome of the non-O1 PS15 strain to the whole genomes of toxigenic serotypes at the phylogenetic level, and found that the PS15 genome was distantly related to those of toxigenic V. cholerae. Thus we focused on a detailed gene comparison between PS15 and the distantly related O1 V. cholerae N16961. Based on sequence alignment we tentatively assigned chromosome numbers 1 and 2 to elements within the genome of non-O1 V. cholerae PS15. Further, we found that PS15 and O1 V. cholerae N16961 shared 98% identity and 766 genes, but of the genes present in N16961 that were missing in the non-O1 V. cholerae PS15 genome, 56 were predicted to encode not only for virulence-related genes (colonization, antimicrobial resistance, and regulation of persister cells) but also genes involved in the metabolic biosynthesis of lipids, nucleosides and sulfur compounds. Additionally, we found 113 genes unique to PS15 that were predicted to encode other properties related to virulence, disease, defense, membrane transport, and DNA metabolism. Here, we identified distinctive and novel genomic elements between O1 and non-O1 V. cholerae genomes as potential virulence factors and, thus, targets for future therapeutics. Modulation of such novel targets may eventually enhance eradication efforts of endemic and pandemic disease cholera in afflicted nations.

  14. Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.

    PubMed

    Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; 
Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R

    1999-12-16

    The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.

  15. Massive Collection of Full-Length Complementary DNA Clones and Microarray Analyses: Keys to Rice Transcriptome Analysis

    NASA Astrophysics Data System (ADS)

    Kikuchi, Shoshi

    2009-02-01

    Completion of the high-precision genome sequence analysis of rice led to the collection of about 35,000 full-length cDNA clones and the determination of their complete sequences. Mapping of these full-length cDNA sequences has given us information on (1) the number of genes expressed in the rice genome; (2) the start and end positions and exon-intron structures of rice genes; (3) alternative transcripts; (4) possible encoded proteins; (5) non-protein-coding (np) RNAs; (6) the density of gene localization on the chromosome; (7) setting the parameters of gene prediction programs; and (8) the construction of a microarray system that monitors global gene expression. Manual curation for rice gene annotation by using mapping information on full-length cDNA and EST assemblies has revealed about 32,000 expressed genes in the rice genome. Analysis of major gene families, such as those encoding membrane transport proteins (pumps, ion channels, and secondary transporters), along with the evolution from bacteria to higher animals and plants, reveals how gene numbers have increased through adaptation to circumstances. Family-based gene annotation also gives us a new way of comparing organisms. Massive amounts of data on gene expression under many kinds of physiological conditions are being accumulated in rice oligoarrays (22K and 44K) based on full-length cDNA sequences. Cluster analyses of genes that have the same promoter cis-elements, that have similar expression profiles, or that encode enzymes in the same metabolic pathways or signal transduction cascades give us clues to understanding the networks of gene expression in rice. As a tool for that purpose, we recently developed "RiCES", a tool for searching for cis-elements in the promoter regions of clustered genes.

  16. DNA methylation of miRNA-encoding genes in non-small cell lung cancer patients.

    PubMed

    Heller, Gerwin; Altenberger, Corinna; Steiner, Irene; Topakian, Thais; Ziegler, Barbara; Tomasich, Erwin; Lang, György; End-Pfützenreuter, Adelheid; Zehetmayer, Sonja; Döme, Balazs; Arns, Britt-Madeleine; Klepetko, Walter; Zielinski, Christoph C; Zöchbauer-Müller, Sabine

    2018-03-23

    De-regulated DNA methylation leading to transcriptional inactivation of certain genes occurs frequently in non-small cell lung cancers (NSCLC). Besides protein-encoding genes also microRNA (miRNA)-encoding genes may be targets for methylation in NSCLCs, however, the number of known methylated miRNA genes is still small. Thus, we investigated methylation of miRNA genes in primary tumours (TU) and corresponding non-malignant lung tissue samples (NL) of 50 NSCLC patients using methylated DNA immunoprecipitation followed by custom designed tiling microarray analyses (MeDIP-chip) and 252 differentially methylated probes between TU and NL samples were identified. These probes were annotated which resulted in the identification of 34 miRNA-encoding genes with increased methylation in TU specimens. While some of these miRNA-encoding genes were already known to be methylated in NSCLCs (e.g. miR-9-3, miR-124), methylation of the vast majority of them was unknown so far. We selected six miRNA genes (miR-10b, miR-1179, miR-137, miR-572, miR-3150b and miR-129-2) for gene-specific methylation analyses in TU and corresponding NL samples of 104 NSCLC patients and observed a statistically significant increase of methylation of these miRNA genes in TU samples (p<0.0001, respectively). In silico target prediction of the six miRNAs identified several oncogenic/cell proliferation promoting factors (e.g. CCNE1 as miR-1179 target). To investigate if miR-1179 indeed targets CCNE1, we transfected miR-1179 mimics into CCNE1 expressing NSCLC cells and observed down-regulated CCNE1 mRNA expression in these cells compared to control cells. Similar effects on Cyclin E1 expression were seen in Western blot analyses. In addition, we found a statistically significant growth reduction of NSCLC cells transfected with miR-1179 mimics compared to control cells. 
In conclusion, we identified many methylated miRNA genes in NSCLC patients and found that miR-1179 is a potential tumour cell growth suppressor in NSCLCs. Overall, our findings emphasize the impact of miRNA gene methylation on the pathogenesis of NSCLCs.

  17. DNA Asymmetric Strand Bias Affects the Amino Acid Composition of Mitochondrial Proteins

    PubMed Central

    Min, Xiang Jia; Hickey, Donal A.

    2007-01-01

    Variations in GC content between genomes have been extensively documented. Genomes with comparable GC contents can, however, still differ in the apportionment of the G and C nucleotides between the two DNA strands. This asymmetric strand bias is known as GC skew. Here, we have investigated the impact of differences in nucleotide skew on the amino acid composition of the encoded proteins. We compared orthologous genes between animal mitochondrial genomes that show large differences in GC and AT skews. Specifically, we compared the mitochondrial genomes of mammals, which are characterized by a negative GC skew and a positive AT skew, to those of flatworms, which show the opposite skews for both GC and AT base pairs. We found that the mammalian proteins are highly enriched in amino acids encoded by CA-rich codons (as predicted by their negative GC and positive AT skews), whereas their flatworm orthologs were enriched in amino acids encoded by GT-rich codons (also as predicted from their skews). We found that these differences in mitochondrial strand asymmetry (measured as GC and AT skews) can have very large, predictable effects on the composition of the encoded proteins. PMID:17974594
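
    The skew measure described in this abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions GC skew = (G - C)/(G + C) and AT skew = (A - T)/(A + T) computed on a single strand; the abstract itself does not give the formula, and the function name and example sequence below are hypothetical.

```python
from collections import Counter


def nucleotide_skews(seq: str) -> dict:
    """Return GC and AT skew for one DNA strand.

    Assumed definitions (not stated in the abstract):
      GC skew = (G - C) / (G + C)
      AT skew = (A - T) / (A + T)
    """
    counts = Counter(seq.upper())
    g, c, a, t = (counts[base] for base in "GCAT")
    # Guard against division by zero when a base pair type is absent.
    gc_skew = (g - c) / (g + c) if (g + c) else 0.0
    at_skew = (a - t) / (a + t) if (a + t) else 0.0
    return {"gc_skew": gc_skew, "at_skew": at_skew}


# Hypothetical CA-rich strand: more C than G, more A than T.
example = "ACCACCATAACCCAG"
print(nucleotide_skews(example))
```

    A strand with a negative GC skew and a positive AT skew, like the toy sequence above, corresponds to the mammalian pattern the abstract describes; reversing both signs gives the flatworm pattern.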

  18. Shedding new light on viral photosynthesis.

    PubMed

    Puxty, Richard J; Millard, Andrew D; Evans, David J; Scanlan, David J

    2015-10-01

    Viruses infecting the environmentally important marine cyanobacteria Prochlorococcus and Synechococcus encode 'auxiliary metabolic genes' (AMGs) involved in the light and dark reactions of photosynthesis. Here, we discuss progress on the inventory of such AMGs in the ever-increasing number of viral genome sequences as well as in metagenomic datasets. We contextualise these gene acquisitions with reference to a hypothesised fitness gain to the phage. We also report new evidence with regard to the sequence and predicted structural properties of viral petE genes encoding the soluble electron carrier plastocyanin. Viral copies of PetE exhibit extensive modifications to the N-terminal signal peptide and possess several novel residues in a region responsible for interaction with redox partners. We also highlight potential knowledge gaps in this field and discuss future opportunities to discover novel phage-host interactions involved in the photosynthetic process.

  19. Systematic Analysis and Comparison of Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium raimondii

    PubMed Central

    Wei, Hengling; Li, Wei; Sun, Xiwei; Zhu, Shuijin; Zhu, Jun

    2013-01-01

    Plant disease resistance genes are a key component of plant defense against a range of pathogens. The majority of these resistance genes belong to the super-family that harbors a nucleotide-binding site (NBS). A number of studies have focused on NBS-encoding genes in disease resistance breeding programs for diverse plants. However, little information has been reported with an emphasis on systematic analysis and comparison of NBS-encoding genes in cotton. To fill this knowledge gap, we identified and investigated the NBS-encoding resistance genes in cotton using the whole genome sequence information of Gossypium raimondii. In total, 355 NBS-encoding resistance genes were identified. Analyses of the conserved motifs and structural diversity showed that the two most distinctive features of these genes are the high proportion of non-regular NBS genes and the high diversity of N-terminal domains. Analyses of the physical locations and duplications of NBS-encoding genes showed that gene duplication of disease resistance genes could play an important role in cotton by leading to an increase in the functional diversity of the cotton NBS-encoding genes. Analyses of phylogenetic comparisons indicated that, in cotton, the NBS-encoding genes with a TIR domain not only have their own evolution pattern different from that of genes without a TIR domain, but also have their own species-specific pattern that differs from those of TIR genes in other plants. Analyses of the correlation between disease resistance QTL and NBS-encoding resistance genes showed that more than half of the disease resistance QTL could be associated with the NBS-encoding genes in cotton, which agrees with previous studies establishing that more than half of plant resistance genes are NBS-encoding genes. PMID:23936305

  20. Genome-wide comparative analysis of NBS-encoding genes between Brassica species and Arabidopsis thaliana.

    PubMed

    Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi

    2014-01-03

    Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare them with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after the split from A. thaliana. Here we present a genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among the 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in the Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among the 3 species suggested that orthologous genes in B. rapa have undergone stronger negative selection than those in B. oleracea. But for the TNL type, there are no significant differences in the orthologous gene pairs between the two species. This study is the first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome triplication analysis in B. oleracea, B. rapa and A. thaliana genomes, our study provides insight into the evolutionary history of NBS-encoding genes after divergence of A. thaliana and the Brassica lineage. These results together with expression pattern analysis of NBS-encoding orthologous genes provide a useful resource for functional characterization of these genes and genetic improvement of relevant crops.

  1. A Brassica oleracea gene expressed in a variety-specific manner may encode a novel plant transmembrane receptor.

    PubMed

    Palmer, J E; Dikeman, D A; Fujinuma, T; Kim, B; Jones, J I; Denda, M; Martínez-Zapater, J M; Cruz-Alvarez, M

    2001-04-01

    The species Brassica oleracea includes several agricultural varieties characterized by the proliferation of different types of meristems. Using a combination of subtractive hybridization and PCR (polymerase chain reaction) techniques we have identified several genes which are expressed in the reproductive meristems of the cauliflower curd (B. oleracea var. botrytis) but not in the vegetative meristems of Brussels sprouts (B. oleracea var. gemmifera) axillary buds. One of the cloned genes, termed CCE1 (CAULIFLOWER CURD EXPRESSION 1) shows specific expression in the botrytis variety. Preferential expression takes place in this variety in the meristems of the curd and in the stem throughout the vegetative and reproductive stages of plant growth. CCE1 transcripts are not detected in any of the organs of other B. oleracea varieties analyzed. Based on the nucleotide sequence of a cDNA encompassing the complete coding region, we predict that this gene encodes a transmembrane protein, with three transmembrane domains. The deduced amino acid sequence includes motifs conserved in G-protein-coupled receptors (GPCRs) from yeast and animal species. Our results suggest that the cloned gene encodes a protein belonging to a new, so far unidentified, family of transmembrane receptors in plants. The expression pattern of the gene suggests that the receptor may be involved in the control of meristem development/arrest that takes place in cauliflower.

  2. Genome analysis and identification of gelatinase encoded gene in Enterobacter aerogenes

    NASA Astrophysics Data System (ADS)

    Shahimi, Safiyyah; Mutalib, Sahilah Abdul; Khalid, Rozida Abdul; Repin, Rul Aisyah Mat; Lamri, Mohd Fadly; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat

    2016-11-01

    In this study, bioinformatic analysis of the genome sequence of E. aerogenes was performed to identify the gene encoding gelatinase. Enterobacter aerogenes was isolated from hot spring water, and its gelatinase is species-specific to porcine and fish gelatin. This bacterium therefore offers the possibility of producing enzymes specific to the gelatin of each of these species. Enterobacter aerogenes was partially genome sequenced, yielding a total sequence size of 5.0 mega base pairs (Mbp). The pre-processing pipeline reported 87.6 Mbp of total reads, 68.8 Mbp of high-quality reads, and a high-quality percentage of 78.58%. Genome assembly produced 120 contigs, 67.5% of which were longer than 1 kilo base pair (kbp), with an N50 contig length of 124,856 bp and a GC content of 55.17%. About 4,705 protein-coding genes were identified by protein prediction analysis. Two candidate genes were selected as having the highest percentage identity to gelatinase enzymes available in the Swiss-Prot and NCBI online databases: NODE_9_length_26866_cov_148.013245_12, containing a 1029 base pair (bp) sequence encoding 342 amino acids, and NODE_24_length_155103_cov_177.082458_62, containing a 717 bp sequence encoding 238 amino acids. Two pairs of primers (forward and reverse) were therefore designed based on the open reading frames (ORFs) of the selected genes. Genome analysis of E. aerogenes thus identified genes encoding gelatinase.
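
    The N50 contig length reported in this abstract is a standard assembly statistic. As a minimal sketch, assuming the usual definition (the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly), it can be computed as follows; the function name and toy lengths are hypothetical and are not the data from the study.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly given its contig lengths in bp.

    Contigs are visited from longest to shortest; N50 is the length of
    the contig at which the running total first reaches half of the
    total assembly size.
    """
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length
    raise ValueError("contig_lengths must not be empty")


# Toy assembly of nine contigs (hypothetical lengths, in bp).
toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_contigs))
```

    Because N50 depends only on the sorted length distribution, it says nothing about correctness of the assembly; it is typically read alongside the contig count and total size, as in the abstract.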

  3. Multiple conversion between the genes encoding bacterial class-I release factors

    PubMed Central

    Ishikawa, Sohta A.; Kamikawa, Ryoma; Inagaki, Yuji

    2015-01-01

    Bacteria require two class-I release factors, RF1 and RF2, that recognize stop codons and promote peptide release from the ribosome. RF1 and RF2 were most likely established through gene duplication followed by alteration of their stop codon specificities in the common ancestor of extant bacteria. This scenario implies that the two RF gene families have taken independent evolutionary trajectories after the ancestral gene duplication event. However, we here report two independent cases of conversion between RF1 and RF2 genes (RF1-RF2 gene conversion), which were rigorously examined by procedures incorporating the maximum-likelihood phylogenetic method. In both cases, RF1-RF2 gene conversion was predicted to occur in the region encoding nearly the entire domain 3, whose functions are common between RF paralogues. Nevertheless, the ‘direction’ of gene conversion appeared to be opposite in the two cases: from the RF2 gene to the RF1 gene in one case, and from the RF1 gene to the RF2 gene in the other. The two cases of RF1-RF2 gene conversion prompt us to propose two novel aspects in the evolution of bacterial class-I release factors: (i) domain 3 is interchangeable between RF paralogues, and (ii) RF1-RF2 gene conversion has occurred frequently in bacterial genome evolution. PMID:26257102

  4. Adaptations Required for Mitochondrial Import following Mitochondrial to Nucleus Gene Transfer of Ribosomal Protein S10

    PubMed Central

    Murcha, Monika W.; Rudhe, Charlotta; Elhafez, Dina; Adams, Keith L.; Daley, Daniel O.; Whelan, James

    2005-01-01

    The minimal requirements to support protein import into mitochondria were investigated in the context of the phenomenon of ongoing gene transfer from the mitochondrion to the nucleus in plants. Ribosomal protein 10 of the small subunit is encoded in the mitochondrion in soybean and many other angiosperms, whereas in several other species it is nuclear encoded and thus must be imported into the mitochondrial matrix to function. When encoded by the nuclear genome, it has adopted different strategies for mitochondrial targeting and import. In lettuce (Lactuca sativa) and carrot (Daucus carota), Rps10 independently gained different N-terminal extensions from other genes, following transfer to the nucleus. (The designation of Rps10 follows this convention. The gene is indicated in italics. If encoded in the mitochondrion, it is rps10; if encoded in the nucleus, it is Rps10.) Here, we show that the N-terminal extensions of Rps10 in lettuce and carrot are both essential for mitochondrial import. In maize (Zea mays), Rps10 has not acquired an extension upon transfer but can be readily imported into mitochondria. Deletion analysis located the mitochondrial targeting region to the first 20 amino acids. Using site directed mutagenesis, we changed residues in the first 20 amino acids of the mitochondrial encoded soybean (Glycine max) rps10 to the corresponding amino acids in the nuclear encoded maize Rps10 until import was achieved. Changes were required that altered charge, hydrophobicity, predicted ability to form an amphipathic α-helix, and generation of a binding motif for the outer mitochondrial membrane receptor, translocase of the outer membrane 20. In addition to defining the changes required to achieve mitochondrial localization, the results demonstrate that even proteins that do not present barriers to import can require substantial changes to acquire a mitochondrial targeting signal. PMID:16040655

  5. Genetic analysis of the agrocinopine catabolic region of Agrobacterium tumefaciens Ti plasmid pTiC58, which encodes genes required for opine and agrocin 84 transport.

    PubMed Central

    Hayman, G T; Beck von Bodman, S; Kim, H; Jiang, P; Farrand, S K

    1993-01-01

    The acc region, subcloned from pTiC58 of classical nopaline and agrocinopine A and B Agrobacterium tumefaciens C58, allowed agrobacteria to grow using agrocinopine B as the sole source of carbon and energy. acc is approximately 6 kb in size. It consists of at least five genes, accA through accE, as defined by complementation analysis using subcloned fragments and transposon insertion mutations of acc carried on different plasmids within the same cell. All five regions are required for agrocin 84 sensitivity, and at least four are required for agrocinopine and agrocin 84 uptake. The complementation results are consistent with the hypothesis that each of the five regions is separately transcribed. Maxicell experiments showed that the first of these genes, accA, encodes a 60-kDa protein. Analysis of osmotic shock fractions showed this protein to be located in the periplasm. The DNA sequence of the accA region revealed an open reading frame encoding a predicted polypeptide of 59,147 Da. The amino acid sequence encoded by this open reading frame is similar to the periplasmic binding proteins OppA and DppA of Escherichia coli and Salmonella typhimurium and OppA of Bacillus subtilis. PMID:8366042

  6. Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks.

    PubMed

    Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing

    2009-03-11

    Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. 
Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene-encoded proteins are attached to the core at more peripheral positions of the networks.
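
    The three network indices named in this abstract (degree, closeness, and betweenness centrality) can be sketched on a toy undirected graph. This is only an illustration under stated assumptions: the graph, node names, and normalization choices below are hypothetical, the betweenness routine uses Brandes' algorithm for unweighted graphs, and nothing here reproduces the authors' pipeline or data.

```python
from collections import deque


def degree_centrality(graph):
    """Fraction of the other nodes that each node touches directly."""
    n = len(graph)
    return {v: (len(nbs) / (n - 1) if n > 1 else 0.0) for v, nbs in graph.items()}


def closeness_centrality(graph):
    """Reachable-node count divided by summed shortest-path distances."""
    result = {}
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search from source
            v = queue.popleft()
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total else 0.0
    return result


def betweenness_centrality(graph):
    """Unnormalized betweenness via Brandes' algorithm (unweighted, undirected)."""
    score = {v: 0.0 for v in graph}
    for s in graph:
        order, dist = [], {s: 0}
        preds = {v: [] for v in graph}
        sigma = {v: 0 for v in graph}  # number of shortest paths from s
        sigma[s] = 1
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in graph}
        for w in reversed(order):  # accumulate dependencies back toward s
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2 for v, b in score.items()}


# Hypothetical toy network: hub H with neighbors A, B, C; D hangs off C.
toy = {"H": {"A", "B", "C"}, "A": {"H"}, "B": {"H"}, "C": {"H", "D"}, "D": {"C"}}
print(degree_centrality(toy)["H"], closeness_centrality(toy)["H"],
      betweenness_centrality(toy)["H"])
```

    In this toy network the hub H scores highest on all three indices, which is the kind of "important network position" the abstract attributes to house-keeping gene-encoded proteins, while the peripheral node D scores lowest.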

  7. Draft genome sequence of Actinotignum schaalii DSM 15541T: Genetic insights into the lifestyle, cell fitness and virulence.

    PubMed

    Yassin, Atteyet F; Langenberg, Stefan; Huntemann, Marcel; Clum, Alicia; Pillay, Manoj; Palaniappan, Krishnaveni; Varghese, Neha; Mikhailova, Natalia; Mukherjee, Supratim; Reddy, T B K; Daum, Chris; Shapiro, Nicole; Ivanova, Natalia; Woyke, Tanja; Kyrpides, Nikos C

    2017-01-01

    The permanent draft genome sequence of Actinotignum schaalii DSM 15541T is presented. The annotated genome includes 2,130,987 bp, with 1777 protein-coding and 58 rRNA-coding genes. Genome sequence analysis revealed the absence of genes encoding components of the PTS systems and enzymes of the TCA cycle, glyoxylate shunt and gluconeogenesis. Genomic data revealed that A. schaalii is able to oxidize carbohydrates via glycolysis, the nonoxidative pentose phosphate and the Entner-Doudoroff pathways. In addition, the genome harbors genes encoding enzymes involved in the conversion of pyruvate to lactate, acetate and ethanol, which are found to be the end products of carbohydrate fermentation. The genome contained the gene encoding Type I fatty acid synthase required for de novo FAS biosynthesis. The plsY and plsX genes encoding the acyltransferases necessary for phosphatidic acid biosynthesis were absent from the genome. The genome harbors genes encoding enzymes responsible for isoprene biosynthesis via the mevalonate (MVA) pathway. Genes encoding enzymes that confer resistance to reactive oxygen species (ROS) were identified. In addition, A. schaalii harbors genes that protect the genome against viral infections. These include restriction-modification (RM) systems, type II toxin-antitoxin (TA), CRISPR-Cas and abortive infection systems. The A. schaalii genome also encodes several virulence factors that contribute to adhesion and internalization of this pathogen, such as the tad genes encoding proteins required for pili assembly, the nanI gene encoding exo-alpha-sialidase, genes encoding heat shock proteins and genes encoding the type VII secretion system. These features are consistent with anaerobic and pathogenic lifestyles. Finally, resistance to ciprofloxacin occurs by mutation in chromosomal genes that encode the subunits of DNA-gyrase (GyrA) and topoisomerase IV (ParC) enzymes, while resistance to metronidazole was due to the frxA gene, which encodes NADPH-flavin oxidoreductase.

  8. Molecular and biochemical characterization of two tungsten- and selenium-containing formate dehydrogenases from Eubacterium acidaminophilum that are associated with components of an iron-only hydrogenase.

    PubMed

    Graentzdoerffer, Andrea; Rauh, David; Pich, Andreas; Andreesen, Jan R

    2003-01-01

    Two gene clusters encoding similar formate dehydrogenases (FDH) were identified in Eubacterium acidaminophilum. Each cluster is composed of one gene coding for a catalytic subunit (fdhA-I, fdhA-II) and one for an electron-transferring subunit (fdhB-I, fdhB-II). Both fdhA genes contain a TGA codon for selenocysteine incorporation and the encoded proteins harbor five putative iron-sulfur clusters in their N-terminal region. Both FdhB subunits resemble the N-terminal region of FdhA on the amino acid level and contain five putative iron-sulfur clusters. Four genes thought to encode the subunits of an iron-only hydrogenase are located upstream of the FDH gene cluster I. By sequence comparison, HymA and HymB are predicted to contain one and four iron-sulfur clusters, respectively, the latter protein also containing binding sites for FMN and NAD(P). Thus, HymA and HymB seem to represent electron-transferring subunits, and HymC the putative catalytic subunit containing motifs for four iron-sulfur clusters and one H-cluster specific for Fe-only hydrogenases. HymD has six predicted transmembrane helices and might be an integral membrane protein. Viologen-dependent FDH activity was purified from serine-grown cells of E. acidaminophilum and the purified protein complex contained four subunits, FdhA and FdhB, encoded by FDH gene cluster II, and HymA and HymB, identified after determination of their N-terminal sequences. Thus, this complex might represent the simplest type of formate hydrogen lyase. The purified formate dehydrogenase fraction contained iron, tungsten, a pterin cofactor, and zinc, but no molybdenum. FDH-II had a two-fold higher Km for formate (0.37 mM) than FDH-I and also catalyzed CO2 reduction to formate. Reverse transcription (RT)-PCR pointed to increased expression of FDH-II in serine-grown cells, supporting the isolation of this FDH isoform. The fdhA-I gene was expressed as inactive protein in Escherichia coli. The in-frame UGA codon for selenocysteine incorporation was read in the heterologous system only as a stop codon, although its potential SECIS element exhibited quite high similarity to that of E. coli FDH.

  9. In-Frame and Unmarked Gene Deletions in Burkholderia cenocepacia via an Allelic Exchange System Compatible with Gateway Technology.

    PubMed

    Fazli, Mustafa; Harrison, Joe J; Gambino, Michela; Givskov, Michael; Tolker-Nielsen, Tim

    2015-06-01

    Burkholderia cenocepacia is an emerging opportunistic pathogen causing life-threatening infections in immunocompromised individuals and in patients with cystic fibrosis, which are often difficult, if not impossible, to treat. Understanding the genetic basis of virulence in this emerging pathogen is important for the development of novel treatment regimes. Generation of deletion mutations in genes predicted to encode virulence determinants is fundamental to investigating the mechanisms of pathogenesis. However, there is a lack of appropriate selectable and counterselectable markers for use in B. cenocepacia, making its genetic manipulation problematic. Here we describe a Gateway-compatible allelic exchange system based on the counterselectable pheS gene and the I-SceI homing endonuclease. This system provides efficiency in cloning homology regions of target genes and allows the generation of precise and unmarked gene deletions in B. cenocepacia. As a proof of concept, we demonstrate its utility by deleting the Bcam1349 gene, encoding a cyclic di-GMP (c-di-GMP)-responsive regulator protein important for biofilm formation.

  10. In-Frame and Unmarked Gene Deletions in Burkholderia cenocepacia via an Allelic Exchange System Compatible with Gateway Technology

    PubMed Central

    Fazli, Mustafa; Harrison, Joe J.; Gambino, Michela; Givskov, Michael

    2015-01-01

    Burkholderia cenocepacia is an emerging opportunistic pathogen causing life-threatening infections in immunocompromised individuals and in patients with cystic fibrosis, which are often difficult, if not impossible, to treat. Understanding the genetic basis of virulence in this emerging pathogen is important for the development of novel treatment regimes. Generation of deletion mutations in genes predicted to encode virulence determinants is fundamental to investigating the mechanisms of pathogenesis. However, there is a lack of appropriate selectable and counterselectable markers for use in B. cenocepacia, making its genetic manipulation problematic. Here we describe a Gateway-compatible allelic exchange system based on the counterselectable pheS gene and the I-SceI homing endonuclease. This system provides efficiency in cloning homology regions of target genes and allows the generation of precise and unmarked gene deletions in B. cenocepacia. As a proof of concept, we demonstrate its utility by deleting the Bcam1349 gene, encoding a cyclic di-GMP (c-di-GMP)-responsive regulator protein important for biofilm formation. PMID:25795676

  11. Macronuclear Genome Sequence of the Ciliate Tetrahymena thermophila, a Model Eukaryote

    PubMed Central

    Eisen, Jonathan A; Coyne, Robert S; Wu, Martin; Wu, Dongying; Thiagarajan, Mathangi; Wortman, Jennifer R; Badger, Jonathan H; Ren, Qinghu; Amedeo, Paolo; Jones, Kristie M; Tallon, Luke J; Delcher, Arthur L; Salzberg, Steven L; Silva, Joana C; Haas, Brian J; Majoros, William H; Farzad, Maryam; Carlton, Jane M; Smith, Roger K; Garg, Jyoti; Pearlman, Ronald E; Karrer, Kathleen M; Sun, Lei; Manning, Gerard; Elde, Nels C; Turkewitz, Aaron P; Asai, David J; Wilkes, David E; Wang, Yufeng; Cai, Hong; Collins, Kathleen; Stewart, B. Andrew; Lee, Suzanne R; Wilamowska, Katarzyna; Weinberg, Zasha; Ruzzo, Walter L; Wloga, Dorota; Gaertig, Jacek; Frankel, Joseph; Tsao, Che-Chia; Gorovsky, Martin A; Keeling, Patrick J; Waller, Ross F; Patron, Nicola J; Cherry, J. Michael; Stover, Nicholas A; Krieger, Cynthia J; del Toro, Christina; Ryder, Hilary F; Williamson, Sondra C; Barbeau, Rebecca A; Hamilton, Eileen P; Orias, Eduardo

    2006-01-01

    The ciliate Tetrahymena thermophila is a model organism for molecular and cellular biology. Like other ciliates, this species has separate germline and soma functions that are embodied by distinct nuclei within a single cell. The germline-like micronucleus (MIC) has its genome held in reserve for sexual reproduction. The soma-like macronucleus (MAC), which possesses a genome processed from that of the MIC, is the center of gene expression and does not directly contribute DNA to sexual progeny. We report here the shotgun sequencing, assembly, and analysis of the MAC genome of T. thermophila, which is approximately 104 Mb in length and composed of approximately 225 chromosomes. Overall, the gene set is robust, with more than 27,000 predicted protein-coding genes, 15,000 of which have strong matches to genes in other organisms. The functional diversity encoded by these genes is substantial and reflects the complexity of processes required for a free-living, predatory, single-celled organism. This is highlighted by the abundance of lineage-specific duplications of genes with predicted roles in sensing and responding to environmental conditions (e.g., kinases), using diverse resources (e.g., proteases and transporters), and generating structural complexity (e.g., kinesins and dyneins). In contrast to the other lineages of alveolates (apicomplexans and dinoflagellates), no compelling evidence could be found for plastid-derived genes in the genome. UGA, the only T. thermophila stop codon, is used in some genes to encode selenocysteine, thus making this organism the first known with the potential to translate all 64 codons in nuclear genes into amino acids. We present genomic evidence supporting the hypothesis that the excision of DNA from the MIC to generate the MAC specifically targets foreign DNA as a form of genome self-defense. The combination of the genome sequence, the functional diversity encoded therein, and the presence of some pathways missing from other model organisms makes T. thermophila an ideal model for functional genomic studies to address biological, biomedical, and biotechnological questions of fundamental importance. PMID:16933976

  12. Identification of differentially expressed small non-coding RNAs in the legume endosymbiont Sinorhizobium meliloti by comparative genomics

    PubMed Central

    del Val, Coral; Rivas, Elena; Torres-Quesada, Omar; Toro, Nicolás; Jiménez-Zurdo, José I

    2007-01-01

    Bacterial small non-coding RNAs (sRNAs) are being recognized as novel widespread regulators of gene expression in response to environmental signals. Here, we present the first search for sRNA-encoding genes in the nitrogen-fixing endosymbiont Sinorhizobium meliloti, performed by a genome-wide computational analysis of its intergenic regions. Comparative sequence data from eight related α-proteobacteria were obtained, and the interspecies pairwise alignments were scored with the programs eQRNA and RNAz as complementary predictive tools to identify conserved and stable secondary structures corresponding to putative non-coding RNAs. Northern experiments confirmed that eight of the predicted loci, selected among the original 32 candidates as most probable sRNA genes, expressed small transcripts. This result supports the combined use of eQRNA and RNAz as a robust strategy to identify novel sRNAs in bacteria. Furthermore, seven of the transcripts accumulated differentially in free-living and symbiotic conditions. Experimental mapping of the 5′-ends of the detected transcripts revealed that their encoding genes are organized in autonomous transcription units with recognizable promoter and, in most cases, termination signatures. These findings suggest novel regulatory functions for sRNAs related to the interactions of α-proteobacteria with their eukaryotic hosts. PMID:17971083

  13. Three new members of the RNP protein family in Xenopus.

    PubMed Central

    Good, P J; Rebbert, M L; Dawid, I B

    1993-01-01

    Many RNP proteins contain one or more copies of the RNA recognition motif (RRM) and are thought to be involved in cellular RNA metabolism. We have previously characterized in Xenopus a nervous system specific gene, nrp1, that is more similar to the hnRNP A/B proteins than to other known proteins (K. Richter, P. J. Good, and I. B. Dawid (1990), New Biol. 2, 556-565). PCR amplification with degenerate primers was used to identify additional cDNAs encoding two RRMs in Xenopus. Three previously uncharacterized genes were identified. Two genes encode hnRNP A/B proteins with two RRMs and a glycine-rich domain. One of these is the Xenopus homolog of the human A2/B1 gene; the other, named hnRNP A3, is similar to both the A1 and A2 hnRNP genes. The Xenopus hnRNP A1, A2 and A3 genes are expressed throughout development and in all adult tissues. Multiple protein isoforms for the hnRNP A2 gene are predicted that differ by the insertion of short peptide sequences in the glycine-rich domain. The third newly isolated gene, named xrp1, encodes a protein that is related by sequence to the nrp1 protein but is expressed ubiquitously. Despite the similarity to nuclear RNP proteins, both the nrp1 and xrp1 proteins are localized to the cytoplasm in the Xenopus oocyte. The xrp1 gene may have a function in all cells that is similar to that executed by nrp1 specifically within the nervous system. PMID:8451200

  14. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    PubMed Central

    2013-01-01

    Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs) and Support Vector Machines (SVMs) were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression. PMID:23369200

  15. Predictive minimum description length principle approach to inferring gene regulatory networks.

    PubMed

    Chaitankar, Vijender; Zhang, Chaoyang; Ghosh, Preetam; Gong, Ping; Perkins, Edward J; Deng, Youping

    2011-01-01

    Reverse engineering of gene regulatory networks using information theory models has received much attention due to its simplicity, low computational cost, and capability of inferring large networks. One of the major problems with information theory models is to determine the threshold that defines the regulatory relationships between genes. The minimum description length (MDL) principle has been implemented to overcome this problem. The description length of the MDL principle is the sum of model length and data encoding length. A user-specified fine tuning parameter is used as control mechanism between model and data encoding, but it is difficult to find the optimal parameter. In this work, we propose a new inference algorithm that incorporates mutual information (MI), conditional mutual information (CMI), and predictive minimum description length (PMDL) principle to infer gene regulatory networks from DNA microarray data. In this algorithm, the information theoretic quantities MI and CMI determine the regulatory relationships between genes and the PMDL principle method attempts to determine the best MI threshold without the need of a user-specified fine tuning parameter. The performance of the proposed algorithm is evaluated using both synthetic time series data sets and a biological time series data set (Saccharomyces cerevisiae). The results show that the proposed algorithm produced fewer false edges and significantly improved the precision when compared to existing MDL algorithm.
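    The role the mutual information (MI) threshold plays in the abstract above can be illustrated with a minimal sketch. It assumes expression values that have already been discretized and a hand-picked threshold, so it is not the PMDL algorithm itself, which is described as choosing the threshold without a user-specified parameter. The function names and the toy profiles are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations
from math import log2


def mutual_information(x, y):
    """Empirical mutual information (in bits) between two discrete sequences."""
    n = len(x)
    px = Counter(x)          # marginal counts for x
    py = Counter(y)          # marginal counts for y
    pxy = Counter(zip(x, y)) # joint counts
    mi = 0.0
    for (a, b), count in pxy.items():
        p_ab = count / n
        mi += p_ab * log2(p_ab / ((px[a] / n) * (py[b] / n)))
    return mi


def infer_edges(expression, threshold):
    """Return gene pairs whose MI meets the threshold (undirected edges)."""
    edges = []
    for g1, g2 in combinations(sorted(expression), 2):
        if mutual_information(expression[g1], expression[g2]) >= threshold:
            edges.append((g1, g2))
    return edges


# Toy, made-up discretized profiles (0 = low, 1 = high); not data from the paper.
toy = {
    "geneA": [0, 0, 1, 1, 0, 1, 0, 1],
    "geneB": [0, 0, 1, 1, 0, 1, 0, 1],  # identical to geneA
    "geneC": [0, 1, 0, 1, 0, 0, 1, 1],  # statistically independent of geneA here
}
edges = infer_edges(toy, threshold=0.5)  # the threshold value is an arbitrary assumption
```

    In this sketch the set of reported edges depends entirely on the chosen threshold, which is the sensitivity the abstract says the PMDL approach is meant to remove.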

  16. funRNA: a fungi-centered genomics platform for genes encoding key components of RNAi.

    PubMed

    Choi, Jaeyoung; Kim, Ki-Tae; Jeon, Jongbum; Wu, Jiayao; Song, Hyeunjeong; Asiegbu, Fred O; Lee, Yong-Hwan

    2014-01-01

    RNA interference (RNAi) is involved in genome defense as well as diverse cellular, developmental, and physiological processes. Key components of RNAi are Argonaute, Dicer, and RNA-dependent RNA polymerase (RdRP), which have been functionally characterized mainly in model organisms. The key components are believed to exist throughout eukaryotes; however, there is no systematic platform for archiving and dissecting these important gene families. In addition, few fungi have been studied to date, limiting our understanding of RNAi in fungi. Here we present funRNA http://funrna.riceblast.snu.ac.kr/, a fungal kingdom-wide comparative genomics platform for putative genes encoding Argonaute, Dicer, and RdRP. To identify and archive genes encoding the abovementioned key components, protein domain profiles were determined from reference sequences obtained from UniProtKB/SwissProt. The domain profiles were searched using fungal, metazoan, and plant genomes, as well as bacterial and archaeal genomes. 1,163, 442, and 678 genes encoding Argonaute, Dicer, and RdRP, respectively, were predicted. Based on the identification results, active site variation of Argonaute, diversification of Dicer, and sequence analysis of RdRP were discussed in a fungus-oriented manner. funRNA provides results from diverse bioinformatics programs and job submission forms for BLAST, BLASTMatrix, and ClustalW. Furthermore, sequence collections created in funRNA are synced with several gene family analysis portals and databases, offering further analysis opportunities. funRNA provides identification results from a broad taxonomic range and diverse analysis functions, and could be used in diverse comparative and evolutionary studies. It could serve as a versatile genomics workbench for key components of RNAi.

  17. Cloning and Expression of the Benzoate Dioxygenase Genes from Rhodococcus sp. Strain 19070

    PubMed Central

    Haddad, Sandra; Eby, D. Matthew; Neidle, Ellen L.

    2001-01-01

    The bopXYZ genes from the gram-positive bacterium Rhodococcus sp. strain 19070 encode a broad-substrate-specific benzoate dioxygenase. Expression of the BopXY terminal oxygenase enabled Escherichia coli to convert benzoate or anthranilate (2-aminobenzoate) to a nonaromatic cis-diol or catechol, respectively. This expression system also rapidly transformed m-toluate (3-methylbenzoate) to an unidentified product. In contrast, 2-chlorobenzoate was not a good substrate. The BopXYZ dioxygenase was homologous to the chromosomally encoded benzoate dioxygenase (BenABC) and the plasmid-encoded toluate dioxygenase (XylXYZ) of gram-negative acinetobacters and pseudomonads. Pulsed-field gel electrophoresis failed to identify any plasmid in Rhodococcus sp. strain 19070. Catechol 1,2- and 2,3-dioxygenase activity indicated that strain 19070 possesses both meta- and ortho-cleavage degradative pathways, which are associated in pseudomonads with the xyl and ben genes, respectively. Open reading frames downstream of bopXYZ, designated bopL and bopK, resembled genes encoding cis-diol dehydrogenases and benzoate transporters, respectively. The bop genes were in the same order as the chromosomal ben genes of P. putida PRS2000. The deduced sequences of BopXY were 50 to 60% identical to the corresponding proteins of benzoate and toluate dioxygenases. The reductase components of these latter dioxygenases, BenC and XylZ, are 201 residues shorter than the deduced BopZ sequence. As predicted from the sequence, expression of BopZ in E. coli yielded an approximately 60-kDa protein whose presence corresponded to increased cytochrome c reductase activity. While the N-terminal region of BopZ was approximately 50% identical in sequence to the entire BenC or XylZ reductases, the C terminus was unlike other known protein sequences. PMID:11375157

  18. RNA-Seq Analysis of the Expression of Genes Encoding Cell Wall Degrading Enzymes during Infection of Lupin (Lupinus angustifolius) by Phytophthora parasitica

    PubMed Central

    Blackman, Leila M.; Cullerne, Darren P.; Torreña, Pernelyn; Taylor, Jen; Hardham, Adrienne R.

    2015-01-01

    RNA-Seq analysis has shown that over 60% (12,962) of the predicted transcripts in the Phytophthora parasitica genome are expressed during the first 60 h of lupin root infection. The infection transcriptomes included 278 of the 431 genes encoding P. parasitica cell wall degrading enzymes. The transcriptome data provide strong evidence of global transcriptional cascades of genes whose encoded proteins target the main categories of plant cell wall components. A major cohort of pectinases is predominantly expressed early but as infection progresses, the transcriptome becomes increasingly dominated by transcripts encoding cellulases, hemicellulases, β-1,3-glucanases and glycoproteins. The most highly expressed P. parasitica carbohydrate active enzyme gene contains two CBM1 cellulose binding modules and no catalytic domains. The top 200 differentially expressed genes include β-1,4-glucosidases, β-1,4-glucanases, β-1,4-galactanases, a β-1,3-glucanase, an α-1,4-polygalacturonase, a pectin deacetylase and a pectin methylesterase. Detailed analysis of gene expression profiles provides clues as to the order in which linkages within the complex carbohydrates may come under attack. The gene expression profiles suggest that (i) demethylation of pectic homogalacturonan occurs before its deacetylation; (ii) cleavage of the backbone of pectic rhamnogalacturonan I precedes digestion of its side chains; (iii) early attack on cellulose microfibrils by non-catalytic cellulose-binding proteins and enzymes with auxiliary activities may facilitate subsequent attack by glycosyl hydrolases and enzymes containing CBM1 cellulose-binding modules; (iv) terminal hemicellulose backbone residues are targeted after extensive internal backbone cleavage has occurred; and (v) the carbohydrate chains on glycoproteins are degraded late in infection. A notable feature of the P. parasitica infection transcriptome is the high level of transcription of genes encoding enzymes that degrade β-1,3-glucanases during middle and late stages of infection. The results suggest that high levels of β-1,3-glucanases may effectively degrade callose as it is produced by the plant during the defence response. PMID:26332397

  19. RNA-Seq Analysis of the Expression of Genes Encoding Cell Wall Degrading Enzymes during Infection of Lupin (Lupinus angustifolius) by Phytophthora parasitica.

    PubMed

    Blackman, Leila M; Cullerne, Darren P; Torreña, Pernelyn; Taylor, Jen; Hardham, Adrienne R

    2015-01-01

    RNA-Seq analysis has shown that over 60% (12,962) of the predicted transcripts in the Phytophthora parasitica genome are expressed during the first 60 h of lupin root infection. The infection transcriptomes included 278 of the 431 genes encoding P. parasitica cell wall degrading enzymes. The transcriptome data provide strong evidence of global transcriptional cascades of genes whose encoded proteins target the main categories of plant cell wall components. A major cohort of pectinases is predominantly expressed early but as infection progresses, the transcriptome becomes increasingly dominated by transcripts encoding cellulases, hemicellulases, β-1,3-glucanases and glycoproteins. The most highly expressed P. parasitica carbohydrate active enzyme gene contains two CBM1 cellulose binding modules and no catalytic domains. The top 200 differentially expressed genes include β-1,4-glucosidases, β-1,4-glucanases, β-1,4-galactanases, a β-1,3-glucanase, an α-1,4-polygalacturonase, a pectin deacetylase and a pectin methylesterase. Detailed analysis of gene expression profiles provides clues as to the order in which linkages within the complex carbohydrates may come under attack. The gene expression profiles suggest that (i) demethylation of pectic homogalacturonan occurs before its deacetylation; (ii) cleavage of the backbone of pectic rhamnogalacturonan I precedes digestion of its side chains; (iii) early attack on cellulose microfibrils by non-catalytic cellulose-binding proteins and enzymes with auxiliary activities may facilitate subsequent attack by glycosyl hydrolases and enzymes containing CBM1 cellulose-binding modules; (iv) terminal hemicellulose backbone residues are targeted after extensive internal backbone cleavage has occurred; and (v) the carbohydrate chains on glycoproteins are degraded late in infection. A notable feature of the P. parasitica infection transcriptome is the high level of transcription of genes encoding enzymes that degrade β-1,3-glucanases during middle and late stages of infection. The results suggest that high levels of β-1,3-glucanases may effectively degrade callose as it is produced by the plant during the defence response.

  20. Microarray-based comparative genomic profiling of reference strains and selected Canadian field isolates of Actinobacillus pleuropneumoniae

    PubMed Central

    Gouré, Julien; Findlay, Wendy A; Deslandes, Vincent; Bouevitch, Anne; Foote, Simon J; MacInnes, Janet I; Coulton, James W; Nash, John HE; Jacques, Mario

    2009-01-01

    Background Actinobacillus pleuropneumoniae, the causative agent of porcine pleuropneumonia, is a highly contagious respiratory pathogen that causes severe losses to the swine industry worldwide. Current commercially-available vaccines are of limited value because they do not induce cross-serovar immunity and do not prevent development of the carrier state. Microarray-based comparative genomic hybridizations (M-CGH) were used to estimate whole genomic diversity of representative Actinobacillus pleuropneumoniae strains. Our goal was to identify conserved genes, especially those predicted to encode outer membrane proteins and lipoproteins because of their potential for the development of more effective vaccines. Results Using hierarchical clustering, our M-CGH results showed that the majority of the genes in the genome of the serovar 5 A. pleuropneumoniae L20 strain were conserved in the reference strains of all 15 serovars and in representative field isolates. Fifty-eight conserved genes predicted to encode for outer membrane proteins or lipoproteins were identified. As well, there were several clusters of diverged or absent genes including those associated with capsule biosynthesis, toxin production as well as genes typically associated with mobile elements. Conclusion Although A. pleuropneumoniae strains are essentially clonal, M-CGH analysis of the reference strains of the fifteen serovars and representative field isolates revealed several classes of genes that were divergent or absent. Not surprisingly, these included genes associated with capsule biosynthesis as the capsule is associated with sero-specificity. Several of the conserved genes were identified as candidates for vaccine development, and we conclude that M-CGH is a valuable tool for reverse vaccinology. PMID:19239696

  1. Receptor-like genes in the major resistance locus of lettuce are subject to divergent selection.

    PubMed Central

    Meyers, B C; Shen, K A; Rohani, P; Gaut, B S; Michelmore, R W

    1998-01-01

    Disease resistance genes in plants are often found in complex multigene families. The largest known cluster of disease resistance specificities in lettuce contains the RGC2 family of genes. We compared the sequences of nine full-length genomic copies of RGC2 representing the diversity in the cluster to determine the structure of genes within this family and to examine the evolution of its members. The transcribed regions range from at least 7.0 to 13.1 kb, and the cDNAs contain deduced open reading frames of approximately 5.5 kb. The predicted RGC2 proteins contain a nucleotide binding site and irregular leucine-rich repeats (LRRs) that are characteristic of resistance genes cloned from other species. Unique features of the RGC2 gene products include a bipartite LRR region with >40 repeats. At least eight members of this family are transcribed. The level of sequence diversity between family members varied in different regions of the gene. The ratio of nonsynonymous (Ka) to synonymous (Ks) nucleotide substitutions was lowest in the region encoding the nucleotide binding site, which is the presumed effector domain of the protein. The LRR-encoding region showed an alternating pattern of conservation and hypervariability. This alternating pattern of variation was also found in all comparisons within families of resistance genes cloned from other species. The Ka/Ks ratios indicate that diversifying selection has resulted in increased variation at these codons. The patterns of variation support the predicted structure of LRR regions with solvent-exposed hypervariable residues that are potentially involved in binding pathogen-derived ligands. PMID:9811792

  2. Genome sequence of the model mushroom Schizophyllum commune

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ohm, Robin A.; de Jong, Jan F.; Lugones, Luis G.

    2010-09-01

    Much remains to be learned about the biology of mushroom-forming fungi, which are an important source of food, secondary metabolites and industrial enzymes. The wood-degrading fungus Schizophyllum commune is both a genetically tractable model for studying mushroom development and a likely source of enzymes capable of efficient degradation of lignocellulosic biomass. Comparative analyses of its 38.5-megabase genome, which encodes 13,210 predicted genes, reveal the species's unique wood-degrading machinery. One-third of the 471 genes predicted to encode transcription factors are differentially expressed during sexual development of S. commune. Whereas inactivation of one of these, fst4, prevented mushroom formation, inactivation of another, fst3, resulted in more, albeit smaller, mushrooms than in the wild-type fungus. Antisense transcripts may also have a role in the formation of fruiting bodies. Better insight into the mechanisms underlying mushroom formation should affect commercial production of mushrooms and their industrial use for producing enzymes and pharmaceuticals.

  3. Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.

    PubMed

    Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M

    1991-02-15

    The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.

  4. Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement.

    PubMed

    Blazier, J Chris; Ruhlman, Tracey A; Weng, Mao-Lun; Rehman, Sumaiyah K; Sabir, Jamal S M; Jansen, Robert K

    2016-04-18

    Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.

  5. Cloning and expression in Escherichia coli of isopenicillin N synthetase genes from Streptomyces lipmanii and Aspergillus nidulans.

    PubMed Central

    Weigel, B J; Burgett, S G; Chen, V J; Skatrud, P L; Frolik, C A; Queener, S W; Ingolia, T D

    1988-01-01

    beta-Lactam antibiotics such as penicillins and cephalosporins are synthesized by a wide variety of microbes, including procaryotes and eucaryotes. Isopenicillin N synthetase catalyzes a key reaction in the biosynthetic pathway of penicillins and cephalosporins. The genes encoding this protein have previously been cloned from the filamentous fungi Cephalosporium acremonium and Penicillium chrysogenum and characterized. We have extended our analysis to the isopenicillin N synthetase genes from the fungus Aspergillus nidulans and the gram-positive procaryote Streptomyces lipmanii. The isopenicillin N synthetase genes from these organisms have been cloned and sequenced, and the proteins encoded by the open reading frames were expressed in Escherichia coli. Active isopenicillin N synthetase enzyme was recovered from extracts of E. coli cells prepared from cells containing each of the genes in expression vectors. The four isopenicillin N synthetase genes studied are closely related. Pairwise comparison of the DNA sequences showed between 62.5 and 75.7% identity; comparison of the predicted amino acid sequences showed between 53.9 and 80.6% identity. The close homology of the procaryotic and eucaryotic isopenicillin N synthetase genes suggests horizontal transfer of the genes during evolution. PMID:3045077
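    The pairwise identity percentages quoted above can be made concrete with a small sketch. It assumes the two sequences have already been aligned to equal length and that columns containing a gap are excluded from the denominator; other conventions exist, and the abstract does not say which one the authors used. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a.upper() == b.upper():
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Made-up aligned fragments: 7 ungapped columns, 6 of them identical.
pid = percent_identity("ATGGC-TA", "ATGACGTA")
```

    The same function works for aligned amino acid sequences, since it only compares characters column by column.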

  6. The role of the ataxia telangiectasia mutated gene in lung cancer: recent advances in research.

    PubMed

    Xu, Yanling; Gao, Peng; Lv, Xuejiao; Zhang, Lin; Zhang, Jie

    2017-09-01

    Lung cancer is the leading cause of death due to cancer worldwide. It is estimated that approximately 1.2 million new cases of lung cancer are diagnosed each year. Early detection and treatment are crucial for improvements in both prognosis and quality of life of lung cancer patients. The ataxia telangiectasia mutated (ATM) gene is a cancer-susceptibility gene that encodes a key apical kinase in the DNA damage response pathway. It has recently been shown to play an important role in the development of lung cancer. The main functions of the ATM gene and protein include participation in cell cycle regulation and the identification and repair of DNA damage. ATM gene mutation can lead to multiple system dysfunctions as well as a concomitant increase in tumor tendency. In recent years, many studies have indicated that single nucleotide polymorphism of the ATM gene is associated with increased incidence of lung cancer. At the same time, the ATM gene and its encoded product, the ATM protein, predict the response to radiotherapy and chemotherapy and the prognosis of lung cancer, suggesting that the ATM gene may be a new potential target for the diagnosis and treatment of lung cancer.

  7. Molecular characterization of a gene POLR2H encoded an essential subunit for RNA polymerase II from the Giant Panda (Ailuropoda Melanoleuca).

    PubMed

    Du, Yu-Jie; Hou, Yi-Ling; Hou, Wan-Ru

    2013-02-01

    The Giant Panda is an endangered species and a valuable genetic resource. Its functionally important gene POLR2H encodes an essential shared peptide H of RNA polymerases. The genomic DNA and cDNA sequences were cloned successfully for the first time from the Giant Panda (Ailuropoda melanoleuca) using touchdown PCR and reverse transcription polymerase chain reaction (RT-PCR), respectively. The length of the genomic sequence of the Giant Panda is 3,285 bp, including five exons and four introns. The cDNA fragment cloned is 509 bp in length, containing an open reading frame of 453 bp encoding 150 amino acids. Alignment analysis indicated that both the cDNA and its deduced amino acid sequence were highly conserved. Protein structure prediction showed one protein kinase C phosphorylation site, four casein kinase II phosphorylation sites and one amidation site in the POLR2H protein, which may further shape its higher-order structure. The cloned cDNA was expressed in Escherichia coli; the N-terminally His-tagged POLR2H fusion accumulated as a polypeptide of the expected 20.5 kDa, in line with the predicted protein. Building on these results, more in-depth research will be conducted, which is of both theoretical and practical value.
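    The relationship between the 453 bp open reading frame and the 150 amino acids reported above follows from codon arithmetic: 453 / 3 = 151 codons, and 151 - 1 = 150 residues if one codon is the stop codon. The sketch below encodes that calculation. Treating the reported ORF length as including the stop codon is an assumption that happens to match the abstract's numbers, and the function name is hypothetical.

```python
def orf_to_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of the given length in base pairs."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    # Assumption: when the stop codon is counted in the ORF length,
    # it does not contribute an amino acid.
    return codons - 1 if includes_stop else codons


polr2h_residues = orf_to_protein_length(453)
```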

  8. Defining Aggressive Prostate Cancer Using a 12-Gene Model

    PubMed Central

    Riva, Alberto; Kim, Robert; Varambally, Sooryanarayana; He, Le; Kutok, Jeff; Aster, Jonathan C; Tang, Jeffery; Kuefer, Rainer; Hofer, Matthias D; Febbo, Phillip G; Chinnaiyan, Arul M; Rubin, Mark A

    2006-01-01

    The critical clinical question in prostate cancer research is: How do we develop means of distinguishing aggressive disease from indolent disease? Using a combination of proteomic and expression array data, we identified a set of 36 genes with concordant dysregulation of protein products that could be evaluated in situ by quantitative immunohistochemistry. Another five prostate cancer biomarkers were included. Using linear discriminant analysis, we determined that the optimal model used to predict prostate cancer progression consisted of 12 proteins. Using a separate patient population, transcriptional levels of the 12 genes encoding for these proteins predicted prostate-specific antigen failure in 79 men following surgery for clinically localized prostate cancer (P = .0015). This study demonstrates that cross-platform models can lead to predictive models with the possible advantage of being more robust through this selection process. PMID:16533427

  9. Identification of PaCOL1 and PaCOL2, two CONSTANS-like genes showing decreased transcript levels preceding short day induced growth cessation in Norway spruce.

    PubMed

    Holefors, Anna; Opseth, Lars; Ree Rosnes, Anne Katrine; Ripel, Linda; Snipen, Lars; Fossdal, Carl Gunnar; Olsen, Jorunn E

    2009-02-01

    In woody plants of the temperate zone short photoperiod (SD) leads to growth cessation. In angiosperms CONSTANS (CO) or CO-like genes play an important role in the photoperiodic control of flowering, tuberisation and shoot growth. To investigate the role of CO-like genes in photoperiodic control of shoot elongation in gymnosperms, PaCOL1 and PaCOL2 were isolated from Norway spruce. PaCOL1 encodes a 3.9kb gene with a predicted protein of 444 amino acids. PaCOL2 encodes a 1.2kb gene with a predicted protein of 385 amino acids. Both genes consist of two exons and have conserved domains found in other CO-like genes; two zinc finger domains, a CCT and a COOH domain. PaCOL1 and PaCOL2 fall into the group 1c clade of the CO-like genes, and are thus distinct from Arabidopsis CO that belongs to group 1a. Transcript levels of both PaCOL-genes appear to be light regulated, an increasing trend was observed upon transition from darkness to light, and a decreasing trend during darkness. The increasing trend at dawn was observed both in needles and shoot tips, whereas the decreasing trend in darkness was most prominent in shoot tips, and limited to the late part of the dark period in needles. The transcript levels of both genes decreased significantly in both tissues under SD prior to growth cessation and bud formation. This might suggest an involvement in photoperiodic control of shoot elongation or might be a consequence of regulation by light.

  10. RNA Sequencing-Based Genome Reannotation of the Dermatophyte Arthroderma benhamiae and Characterization of Its Secretome and Whole Gene Expression Profile during Infection

    PubMed Central

    De Coi, Niccolò; Feuermann, Marc; Schmid-Siegert, Emanuel; Băguţ, Elena-Tatiana; Mignon, Bernard; Waridel, Patrice; Peter, Corinne; Pradervand, Sylvain

    2016-01-01

    Dermatophytes are the most common agents of superficial mycoses in humans and animals. The aim of the present investigation was to systematically identify the extracellular, possibly secreted, proteins that are putative virulence factors and antigenic molecules of dermatophytes. A complete gene expression profile of Arthroderma benhamiae was obtained during infection of its natural host (guinea pig) using RNA sequencing (RNA-seq) technology. This profile was completed with those of the fungus cultivated in vitro in two media containing either keratin or soy meal protein as the sole source of nitrogen and in Sabouraud medium. More than 60% of transcripts deduced from RNA-seq data differ from those previously deposited for A. benhamiae. Using these RNA-seq data along with an automatic gene annotation procedure, followed by manual curation, we produced a new annotation of the A. benhamiae genome. This annotation comprised 7,405 coding sequences (CDSs), among which only 2,662 were identical to the currently available annotation, 383 were newly identified, and 15 secreted proteins were manually corrected. The expression profile of genes encoding proteins with a signal peptide in infected guinea pigs was found to be very different from that during in vitro growth when using keratin as the substrate. Especially, the sets of the 12 most highly expressed genes encoding proteases with a signal sequence had only the putative vacuolar aspartic protease gene PEP2 in common, during infection and in keratin medium. The most upregulated gene encoding a secreted protease during infection was that encoding subtilisin SUB6, which is a known major allergen in the related dermatophyte Trichophyton rubrum. IMPORTANCE Dermatophytoses (ringworm, jock itch, athlete’s foot, and nail infections) are the most common fungal infections, but their virulence mechanisms are poorly understood. Combining transcriptomic data obtained from growth under various culture conditions with data obtained during infection led to a significantly improved genome annotation. About 65% of the protein-encoding genes predicted with our protocol did not match the existing annotation for A. benhamiae. Comparing gene expression during infection on guinea pigs with keratin degradation in vitro, which is supposed to mimic the host environment, revealed the critical importance of using real in vivo conditions for investigating virulence mechanisms. The analysis of genes expressed in vivo, encoding cell surface and secreted proteins, particularly proteases, led to the identification of new allergen and virulence factor candidates. PMID:27822542

  11. RNA Sequencing-Based Genome Reannotation of the Dermatophyte Arthroderma benhamiae and Characterization of Its Secretome and Whole Gene Expression Profile during Infection.

    PubMed

    Tran, Van Du T; De Coi, Niccolò; Feuermann, Marc; Schmid-Siegert, Emanuel; Băguţ, Elena-Tatiana; Mignon, Bernard; Waridel, Patrice; Peter, Corinne; Pradervand, Sylvain; Pagni, Marco; Monod, Michel

    2016-01-01

    Dermatophytes are the most common agents of superficial mycoses in humans and animals. The aim of the present investigation was to systematically identify the extracellular, possibly secreted, proteins that are putative virulence factors and antigenic molecules of dermatophytes. A complete gene expression profile of Arthroderma benhamiae was obtained during infection of its natural host (guinea pig) using RNA sequencing (RNA-seq) technology. This profile was completed with those of the fungus cultivated in vitro in two media containing either keratin or soy meal protein as the sole source of nitrogen and in Sabouraud medium. More than 60% of transcripts deduced from RNA-seq data differ from those previously deposited for A. benhamiae. Using these RNA-seq data along with an automatic gene annotation procedure, followed by manual curation, we produced a new annotation of the A. benhamiae genome. This annotation comprised 7,405 coding sequences (CDSs), among which only 2,662 were identical to the currently available annotation, 383 were newly identified, and 15 secreted proteins were manually corrected. The expression profile of genes encoding proteins with a signal peptide in infected guinea pigs was found to be very different from that during in vitro growth when using keratin as the substrate. Especially, the sets of the 12 most highly expressed genes encoding proteases with a signal sequence had only the putative vacuolar aspartic protease gene PEP2 in common, during infection and in keratin medium. The most upregulated gene encoding a secreted protease during infection was that encoding subtilisin SUB6, which is a known major allergen in the related dermatophyte Trichophyton rubrum. IMPORTANCE Dermatophytoses (ringworm, jock itch, athlete's foot, and nail infections) are the most common fungal infections, but their virulence mechanisms are poorly understood. Combining transcriptomic data obtained from growth under various culture conditions with data obtained during infection led to a significantly improved genome annotation. About 65% of the protein-encoding genes predicted with our protocol did not match the existing annotation for A. benhamiae. Comparing gene expression during infection on guinea pigs with keratin degradation in vitro, which is supposed to mimic the host environment, revealed the critical importance of using real in vivo conditions for investigating virulence mechanisms. The analysis of genes expressed in vivo, encoding cell surface and secreted proteins, particularly proteases, led to the identification of new allergen and virulence factor candidates.

  12. Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs

    PubMed Central

    Cheng, Chao; Ung, Matthew; Grant, Gavin D.; Whitfield, Michael L.

    2013-01-01

    Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements. PMID:23874175

  13. Identification of the mpl gene encoding UDP-N-acetylmuramate: L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase in Escherichia coli and its role in recycling of cell wall peptidoglycan.

    PubMed Central

    Mengin-Lecreulx, D; van Heijenoort, J; Park, J T

    1996-01-01

    A gene, mpl, encoding UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase was recognized by its amino acid sequence homology with murC as the open reading frame yjfG present at 96 min on the Escherichia coli map. The existence of such an enzymatic activity was predicted from studies indicating that reutilization of the intact tripeptide L-alanyl-gamma-D-glutamyl-meso-diaminopimelate occurred and accounted for well over 30% of new cell wall synthesis. Murein tripeptide ligase activity could be demonstrated in crude extracts, and greatly increased activity was produced when the gene was cloned and expressed under control of the trc promoter. A null mutant totally lacked activity but was viable, showing that the enzyme is not essential for growth. PMID:8808921

  14. Deep-sea vent phage DNA polymerase specifically initiates DNA synthesis in the absence of primers.

    PubMed

    Zhu, Bin; Wang, Longfei; Mitsunobu, Hitoshi; Lu, Xueling; Hernandez, Alfredo J; Yoshida-Takashima, Yukari; Nunoura, Takuro; Tabor, Stanley; Richardson, Charles C

    2017-03-21

    A DNA polymerase is encoded by the deep-sea vent phage NrS-1. NrS-1 has a unique genome organization containing genes that are predicted to encode a helicase and a single-stranded DNA (ssDNA)-binding protein. The gene for an unknown protein shares weak homology with the bifunctional primase-polymerases (prim-pols) from archaeal plasmids but is missing the zinc-binding domain typically found in primases. We show that this gene product has efficient DNA polymerase activity and is processive in DNA synthesis in the presence of the NrS-1 helicase and ssDNA-binding protein. Remarkably, this NrS-1 DNA polymerase initiates DNA synthesis from a specific template DNA sequence in the absence of any primer. The de novo DNA polymerase activity resides in the N-terminal domain of the protein, whereas the C-terminal domain enhances DNA binding.

  15. Genome of the opportunistic pathogen Streptococcus sanguinis.

    PubMed

    Xu, Ping; Alves, Joao M; Kitten, Todd; Brown, Arunsri; Chen, Zhenming; Ozaki, Luiz S; Manque, Patricio; Ge, Xiuchun; Serrano, Myrna G; Puiu, Daniela; Hendricks, Stephanie; Wang, Yingping; Chaplin, Michael D; Akan, Doruk; Paik, Sehmi; Peterson, Darrell L; Macrina, Francis L; Buck, Gregory A

    2007-04-01

    The genome of Streptococcus sanguinis is a circular DNA molecule consisting of 2,388,435 bp and is 177 to 590 kb larger than the other 21 streptococcal genomes that have been sequenced. The G+C content of the S. sanguinis genome is 43.4%, which is considerably higher than the G+C contents of other streptococci. The genome encodes 2,274 predicted proteins, 61 tRNAs, and four rRNA operons. A 70-kb region encoding pathways for vitamin B(12) biosynthesis and degradation of ethanolamine and propanediol was apparently acquired by horizontal gene transfer. The gene complement suggests new hypotheses for the pathogenesis and virulence of S. sanguinis and differs from the gene complements of other pathogenic and nonpathogenic streptococci. In particular, S. sanguinis possesses a remarkable abundance of putative surface proteins, which may permit it to be a primary colonizer of the oral cavity and agent of streptococcal endocarditis and infection in neutropenic patients.

  16. Characterization of a polyketide synthase in Aspergillus niger whose product is a precursor for both dihydroxynaphthalene (DHN) melanin and naphtho-γ-pyrone.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chiang, Yi Ming; Meyer, Kristen M; Praseuth, Michael

    2010-12-06

    The genome sequencing of the fungus Aspergillus niger, an industrial workhorse, uncovered a large cache of genes encoding enzymes thought to be involved in the production of secondary metabolites yet to be identified. Identification and structural characterization of many of these predicted secondary metabolites are hampered by their low concentration relative to the known A. niger metabolites such as the naphtho-γ-pyrone family of polyketides. We deleted a nonreducing PKS gene in A. niger strain ATCC 11414, a daughter strain of A. niger ATCC strain 1015 whose genome was sequenced by the DOE Joint Genome Institute. This PKS encoding gene is a predicted ortholog of alb1 from Aspergillus fumigatus which is responsible for production of YWA1, a precursor of fungal DHN melanin. Our results show that the A. niger alb1 PKS is responsible for the production of the polyketide precursor for DHN melanin biosynthesis. Deletion of alb1 eliminates the production of major metabolites, naphtho-γ-pyrones. The generation of an A. niger strain devoid of naphtho-γ-pyrones will greatly facilitate the elucidation of cryptic biosynthetic pathways in this organism.

  17. A Bacteriophage-Related Chimeric Marine Virus Infecting Abalone

    PubMed Central

    Zhuang, Jun; Cai, Guiqin; Lin, Qiying; Wu, Zujian; Xie, Lianhui

    2010-01-01

    Marine viruses shape microbial communities with the most genetic diversity in the sea by multiple genetic exchanges and infect multiple marine organisms. Here we provide proof from experimental infection that abalone shriveling syndrome-associated virus (AbSV) can cause abalone shriveling syndrome. This malady produces histological necrosis and abnormally modified macromolecules (hemocyanin and ferritin). The AbSV genome is a 34.952-kilobase circular double-stranded DNA, containing putative genes with similarity to bacteriophages, eukaryotic viruses, bacteria and endosymbionts. Of the 28 predicted open reading frames (ORFs), eight ORF-encoded proteins have identifiable functional homologues. The 4 ORF products correspond to a predicted terminase large subunit and an endonuclease in bacteriophage, and both an integrase and an exonuclease from bacteria. The other four proteins are homologous to an endosymbiont-derived helicase, primase, single-stranded binding (SSB) protein, and thymidylate kinase, individually. Additionally, AbSV exhibits a common gene arrangement similar to the majority of bacteriophages. Unique to AbSV, the viral genome also contains genes associated with bacterial outer membrane proteins and may lack the structural protein-encoding ORFs. Genomic characterization of AbSV indicates that it may represent a transitional form of microbial evolution from viruses to bacteria. PMID:21079776

  18. De Novo Sequencing of a Sparassis latifolia Genome and Its Associated Comparative Analyses

    PubMed Central

    Ma, Lu; Yang, Chi; Ying, Zhenghe; Jiang, Xiaoling

    2018-01-01

    Known to be rich in β-glucan, Sparassis latifolia (S. latifolia) is a valuable edible fungus cultivated in East Asia. A few studies have suggested that S. latifolia is effective on antidiabetic, antihypertension, antitumor, and antiallergen medications. However, it is still unclear genetically why the fungus has these medical effects, which has become a key bottleneck for its further applications. To provide a better understanding of this fungus, we sequenced its whole genome, which has a total size of 48.13 megabases (Mb) and contains 12,471 predicted gene models. We then performed comparative and phylogenetic analyses, which indicate that S. latifolia is closely related to a few species in the antrodia clade including Fomitopsis pinicola, Wolfiporia cocos, Postia placenta, and Antrodia sinuosa. Finally, we annotated the predicted genes. Interestingly, the S. latifolia genome encodes most enzymes involved in carbohydrate and glycoconjugate metabolism and is also enriched in genes encoding enzymes critical to secondary metabolite biosynthesis and involved in indole, terpene, and type I polyketide pathways. As a conclusion, the genome content of S. latifolia sheds light on its genetic basis of the reported medicinal properties and could also be used as a reference genome for comparative studies on fungi. PMID:29682127

  19. Molecular diversity of tuliposide B-converting enzyme in tulip (Tulipa gesneriana): identification of the root-specific isozyme.

    PubMed

    Nomura, Taiji; Ueno, Ayaka; Ogita, Shinjiro; Kato, Yasuo

    2017-06-01

    6-Tuliposide B (PosB) is a glucose ester accumulated in tulip (Tulipa gesneriana) as a major secondary metabolite. PosB serves as the precursor of the antimicrobial lactone tulipalin B (PaB), which is formed by PosB-converting enzyme (TCEB). The gene TgTCEB1, encoding a TCEB, is transcribed in tulip pollen but scarcely transcribed in other tissues (e.g. roots) even though those tissues show high TCEB activity. This led to the prediction of the presence of a TCEB isozyme with distinct tissue specificity. Herein, we describe the identification of the TgTCEB-R gene from roots via native enzyme purification; this gene is a paralog of TgTCEB1. Recombinant enzyme characterization verified that TgTCEB-R encodes a TCEB. Moreover, TgTCEB-R was localized in tulip plastids, as found for pollen TgTCEB1. TgTCEB-R is transcribed almost exclusively in roots, indicating a tissue preference for the transcription of TCEB isozyme genes.

  20. Type 3 fimbriae and biofilm formation are regulated by the transcriptional regulators MrkHI in Klebsiella pneumoniae.

    PubMed

    Johnson, Jeremiah G; Murphy, Caitlin N; Sippy, Jean; Johnson, Tylor J; Clegg, Steven

    2011-07-01

    Klebsiella pneumoniae is an opportunistic pathogen which frequently causes hospital-acquired urinary and respiratory tract infections. K. pneumoniae may establish these infections in vivo following adherence, using the type 3 fimbriae, to indwelling devices coated with extracellular matrix components. Using a colony immunoblot screen, we identified transposon insertion mutants which were deficient for type 3 fimbrial surface production. One of these mutants possessed a transposon insertion within a gene, designated mrkI, encoding a putative transcriptional regulator. A site-directed mutant of this gene was constructed and shown to be deficient for fimbrial surface expression under aerobic conditions. MrkI mutants have a significantly decreased ability to form biofilms on both abiotic and extracellular matrix-coated surfaces. This gene was found to be cotranscribed with a gene predicted to encode a PilZ domain-containing protein, designated MrkH. This protein was found to bind cyclic-di-GMP (c-di-GMP) and regulate type 3 fimbrial expression.

  1. Type 3 Fimbriae and Biofilm Formation Are Regulated by the Transcriptional Regulators MrkHI in Klebsiella pneumoniae

    PubMed Central

    Johnson, Jeremiah G.; Murphy, Caitlin N.; Sippy, Jean; Johnson, Tylor J.; Clegg, Steven

    2011-01-01

    Klebsiella pneumoniae is an opportunistic pathogen which frequently causes hospital-acquired urinary and respiratory tract infections. K. pneumoniae may establish these infections in vivo following adherence, using the type 3 fimbriae, to indwelling devices coated with extracellular matrix components. Using a colony immunoblot screen, we identified transposon insertion mutants which were deficient for type 3 fimbrial surface production. One of these mutants possessed a transposon insertion within a gene, designated mrkI, encoding a putative transcriptional regulator. A site-directed mutant of this gene was constructed and shown to be deficient for fimbrial surface expression under aerobic conditions. MrkI mutants have a significantly decreased ability to form biofilms on both abiotic and extracellular matrix-coated surfaces. This gene was found to be cotranscribed with a gene predicted to encode a PilZ domain-containing protein, designated MrkH. This protein was found to bind cyclic-di-GMP (c-di-GMP) and regulate type 3 fimbrial expression. PMID:21571997

  2. Expression analysis of the N-Myc downstream-regulated gene 1 indicates that myelinating Schwann cells are the primary disease target in hereditary motor and sensory neuropathy-Lom.

    PubMed

    Berger, Philipp; Sirkowski, Erich E; Scherer, Steven S; Suter, Ueli

    2004-11-01

    Mutations in the gene encoding N-myc downstream-regulated gene-1 (NDRG1) lead to truncations of the encoded protein and are associated with an autosomal recessive demyelinating neuropathy--hereditary motor and sensory neuropathy-Lom. NDRG1 protein is highly expressed in peripheral nerve and is localized in the cytoplasm of myelinating Schwann cells, including the paranodes and Schmidt-Lanterman incisures. In contrast, sensory and motor neurons as well as their axons lack NDRG1. NDRG1 mRNA levels in developing and injured adult sciatic nerves parallel those of myelin-related genes, indicating that the expression of NDRG1 in myelinating Schwann cells is regulated by axonal interactions. Oligodendrocytes also express NDRG1, and the subtle CNS deficits of affected patients may result from a lack of NDRG1 in these cells. Our data predict that the loss of NDRG1 leads to a Schwann cell autonomous phenotype resulting in demyelination, with secondary axonal loss.

  3. Gene amplification at a locus encoding a putative Na+/H+ antiporter confers sodium and lithium tolerance in fission yeast.

    PubMed Central

    Jia, Z P; McCullough, N; Martel, R; Hemmingsen, S; Young, P G

    1992-01-01

    We have identified a new locus, sodium 2 (sod2) based on selection for increased LiCl tolerance in fission yeast, Schizosaccharomyces pombe. Tolerant strains have enhanced pH-dependent Na+ export capacity and sodium transport experiments suggest that the gene encodes an Na+/H+ antiport. The predicted sod2 gene product can be placed in the broad class of transporters which possess 12 hydrophobic transmembrane domains. The protein shows some sequence similarity to the human and bacterial Na+/H+ antiporters. Overexpression of sod2 increased Na+ export capacity and conferred sodium tolerance. Osmotolerance was not affected and sod2 cells were unaffected for growth in K+. In a sod2 disruption strain cells were incapable of exporting sodium. They were hypersensitive to Na+ or Li+ and could not grow under conditions that approximate pH 7. The sod2 gene amplification could be selected stepwise and the degree of such amplification correlated with the level of Na+ or Li+ tolerance. PMID:1314171

  4. Viability, Longevity, and Egg Production of Drosophila melanogaster Are Regulated by the miR-282 microRNA

    PubMed Central

    Vilmos, Péter; Bujna, Ágnes; Szuperák, Milán; Havelda, Zoltán; Várallyay, Éva; Szabad, János; Kucerova, Lucie; Somogyi, Kálmán; Kristó, Ildikó; Lukácsovich, Tamás; Jankovics, Ferenc; Henn, László; Erdélyi, Miklós

    2013-01-01

    The first microRNAs were discovered some 20 years ago, but only a small fraction of the microRNA-encoding genes have been described in detail yet. Here we report the molecular analysis of a computationally predicted Drosophila melanogaster microRNA gene, mir-282. We show that the mir-282 gene is the source of a 4.9-kb-long primary transcript with a 5′ cap and a 3′-poly(A) sequence and a mature microRNA of ∼25 bp. Our data strongly suggest the existence of an independent mir-282 gene conserved in holometabolic insects. We give evidence that the mir-282 locus encodes a functional transcript that influences viability, longevity, and egg production in Drosophila. We identify the nervous system-specific adenylate cyclase (rutabaga) as a target of miR-282 and assume that one of the main functions of mir-282 is the regulation of adenylate cyclase activity in the nervous system during metamorphosis. PMID:23852386

  5. Identification of the WBSCR9 gene, encoding a novel transcriptional regulator, in the Williams-Beuren syndrome deletion at 7q11.23.

    PubMed

    Peoples, R J; Cisco, M J; Kaplan, P; Francke, U

    1998-01-01

    We have identified a novel gene (WBSCR9) within the common Williams-Beuren syndrome (WBS) deletion by interspecies sequence conservation. The WBSCR9 gene encodes a roughly 7-kb transcript with an open reading frame of 1483 amino acids and a predicted protein product size of 170.8 kDa. WBSCR9 is comprised of at least 20 exons extending over 60 kb. The transcript is expressed ubiquitously throughout development and is subject to alternative splicing. Functional motifs identified by sequence homology searches include a bromodomain; a PHD, or C4HC3, finger; several putative nuclear localization signals; four nuclear receptor binding motifs; a polyglutamate stretch and two PEST sequences. Bromodomains, PHD motifs and nuclear receptor binding motifs are cardinal features of proteins that are involved in chromatin remodeling and modulation of transcription. Haploinsufficiency for WBSCR9 gene products may contribute to the complex phenotype of WBS by interacting with tissue-specific regulatory factors during development.

  6. Degradation of Benzene by Pseudomonas veronii 1YdBTEX2 and 1YB2 Is Catalyzed by Enzymes Encoded in Distinct Catabolism Gene Clusters.

    PubMed

    de Lima-Morales, Daiana; Chaves-Moreno, Diego; Wos-Oxley, Melissa L; Jáuregui, Ruy; Vilchez-Vargas, Ramiro; Pieper, Dietmar H

    2016-01-01

    Pseudomonas veronii 1YdBTEX2, a benzene and toluene degrader, and Pseudomonas veronii 1YB2, a benzene degrader, have previously been shown to be key players in a benzene-contaminated site. These strains harbor unique catabolic pathways for the degradation of benzene comprising a gene cluster encoding an isopropylbenzene dioxygenase where genes encoding downstream enzymes were interrupted by stop codons. Extradiol dioxygenases were recruited from gene clusters comprising genes encoding a 2-hydroxymuconic semialdehyde dehydrogenase necessary for benzene degradation but typically absent from isopropylbenzene dioxygenase-encoding gene clusters. The benzene dihydrodiol dehydrogenase-encoding gene was not clustered with any other aromatic degradation genes, and the encoded protein was only distantly related to dehydrogenases of aromatic degradation pathways. The involvement of the different gene clusters in the degradation pathways was suggested by real-time quantitative reverse transcription PCR.

  7. Regulatory role of XynR (YagI) in catabolism of xylonate in Escherichia coli K-12.

    PubMed

    Shimada, Tomohiro; Momiyama, Eri; Yamanaka, Yuki; Watanabe, Hiroki; Yamamoto, Kaneyoshi; Ishihama, Akira

    2017-12-01

    The genome of Escherichia coli K-12 contains ten cryptic phages, altogether constituting about 3.6% of the genome in sequence. Among more than 200 predicted genes in these cryptic phages, 14 putative transcription factor (TF) genes exist, but their regulatory functions remain unidentified. As an initial attempt to make a breakthrough for understanding the regulatory roles of cryptic phage-encoded TFs, we tried to identify the regulatory function of CP4-6 cryptic prophage-encoded YagI with unknown function. After SELEX screening, YagI was found to bind mainly at a single site within the spacer of bidirectional transcription units, yagA (encoding another uncharacterized TF) and yagEF (encoding 2-keto-3-deoxy gluconate aldolase, and dehydratase, respectively) within this prophage region. YagEF enzymes are involved in the catabolism of xylose downstream from xylonate. We then designated YagI as XynR (regulator of xylonate catabolism), one of the rare single-target TFs. In agreement with this predicted regulatory function, the activity of XynR was suggested to be controlled by xylonate. Even though low-affinity binding sites of XynR were identified in the E. coli K-12 genome, they all were inside open reading frames, implying that the regulation network of XynR is still fixed within the CP4-6 prophage without significant influence over the host E. coli K-12.

  8. Molecular cloning, characterization and expression analysis of TLR9, MyD88 and TRAF6 genes in common carp (Cyprinus carpio).

    PubMed

    Kongchum, Pawapol; Hallerman, Eric M; Hulata, Gideon; David, Lior; Palti, Yniv

    2011-01-01

    Induction of innate immune pathways is critical for early host defense, but there is limited understanding of how teleost fishes recognize pathogen molecules and activate these pathways. In mammals, cells of the innate immune system detect pathogenic molecular structures using pattern recognition receptors (PRRs). TLR9 functions as a PRR that recognizes CpG motifs in bacterial and viral DNA and requires adaptor molecules MyD88 and TRAF6 for signal transduction. Here we report full-length cDNA isolation, structural characterization and tissue mRNA expression analysis of the common carp (cc) TLR9, MyD88 and TRAF6 gene orthologs. The ccTLR9 open-reading frame (ORF) is predicted to encode a 1064-amino acid (aa) protein. We found that MyD88 and TRAF6 genes are duplicated in common carp. This is the first report of TRAF6 duplication in a vertebrate genome and stronger evidence in support of MyD88 duplication is provided. The ccMyD88a and b ORFs are predicted to encode 288-aa and 284-aa peptides, respectively. They share 91% aa sequence identity between paralogs. The ccTRAF6a and b ORFs are both predicted to encode 543-aa peptides sharing 95% aa sequence identity between paralogs. The ccTLR9 gene is contained in a single large exon. The ccMyD88a and ccMyD88b coding sequences span five exons. The TRAF6b gene spans six exons. PCR amplification to obtain the entire coding sequence of ccTRAF6a gene was not successful. The 2104-bp fragment amplified covers the 3' end of the gene and it contains a partial sequence of one exon and three complete exons. The predicted protein domains of the ccTLR9, ccMyD88 and ccTRAF6 are conserved and resemble orthologs from other vertebrates. Real-time quantitative PCR assays of the ccTLR9, MyD88a and b, and TRAF6a and b gene transcripts in healthy common carp indicated that mRNA expression varied between tissues. Differential expression of duplicate copies was found for ccMyD88 and ccTRAF6 in white and red muscle tissues, suggesting that paralogs may have evolved and attained a new function. The genomic information we describe in this paper provides evidence of sequence and structural conservation of immune response genes in common carp.

  9. Molecular cloning of an inducible serine esterase gene from human cytotoxic lymphocytes.

    PubMed Central

    Trapani, J A; Klein, J L; White, P C; Dupont, B

    1988-01-01

    A cDNA clone encoding a human serine esterase gene was isolated from a library constructed from poly(A)+ RNA of allogeneically stimulated, interleukin 2-expanded peripheral blood mononuclear cells. The clone, designated HSE26.1, represents a full-length copy of a 0.9-kilobase mRNA present in human cytotoxic cells but absent from a wide variety of noncytotoxic cell lines. Clone HSE26.1 contains an 892-base-pair sequence, including a single 741-base-pair open reading frame encoding a putative 247-residue polypeptide. The first 20 amino acids of the polypeptide form a leader sequence. The mature protein is predicted to have an unglycosylated Mr of approximately 26,000 and contains a single potential site for N-linked glycosylation. The nucleotide and predicted amino acid sequences of clone HSE26.1 are homologous with all murine and human serine esterases cloned thus far but are most similar to mouse granzyme B (70% nucleotide and 68% amino acid identity). HSE26.1 protein is expressed weakly in unstimulated peripheral blood mononuclear cells but is strongly induced within 6-hr incubation in medium containing phytohemagglutinin. The data suggest that the protein encoded by HSE26.1 plays a role in cell-mediated cytotoxicity. PMID:3261871

  10. The MET13 Methylenetetrahydrofolate Reductase Gene Is Essential for Infection-Related Morphogenesis in the Rice Blast Fungus Magnaporthe oryzae

    PubMed Central

    Wang, Hong; Wang, Congcong; Li, Ya; Yue, Xiaofeng; Ma, Zhonghua; Talbot, Nicholas J.; Wang, Zhengyi

    2013-01-01

    Methylenetetrahydrofolate reductases (MTHFRs) play a key role in the biosynthesis of methionine in both prokaryotic and eukaryotic organisms. In this study, we report the identification of a novel T-DNA-tagged mutant WH672 in the rice blast fungus Magnaporthe oryzae, which was defective in vegetative growth, conidiation and pathogenicity. Analysis of the mutation confirmed a single T-DNA insertion upstream of MET13, which encodes a 626-amino-acid MTHFR. Targeted gene deletion of MET13 resulted in mutants that were non-pathogenic and significantly impaired in aerial growth and melanin pigmentation. All phenotypes associated with Δmet13 mutants could be overcome by addition of exogenous methionine. The M. oryzae genome contains a second predicted MTHFR-encoding gene, MET12. The deduced amino acid sequences of Met13 and Met12 share 32% identity. Interestingly, Δmet12 mutants produced significantly fewer conidia compared with the isogenic wild-type strain and grew very poorly in the absence of methionine, but were fully pathogenic. Deletion of both genes resulted in Δmet13Δmet12 mutants that showed similar phenotypes to single Δmet13 mutants. Taken together, we conclude that the MTHFR gene, MET13, is essential for infection-related morphogenesis by the rice blast fungus M. oryzae. PMID:24116181

  11. chs-4, a class IV chitin synthase gene from Neurospora crassa.

    PubMed

    Din, A B; Specht, C A; Robbins, P W; Yarden, O

    1996-02-05

    In Saccharomyces cerevisiae, most of the cellular chitin is produced by chitin synthase III, which requires the product encoded by the CSD2/CAL1/DIT101/KT12 gene. We have identified, isolated and structurally characterized a CSD2/CAL1/DIT101/KT12 homologue in the filamentous ascomycete Neurospora crassa and have used a "reverse genetics" approach to determine its role in vivo. The yeast gene was used as a heterologous probe for the isolation of a N. crassa gene (designated chs-4) encoding a polypeptide belonging to a class of chitin synthases which we have designated class IV. The predicted polypeptide encoded by this gene is highly similar to those of S. cerevisiae and Candida albicans. N. crassa strains in which chs-4 had been inactivated by the Repeat-Induced point mutation (RIP) process grew and developed in a normal manner under standard growth conditions. However, when grown in the presence of sorbose (a carbon source which induces morphological changes accompanied by elevated chitin content), chitin levels in the chs-4RIP strain were significantly lower than those observed in the wild type. We suggest that CHS4 may serve as an auxiliary enzyme in N. crassa and that, in contrast to yeasts, it is possible that filamentous fungi may have more than one class IV chitin synthase.

  12. Mycobacterium ahvazicum sp. nov., the nineteenth species of the Mycobacterium simiae complex.

    PubMed

    Bouam, Amar; Heidarieh, Parvin; Shahraki, Abodolrazagh Hashemi; Pourahmad, Fazel; Mirsaeidi, Mehdi; Hashemzadeh, Mohamad; Baptiste, Emeline; Armstrong, Nicholas; Levasseur, Anthony; Robert, Catherine; Drancourt, Michel

    2018-03-07

    Four slowly growing mycobacterial isolates were obtained from the respiratory tract and soft tissue biopsies collected in four unrelated patients in Iran. Conventional phenotypic tests indicated that these four isolates were identical to Mycobacterium lentiflavum while 16S rRNA gene sequencing yielded a unique sequence separated from that of M. lentiflavum. One representative strain AFP-003 T was characterized as comprising a 6,121,237-bp chromosome (66.24% guanosine-cytosine content) encoding 5,758 protein-coding genes, 50 tRNA and one complete rRNA operon. A total of 2,876 proteins were found to be associated with the mobilome, including 195 phage proteins. A total of 1,235 proteins were found to be associated with virulence and 96 with toxin/antitoxin systems. The genome of AFP-003 T has the genetic potential to produce secondary metabolites, with 39 genes found to be associated with polyketide synthases and non-ribosomal peptide synthases and 11 genes encoding bacteriocins. Two regions encoding putative prophages and three OriC regions separated by the dnaA gene were predicted. Strain AFP-003 T genome exhibits 86% average nucleotide identity with Mycobacterium genavense genome. Genetic and genomic data indicate that strain AFP-003 T is representative of a novel Mycobacterium species that we named Mycobacterium ahvazicum, the nineteenth species of the expanding Mycobacterium simiae complex.
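
    The guanosine-cytosine content quoted in the abstract above (66.24%) is the share of G and C bases among the unambiguous bases of a sequence. The following is a minimal sketch of that calculation; the function name gc_content and the toy sequences are illustrative assumptions, not taken from the paper, and ambiguous bases such as N are simply skipped here.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases of `sequence`.

    Ambiguous symbols (for example N) are ignored; this handling is an
    assumption of the sketch, not something stated in the abstract.
    """
    called = [base for base in sequence.upper() if base in "ACGT"]
    if not called:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for base in called if base in "GC")
    return 100.0 * gc / len(called)


if __name__ == "__main__":
    # Toy sequence (hypothetical), not real genome data.
    print(f"{gc_content('GGCCATGCGC'):.2f}% GC")
```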

  13. Molecular cloning and characterization of a membrane associated NAC family gene, SiNAC from foxtail millet [Setaria italica (L.) P. Beauv].

    PubMed

    Puranik, Swati; Bahadur, Ranjit Prasad; Srivastava, Prem S; Prasad, Manoj

    2011-10-01

    The plant-specific NAC (NAM, ATAF, and CUC) transcription factors have diverse roles in development and stress regulation. A transcript encoding a NAC protein, termed SiNAC, was identified from a salt stress subtractive cDNA library of S. italica seedling (Puranik et al., J Plant Physiol 168:280-287, 2011). This single/low copy gene containing four exons and four introns within the genomic-sequence encoded a protein of 462 amino acids. Structural analysis revealed that the highly divergent C terminus contains a transmembrane domain. The NAC domain consisted of a twisted antiparallel beta-sheet packing against N terminal alpha helix on one side and a shorter helix on the other side. The domain was predicted to homodimerize and control DNA-binding specificity. The physicochemical features of the SiNAC homodimer interface justified the dimeric form of the predicted model. A 1539 bp fragment upstream to the start codon of SiNAC gene was cloned and in silico analysis revealed several putative cis-acting regulatory elements within the promoter sequence. Transactivation analysis indicated that SiNAC activated expression of reporter gene and the activation domain lay at the C terminus. The SiNAC:GFP was detected in the nucleus and cytoplasm while SiNAC ΔC(1-158):GFP was nuclear localized in onion epidermal cells. SiNAC transcripts mostly accumulated in young spikes and were strongly induced by dehydration, salinity, ethephon, and methyl jasmonate. These results suggest that SiNAC encodes a membrane associated NAC-domain protein that may function as a transcriptional activator in response to stress and developmental regulation in plants.

  14. Isolation and characterization of the genes for two small RNAs of herpesvirus papio and their comparison with Epstein-Barr virus-encoded EBER RNAs.

    PubMed

    Howe, J G; Shu, M D

    1988-08-01

    Genes for the Epstein-Barr virus-encoded RNAs (EBERs), two low-molecular-weight RNAs encoded by the human gammaherpesvirus Epstein-Barr virus (EBV), hybridize to two small RNAs in a baboon cell line that contains a similar virus, herpesvirus papio (HVP). The genes for the HVP RNAs (HVP-1 and HVP-2) are located together in the small unique region at the left end of the viral genome and are transcribed by RNA polymerase III in a rightward direction, similar to the EBERs. There is significant similarity between EBER1 and HVP-1 RNA, except for an insert of 22 nucleotides which increases the length of HVP-1 RNA to 190 nucleotides. There is less similarity between the sequences of EBER2 and HVP-2 RNA, but both have a length of about 170 nucleotides. The predicted secondary structure of each HVP RNA is remarkably similar to that of the respective EBER, implying that the secondary structures are important for function. Upstream from the initiation sites of all four RNA genes are several highly conserved sequences which may function in the regulation of transcription. The HVP RNAs, together with the EBERs, are highly abundant in transformed cells and are efficiently bound by the cellular La protein.

  15. Genome-Wide Analysis in Three Fusarium Pathogens Identifies Rapidly Evolving Chromosomes and Genes Associated with Pathogenicity

    PubMed Central

    Sperschneider, Jana; Gardiner, Donald M.; Thatcher, Louise F.; Lyons, Rebecca; Singh, Karam B.; Manners, John M.; Taylor, Jennifer M.

    2015-01-01

    Pathogens and hosts are in an ongoing arms race and genes involved in host–pathogen interactions are likely to undergo diversifying selection. Fusarium plant pathogens have evolved diverse infection strategies, but how they interact with their hosts in the biotrophic infection stage remains puzzling. To address this, we analyzed the genomes of three Fusarium plant pathogens for genes that are under diversifying selection. We found a two-speed genome structure both on the chromosome and gene group level. Diversifying selection acts strongly on the dispensable chromosomes in Fusarium oxysporum f. sp. lycopersici and on distinct core chromosome regions in Fusarium graminearum, all of which have associations with virulence. Members of two gene groups evolve rapidly, namely those that encode proteins with an N-terminal [SG]-P-C-[KR]-P sequence motif and proteins that are conserved predominantly in pathogens. Specifically, 29 F. graminearum genes are rapidly evolving, in planta induced and encode secreted proteins, strongly pointing toward effector function. In summary, diversifying selection in Fusarium is strongly reflected as genomic footprints and can be used to predict a small gene set likely to be involved in host–pathogen interactions for experimental verification. PMID:25994930
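
    The N-terminal [SG]-P-C-[KR]-P motif named in the abstract above maps directly onto a regular expression. The following is a minimal sketch of screening protein sequences for it; the 100-residue window used to mean "N-terminal", the function name and the toy sequences are assumptions made for illustration and are not taken from the paper.

```python
import re

# [SG]-P-C-[KR]-P in PROSITE-like notation becomes this regular expression.
MOTIF = re.compile(r"[SG]PC[KR]P")


def has_n_terminal_motif(protein: str, window: int = 100) -> bool:
    """Return True if the motif occurs within the first `window` residues.

    The window size is an assumption of this sketch; the abstract only
    says the motif is N-terminal.
    """
    return MOTIF.search(protein[:window].upper()) is not None


if __name__ == "__main__":
    # Hypothetical toy sequences, not real Fusarium proteins.
    candidates = {
        "toy_hit": "MKFSAASPCKPLLVAT",
        "toy_miss": "MKFSAAAPCKPLLVAT",
    }
    for name, seq in candidates.items():
        print(name, has_n_terminal_motif(seq))
```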

  16. A Zn(II)2Cys6 DNA binding protein regulates the sirodesmin PL biosynthetic gene cluster in Leptosphaeria maculans

    PubMed Central

    Fox, Ellen M.; Gardiner, Donald M.; Keller, Nancy P.; Howlett, Barbara J.

    2008-01-01

    A gene, sirZ, encoding a Zn(II)2Cys6 DNA binding protein is present in a cluster of genes responsible for the biosynthesis of the epipolythiodioxopiperazine (ETP) toxin, sirodesmin PL in the ascomycete plant pathogen, Leptosphaeria maculans. RNA-mediated silencing of sirZ gives rise to transformants that produce only residual amounts of sirodesmin PL and display a decrease in the transcription of several sirodesmin PL biosynthetic genes. This indicates that SirZ is a major regulator of this gene cluster. Proteins similar to SirZ are encoded in the gliotoxin biosynthetic gene cluster of Aspergillus fumigatus (gliZ) and in an ETP-like cluster in Penicillium lilacinoechinulatum (PlgliZ). Despite its high level of sequence similarity to gliZ, PlgliZ is unable to complement the gliotoxin-deficiency of a mutant of gliZ in A. fumigatus. Putative binding sites for these regulatory proteins in the promoters of genes in these clusters were predicted using bioinformatic analysis. These sites are similar to those commonly bound by other proteins with Zn(II)2Cys6 DNA binding domains. PMID:18023597

  17. cDNA sequence and expression of a cold-responsive gene in Citrus unshiu.

    PubMed

    Hara, M; Wakasugi, Y; Ikoma, Y; Yano, M; Ogawa, K; Kuboi, T

    1999-02-01

    A cDNA clone encoding a protein (CuCOR19), the sequence of which is similar to Poncirus COR19, of the dehydrin family was isolated from the epicarp of Citrus unshiu. The molecular mass of the predicted protein was 18,980 daltons. CuCOR19 was highly hydrophilic and contained three repeating elements including Lys-rich motifs. The gene expression in leaves was increased by cold stress.

  18. The Bacillus subtilis yaaH Gene Is Transcribed by SigE RNA Polymerase during Sporulation, and Its Product Is Involved in Germination of Spores

    PubMed Central

    Kodama, Takeko; Takamatsu, Hiromu; Asai, Kei; Kobayashi, Kazuo; Ogasawara, Naotake; Watabe, Kazuhito

    1999-01-01

    The expression of 21 novel genes located in the region from dnaA to abrB of the Bacillus subtilis chromosome was analyzed. One of the genes, yaaH, had a predicted promoter sequence conserved among SigE-dependent genes. Northern blot analysis revealed that yaaH mRNA was first detected from 2 h after the cessation of logarithmic growth (T2) of sporulation in wild-type cells and in spoIIIG (SigG−) and spoIVCB (SigK−) mutants but not in spoIIAC (SigF−) and spoIIGAB (SigE−) mutants. The transcription start point was determined by primer extension analysis; the −10 and −35 regions are very similar to the consensus sequences recognized by SigE-containing RNA polymerase. A YaaH-His tag fusion encoded by a plasmid with a predicted promoter for the yaaH gene was produced from T2 of sporulation in a B. subtilis transformant and extracted from mature spores, indicating that the yaaH gene product is a spore protein. Inactivation of the yaaH gene by insertion of an erythromycin resistance gene did not affect vegetative growth or spore resistance to heat, chloroform, and lysozyme. The germination of yaaH mutant spores in a mixture of l-asparagine, d-glucose, d-fructose, and potassium chloride was almost the same as that of wild-type spores, but the mutant spores were defective in l-alanine-stimulated germination. These results suggest that yaaH is a novel gene encoding a spore protein produced in the mother cell compartment from T2 of sporulation and that it is required for the l-alanine-stimulated germination pathway. PMID:10419957

  19. A provisional regulatory gene network for specification of endomesoderm in the sea urchin embryo

    NASA Technical Reports Server (NTRS)

    Davidson, Eric H.; Rast, Jonathan P.; Oliveri, Paola; Ransick, Andrew; Calestani, Cristina; Yuh, Chiou-Hwa; Minokawa, Takuya; Amore, Gabriele; Hinman, Veronica; Arenas-Mena, Cesar

    2002-01-01

    We present the current form of a provisional DNA sequence-based regulatory gene network that explains in outline how endomesodermal specification in the sea urchin embryo is controlled. The model of the network is in a continuous process of revision and growth as new genes are added and new experimental results become available; see http://www.its.caltech.edu/mirsky/endomeso.htm (End-mes Gene Network Update) for the latest version. The network contains over 40 genes at present, many newly uncovered in the course of this work, and most encoding DNA-binding transcriptional regulatory factors. The architecture of the network was approached initially by construction of a logic model that integrated the extensive experimental evidence now available on endomesoderm specification. The internal linkages between genes in the network have been determined functionally, by measurement of the effects of regulatory perturbations on the expression of all relevant genes in the network. Five kinds of perturbation have been applied: (1) use of morpholino antisense oligonucleotides targeted to many of the key regulatory genes in the network; (2) transformation of other regulatory factors into dominant repressors by construction of Engrailed repressor domain fusions; (3) ectopic expression of given regulatory factors, from genetic expression constructs and from injected mRNAs; (4) blockade of the beta-catenin/Tcf pathway by introduction of mRNA encoding the intracellular domain of cadherin; and (5) blockade of the Notch signaling pathway by introduction of mRNA encoding the extracellular domain of the Notch receptor. The network model predicts the cis-regulatory inputs that link each gene into the network. Therefore, its architecture is testable by cis-regulatory analysis. Strongylocentrotus purpuratus and Lytechinus variegatus genomic BAC recombinants that include a large number of the genes in the network have been sequenced and annotated. 
Tests of the cis-regulatory predictions of the model are greatly facilitated by interspecific computational sequence comparison, which affords a rapid identification of likely cis-regulatory elements in advance of experimental analysis. The network specifies genomically encoded regulatory processes between early cleavage and gastrula stages. These control the specification of the micromere lineage and of the initial veg(2) endomesodermal domain; the blastula-stage separation of the central veg(2) mesodermal domain (i.e., the secondary mesenchyme progenitor field) from the peripheral veg(2) endodermal domain; the stabilization of specification state within these domains; and activation of some downstream differentiation genes. Each of the temporal-spatial phases of specification is represented in a subelement of the network model, that treats regulatory events within the relevant embryonic nuclei at particular stages. (c) 2002 Elsevier Science (USA).

  20. Genes involved in host-parasite interactions can be revealed by their correlated expression.

    PubMed

    Reid, Adam James; Berriman, Matthew

    2013-02-01

    Molecular interactions between a parasite and its host are key to the ability of the parasite to enter the host and persist. Our understanding of the genes and proteins involved in these interactions is limited. To better understand these processes it would be advantageous to have a range of methods to predict pairs of genes involved in such interactions. Correlated gene expression profiles can be used to identify molecular interactions within a species. Here we have extended the concept to different species, showing that genes with correlated expression are more likely to encode proteins, which directly or indirectly participate in host-parasite interaction. We go on to examine our predictions of molecular interactions between the malaria parasite and both its mammalian host and insect vector. Our approach could be applied to study any interaction between species, for example, between a host and its parasites or pathogens, but also symbiotic and commensal pairings.
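
    The idea in the abstract above, pairing a host gene with a parasite gene when their expression profiles rise and fall together across the same samples, can be sketched as follows. The abstract does not say which correlation measure or cut-off was used, so Pearson correlation, the 0.9 threshold, the function names and the toy profiles are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat profile carries no correlation signal
    return cov / sqrt(var_x * var_y)


def correlated_pairs(host, parasite, threshold=0.9):
    """List (host_gene, parasite_gene, r) with |r| >= threshold.

    `host` and `parasite` map gene names to expression values measured
    in the same samples, in the same order.
    """
    pairs = []
    for host_gene, host_profile in host.items():
        for parasite_gene, parasite_profile in parasite.items():
            r = pearson(host_profile, parasite_profile)
            if abs(r) >= threshold:
                pairs.append((host_gene, parasite_gene, r))
    return sorted(pairs, key=lambda item: -abs(item[2]))


if __name__ == "__main__":
    # Hypothetical toy profiles over four shared samples.
    host = {"hostA": [1, 2, 3, 4], "hostB": [4, 1, 3, 2]}
    parasite = {"parX": [2, 4, 6, 8], "parY": [1, 1, 2, 1]}
    for pair in correlated_pairs(host, parasite):
        print(pair)
```

    Pairs that pass the threshold are candidates only; correlation across samples does not by itself show a direct molecular interaction.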

  1. Detecting Gene Rearrangements in Patient Populations Through a 2-Step Diagnostic Test Comprised of Rapid IHC Enrichment Followed by Sensitive Next-Generation Sequencing

    PubMed Central

    Murphy, Danielle A.; Ely, Heather A.; Shoemaker, Robert; Boomer, Aaron; Culver, Brady P.; Hoskins, Ian; Haimes, Josh D.; Walters, Ryan D.; Fernandez, Diane; Stahl, Joshua A.; Lee, Jeeyun; Kim, Kyoung-Mee; Lamoureux, Jennifer

    2017-01-01

    Targeted therapy combined with companion diagnostics has led to the advancement of next-generation sequencing (NGS) for detection of molecular alterations. However, using a diagnostic test to identify patient populations with low prevalence molecular alterations, such as gene rearrangements, poses efficiency and cost challenges. To address this, we have developed a 2-step diagnostic test to identify NTRK1, NTRK2, NTRK3, ROS1, and ALK rearrangements in formalin-fixed paraffin-embedded clinical specimens. This test is comprised of immunohistochemistry screening using a pan-receptor tyrosine kinase cocktail of antibodies to identify samples expressing TrkA (encoded by NTRK1), TrkB (encoded by NTRK2), TrkC (encoded by NTRK3), ROS1, and ALK followed by an RNA-based anchored multiplex polymerase chain reaction NGS assay. We demonstrate that the NGS assay is accurate and reproducible in identification of gene rearrangements. Furthermore, implementation of an RNA quality control metric to assess the presence of amplifiable nucleic acid input material enables a measure of confidence when an NGS result is negative for gene rearrangements. Finally, we demonstrate that performing a pan-receptor tyrosine kinase immunohistochemistry staining enriches detection of the patient population for gene rearrangements from 4% to 9% and has a 100% negative predictive value. Together, this 2-step assay is an efficient method for detection of gene rearrangements in both clinical testing and studies of archival formalin-fixed paraffin-embedded specimens. PMID:27028240
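
    The two figures reported in the abstract above, enrichment from 4% to 9% and a 100% negative predictive value, follow from simple ratios over screening counts. The following is a minimal sketch of that arithmetic. The counts are invented solely so that the ratios reproduce the reported percentages; they are not the study's data, and the function names are illustrative.

```python
def negative_predictive_value(true_negatives: int, false_negatives: int) -> float:
    """NPV = TN / (TN + FN): the share of screen-negative samples that are truly negative."""
    total = true_negatives + false_negatives
    if total == 0:
        raise ValueError("no screen-negative samples")
    return true_negatives / total


def prevalence(positives: int, total: int) -> float:
    """Fraction of samples that carry a rearrangement."""
    if total <= 0:
        raise ValueError("total must be positive")
    return positives / total


if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the reported percentages.
    cohort, rearranged = 1000, 40          # 4% before screening
    ihc_positive, ihc_negative = 444, 556  # every rearranged sample is IHC-positive
    print(f"before IHC: {prevalence(rearranged, cohort):.0%}")
    print(f"after IHC:  {prevalence(rearranged, ihc_positive):.0%}")
    print(f"NPV:        {negative_predictive_value(ihc_negative, 0):.0%}")
```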

  2. Mitochondrial Genes of Dinoflagellates Are Transcribed by a Nuclear-Encoded Single-Subunit RNA Polymerase.

    PubMed

    Teng, Chang Ying; Dang, Yunkun; Danne, Jillian C; Waller, Ross F; Green, Beverley R

    2013-01-01

    Dinoflagellates are a large group of algae that contribute significantly to marine productivity and are essential photosynthetic symbionts of corals. Although these algae have fully-functioning mitochondria and chloroplasts, both their organelle genomes have been highly reduced and the genes fragmented and rearranged, with many aberrant transcripts. However, nothing is known about their RNA polymerases. We cloned and sequenced the gene for the nuclear-encoded mitochondrial polymerase (RpoTm) of the dinoflagellate Heterocapsa triquetra and showed that the protein presequence targeted a GFP construct into yeast mitochondria. The gene belongs to a small gene family, which includes a variety of 3'-truncated copies that may have originated by retroposition. The catalytic C-terminal domain of the protein shares nine conserved sequence blocks with other single-subunit polymerases and is predicted to have the same fold as the human enzyme. However, the N-terminal (promoter binding/transcription initiation) domain is not well-conserved. In conjunction with the degenerate nature of the mitochondrial genome, this suggests a requirement for novel accessory factors to ensure the accurate production of functional mRNAs.

  3. Transcriptome Profiling of Shewanella oneidensis Gene Expression following Exposure to Acidic and Alkaline pH

    PubMed Central

    Leaphart, Adam B.; Thompson, Dorothea K.; Huang, Katherine; Alm, Eric; Wan, Xiu-Feng; Arkin, Adam; Brown, Steven D.; Wu, Liyou; Yan, Tingfen; Liu, Xueduan; Wickham, Gene S.; Zhou, Jizhong

    2006-01-01

    The molecular response of Shewanella oneidensis MR-1 to variations in extracellular pH was investigated based on genomewide gene expression profiling. Microarray analysis revealed that cells elicited both general and specific transcriptome responses when challenged with environmental acid (pH 4) or base (pH 10) conditions over a 60-min period. Global responses included the differential expression of genes functionally linked to amino acid metabolism, transcriptional regulation and signal transduction, transport, cell membrane structure, and oxidative stress protection. Response to acid stress included the elevated expression of genes encoding glycogen biosynthetic enzymes, phosphate transporters, and the RNA polymerase sigma-38 factor (rpoS), whereas the molecular response to alkaline pH was characterized by upregulation of nhaA and nhaR, which are predicted to encode an Na+/H+ antiporter and transcriptional activator, respectively, as well as sulfate transport and sulfur metabolism genes. Collectively, these results suggest that S. oneidensis modulates multiple transporters, cell envelope components, and pathways of amino acid consumption and central intermediary metabolism as part of its transcriptome response to changing external pH conditions. PMID:16452448

  4. Evaluating High-Throughput Ab Initio Gene Finders to Discover Proteins Encoded in Eukaryotic Pathogen Genomes Missed by Laboratory Techniques

    PubMed Central

    Goodswen, Stephen J.; Kennedy, Paul J.; Ellis, John T.

    2012-01-01

    Next generation sequencing technology is advancing genome sequencing at an unprecedented level. By unravelling the code within a pathogen’s genome, every possible protein (prior to post-translational modifications) can theoretically be discovered, irrespective of life cycle stages and environmental stimuli. Now more than ever there is a great need for high-throughput ab initio gene finding. Ab initio gene finders use statistical models to predict genes and their exon-intron structures from the genome sequence alone. This paper evaluates whether existing ab initio gene finders can effectively predict genes to deduce proteins that have presently missed capture by laboratory techniques. An aim here is to identify possible patterns of prediction inaccuracies for gene finders as a whole irrespective of the target pathogen. All currently available ab initio gene finders are considered in the evaluation but only four fulfil high-throughput capability: AUGUSTUS, GeneMark_hmm, GlimmerHMM, and SNAP. These gene finders require training data specific to a target pathogen and consequently the evaluation results are inextricably linked to the availability and quality of the data. The pathogen, Toxoplasma gondii, is used to illustrate the evaluation methods. The results support current opinion that predicted exons by ab initio gene finders are inaccurate in the absence of experimental evidence. However, the results reveal some patterns of inaccuracy that are common to all gene finders and these inaccuracies may provide a focus area for future gene finder developers. PMID:23226328

  5. TnpPred: A Web Service for the Robust Prediction of Prokaryotic Transposases

    PubMed Central

    Riadi, Gonzalo; Medina-Moenne, Cristobal; Holmes, David S.

    2012-01-01

    Transposases (Tnps) are enzymes that participate in the movement of insertion sequences (ISs) within and between genomes. Genes that encode Tnps are amongst the most abundant and widely distributed genes in nature. However, they are difficult to predict bioinformatically and given the increasing availability of prokaryotic genomes and metagenomes, it is incumbent to develop rapid, high quality automatic annotation of ISs. This need prompted us to develop a web service, termed TnpPred for Tnp discovery. It provides better sensitivity and specificity for Tnp predictions than given by currently available programs as determined by ROC analysis. TnpPred should be useful for improving genome annotation. The TnpPred web service is freely available for noncommercial use. PMID:23251097

  6. Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication.

    PubMed

    Chapman, Brad A; Bowers, John E; Feltus, Frank A; Paterson, Andrew H

    2006-02-21

    Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a "paleologous" locus than in "singleton" genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during "genomic turmoil" immediately after genome duplication but continues to act approximately 60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages.

  7. Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication

    PubMed Central

    Chapman, Brad A.; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.

    2006-01-01

    Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a “paleologous” locus than in “singleton” genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during “genomic turmoil” immediately after genome duplication but continues to act ≈60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages. PMID:16467140

  8. Molecular Characterization of Lactobacillus plantarum Genes for β-Ketoacyl-Acyl Carrier Protein Synthase III (fabH) and Acetyl Coenzyme A Carboxylase (accBCDA), Which Are Essential for Fatty Acid Biosynthesis

    PubMed Central

    Kiatpapan, Pornpimon; Kobayashi, Hajime; Sakaguchi, Maki; Ono, Hisayo; Yamashita, Mitsuo; Kaneko, Yoshinobu; Murooka, Yoshikatsu

    2001-01-01

    Genes for subunits of acetyl coenzyme A carboxylase (ACC), which is the enzyme that catalyzes the first step in the synthesis of fatty acids in Lactobacillus plantarum L137, were cloned and characterized. We identified six potential open reading frames, namely, manB, fabH, accB, accC, accD, and accA, in that order. Nucleotide sequence analysis suggested that fabH encoded β-ketoacyl-acyl carrier protein synthase III, that the accB, accC, accD, and accA genes encoded biotin carboxyl carrier protein, biotin carboxylase, and the β and α subunits of carboxyltransferase, respectively, and that these genes were clustered. The organization of acc genes was different from that reported for Escherichia coli, for Bacillus subtilis, and for Pseudomonas aeruginosa. E. coli accB and accD mutations were complemented by the L. plantarum accB and accD genes, respectively. The predicted products of all five genes were confirmed by using the T7 expression system in E. coli. The gene product of accB was biotinylated in E. coli. Northern and primer extension analyses demonstrated that the five genes in L. plantarum were regulated polycistronically in an acc operon. PMID:11133475

  9. Identification and Characterization of Cyprinid Herpesvirus-3 (CyHV-3) Encoded MicroRNAs

    PubMed Central

    Donohoe, Owen H.; Henshilwood, Kathy; Way, Keith; Hakimjavadi, Roya; Stone, David M.; Walls, Dermot

    2015-01-01

    MicroRNAs (miRNAs) are a class of small non-coding RNAs involved in post-transcriptional gene regulation. Some viruses encode their own miRNAs and these are increasingly being recognized as important modulators of viral and host gene expression. Cyprinid herpesvirus 3 (CyHV-3) is a highly pathogenic agent that causes acute mass mortalities in carp (Cyprinus carpio carpio) and koi (Cyprinus carpio koi) worldwide. Here, bioinformatic analyses of the CyHV-3 genome suggested the presence of non-conserved precursor miRNA (pre-miRNA) genes. Deep sequencing of small RNA fractions prepared from in vitro CyHV-3 infections led to the identification of potential miRNAs and miRNA–offset RNAs (moRNAs) derived from some bioinformatically predicted pre-miRNAs. DNA microarray hybridization analysis, Northern blotting and stem-loop RT-qPCR were then used to definitively confirm that CyHV-3 expresses two pre-miRNAs during infection in vitro. The evidence also suggested the presence of an additional four high-probability and two putative viral pre-miRNAs. MiRNAs from the two confirmed pre-miRNAs were also detected in gill tissue from CyHV-3-infected carp. We also present evidence that one confirmed miRNA can regulate the expression of a putative CyHV-3-encoded dUTPase. Candidate homologues of some CyHV-3 pre-miRNAs were identified in CyHV-1 and CyHV-2. This is the first report of miRNA and moRNA genes encoded by members of the Alloherpesviridae family, a group distantly related to the Herpesviridae family. The discovery of these novel CyHV-3 genes may help further our understanding of the biology of this economically important virus and their encoded miRNAs may have potential as biomarkers for the diagnosis of latent CyHV-3. PMID:25928140

  10. The Hcp proteins fused with diverse extended-toxin domains represent a novel pattern of antibacterial effectors in type VI secretion systems

    PubMed Central

    Ma, Jiale; Pan, Zihao; Huang, Jinhu; Sun, Min; Lu, Chengping; Yao, Huochun

    2017-01-01

    The type VI secretion system (T6SS) is a widespread molecular weapon deployed by many bacterial species to target eukaryotic host cells or rival bacteria. Using a dynamic injection mechanism, diverse effectors can be delivered by T6SS directly into recipient cells. Here, we report a new family of T6SS effectors encoded by extended Hcps carrying diverse toxin domains. Bioinformatic analyses revealed that these Hcps with C-terminal extension toxins, designated as Hcp-ET, exist widely in the Enterobacteriaceae. To verify our findings, Hcp-ET1 was tested for its antibacterial effect, and showed effective inhibition of target cell growth via the predicted HNH-DNase activity by T6SS-dependent delivery. Further studies showed that Hcp-ET2 mediated interbacterial antagonism via a Tle1 phospholipase (encoded by DUF2235 domain) activity. Notably, comprehensive analyses of protein homology and genomic neighborhoods revealed that Hcp-ET3–4 is fused with 2 toxin domains (Pyocin S3 and Colicin-DNase) C-terminally, and its encoding gene is followed by 3 duplications of the cognate immunity genes. However, some bacteria encode separate hcp-et3 and orphan et4 (et4O1) genes caused by a termination-codon mutation in the fusion region between the Pyocin S3 and Colicin-DNase encoding fragments. Our results demonstrated that both of these toxins had antibacterial effects. Further, all duplications of the cognate immunity protein contributed to neutralizing the DNase toxicity of Pyocin S3 and Colicin, which has not been reported previously. In conclusion, we propose that Hcp-ET proteins are polymorphic T6SS effectors, and thus present a novel encoding pattern of T6SS effectors. PMID:28060574

  11. Ostertagia circumcincta: isolation of a partial cDNA encoding an unusual member of the mitochondrial processing peptidase subfamily of M16 metallopeptidases.

    PubMed

    Walker, J; Tait, A

    1997-11-01

    A reverse-transcriptase polymerase chain reaction (PCR) procedure was used to isolate an Ostertagia circumcincta partial cDNA encoding a protein with general primary sequence features characteristic of members of the mitochondrial processing peptidase (MPP) subfamily of M16 metallopeptidases. The structural relationships of the predicted protein (Oc MPPX) with MPP subfamily proteins from other species (including the model free-living nematode Caenorhabditis elegans) were examined, and Northern analysis confirmed the expression of the Oc mppx gene in adult nematodes.

  12. Transcription Factors Expressed in Lateral Organ Boundaries: Identification of Downstream Targets

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Springer, Patricia S

    2010-07-12

    The processes of lateral organ initiation and patterning are central to the generation of mature plant form. Characterization of the molecular mechanisms underlying these processes is essential to our understanding of plant development. Communication between the shoot apical meristem and initiating organ primordia is important both for functioning of the meristem and for proper organ patterning, and very little is known about this process. In particular, the boundary between meristem and leaf is emerging as a critical region that is important for SAM maintenance and regulation of organogenesis. The goal of this project was to characterize three boundary-expressed genes that encode predicted transcription factors. Specifically, we have studied LATERAL ORGAN BOUNDARIES (LOB), LATERAL ORGAN FUSION1 (LOF1), and LATERAL ORGAN FUSION2 (LOF2). LOB encodes the founding member of the LOB-DOMAIN (LBD) plant-specific DNA binding transcription factor family and LOF1 and LOF2 encode paralogous MYB-domain transcription factors. We characterized the genetic relationship between these three genes and other boundary and meristem genes. We also used an ectopic inducible expression system to identify direct targets of LOB.

  13. SGRL can regulate chlorophyll metabolism and contributes to normal plant growth and development in Pisum sativum L.

    PubMed

    Bell, Andrew; Moreau, Carol; Chinoy, Catherine; Spanner, Rebecca; Dalmais, Marion; Le Signor, Christine; Bendahmane, Abdel; Klenell, Markus; Domoney, Claire

    2015-12-01

    Among a set of genes in pea (Pisum sativum L.) that were induced under drought-stress growth conditions, one encoded a protein with significant similarity to a regulator of chlorophyll catabolism, SGR. This gene, SGRL, is distinct from SGR in genomic location, encoded carboxy-terminal motif, and expression through plant and seed development. Divergence of the two encoded proteins is associated with a loss of similarity in intron/exon gene structure. Transient expression of SGRL in leaves of Nicotiana benthamiana promoted the degradation of chlorophyll, in a manner that was distinct from that shown by SGR. Removal of a predicted transmembrane domain from SGRL reduced its activity in transient expression assays, although variants with and without this domain reduced SGR-induced chlorophyll degradation, indicating that the effects of the two proteins are not additive. The combined data suggest that the function of SGRL during growth and development is in chlorophyll re-cycling, and its mode of action is distinct from that of SGR. Studies of pea sgrL mutants revealed that plants had significantly lower stature and yield, a likely consequence of reduced photosynthetic efficiencies in mutant compared with control plants under conditions of high light intensity.

  14. Plasmid-encoded hygromycin B resistance: the sequence of hygromycin B phosphotransferase gene and its expression in Escherichia coli and Saccharomyces cerevisiae.

    PubMed

    Gritz, L; Davies, J

    1983-11-01

    The plasmid-borne gene hph coding for hygromycin B phosphotransferase (HPH) in Escherichia coli has been identified and its nucleotide sequence determined. The hph gene is 1026 nucleotides long, coding for a protein with a predicted Mr of 39 000. The hph gene was placed in a shuttle plasmid vector, downstream from the promoter region of the cyc 1 gene of Saccharomyces cerevisiae, and an hph construction containing a single AUG in the 5' noncoding region allowed direct selection following transformation in yeast and in E. coli. Thus the hph gene can be used in cloning vectors for both pro- and eukaryotes.
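    The relationship between the reported gene length (1026 nucleotides) and the predicted Mr (39 000) can be checked with a rough back-of-the-envelope calculation. The sketch below is illustrative only: the function name is hypothetical, the average residue mass of about 110 Da is a common rule of thumb rather than a value from the abstract, and whether the 1026 nucleotides include the stop codon is an assumption.

```python
# Rough estimate of protein mass from ORF length.
# Assumptions (not from the abstract): ~110 Da average residue mass,
# and the ORF length optionally including one stop codon.
AVERAGE_RESIDUE_MASS_DA = 110.0


def estimate_protein_mass(orf_length_nt, includes_stop_codon=True,
                          avg_residue_mass=AVERAGE_RESIDUE_MASS_DA):
    """Return (residue_count, approximate_mass_in_daltons)."""
    if orf_length_nt <= 0 or orf_length_nt % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_length_nt // 3
    # A stop codon does not contribute an amino acid residue.
    residues = codons - 1 if includes_stop_codon else codons
    return residues, residues * avg_residue_mass


if __name__ == "__main__":
    residues, mass = estimate_protein_mass(1026)
    print(f"{residues} residues, roughly {mass / 1000:.1f} kDa")
```

    Under these assumptions the estimate comes out near 37.5 kDa, the same order as the reported Mr of 39 000; the gap is expected because the true mass depends on the actual amino acid composition.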

  15. A comparative genomics perspective on the genetic content of the alkaliphilic haloarchaeon Natrialba magadii ATCC 43099T

    PubMed Central

    2012-01-01

    Background Natrialba magadii is an aerobic chemoorganotrophic member of the Euryarchaeota and is a dual extremophile requiring alkaline conditions and hypersalinity for optimal growth. The genome sequence of Nab. magadii type strain ATCC 43099 was deciphered to obtain a comprehensive insight into the genetic content of this haloarchaeon and to understand the basis of some of the cellular functions necessary for its survival. Results The genome of Nab. magadii consists of four replicons with a total sequence of 4,443,643 bp and encodes 4,212 putative proteins, some of which contain peptide repeats of various lengths. Comparative genome analyses facilitated the identification of genes encoding putative proteins involved in adaptation to hypersalinity, stress response, glycosylation, and polysaccharide biosynthesis. A proton-driven ATP synthase and a variety of putative cytochromes and other proteins supporting aerobic respiration and electron transfer were encoded by one or more of Nab. magadii replicons. The genome encodes a number of putative proteases/peptidases as well as protein secretion functions. Genes encoding putative transcriptional regulators, basal transcription factors, signal perception/transduction proteins, and chemotaxis/phototaxis proteins were abundant in the genome. Pathways for the biosynthesis of thiamine, riboflavin, heme, cobalamin, coenzyme F420 and other essential co-factors were deduced by in depth sequence analyses. However, approximately 36% of Nab. magadii protein coding genes could not be assigned a function based on Blast analysis and have been annotated as encoding hypothetical or conserved hypothetical proteins. Furthermore, despite extensive comparative genomic analyses, genes necessary for survival in alkaline conditions could not be identified in Nab. magadii. Conclusions Based on genomic analyses, Nab. magadii is predicted to be metabolically versatile and it could use different carbon and energy sources to sustain growth. Nab. magadii has the genetic potential to adapt to its milieu by intracellular accumulation of inorganic cations and/or neutral organic compounds. The identification of Nab. magadii genes involved in coenzyme biosynthesis is a necessary step toward further reconstruction of the metabolic pathways in halophilic archaea and other extremophiles. The knowledge gained from the genome sequence of this haloalkaliphilic archaeon is highly valuable in advancing the applications of extremophiles and their enzymes. PMID:22559199

  16. TnSeq of Mycobacterium tuberculosis clinical isolates reveals strain-specific antibiotic liabilities

    PubMed Central

    Carey, Allison F.; Rock, Jeremy M.; Krieger, Inna V.; Gagneux, Sebastien; Sacchettini, James C.; Fortune, Sarah M.

    2018-01-01

    Once considered a phenotypically monomorphic bacterium, there is a growing body of work demonstrating heterogeneity among Mycobacterium tuberculosis (Mtb) strains in clinically relevant characteristics, including virulence and response to antibiotics. However, the genetic and molecular basis for most phenotypic differences among Mtb strains remains unknown. To investigate the basis of strain variation in Mtb, we performed genome-wide transposon mutagenesis coupled with next-generation sequencing (TnSeq) for a panel of Mtb clinical isolates and the reference strain H37Rv to compare genetic requirements for in vitro growth across these strains. We developed an analytic approach to identify quantitative differences in genetic requirements between these genetically diverse strains, which vary in genomic structure and gene content. Using this methodology, we found differences between strains in their requirements for genes involved in fundamental cellular processes, including redox homeostasis and central carbon metabolism. Among the genes with differential requirements were katG, which encodes the activator of the first-line antitubercular agent isoniazid, and glcB, which encodes malate synthase, the target of a novel small-molecule inhibitor. Differences among strains in their requirement for katG and glcB predicted differences in their response to these antimicrobial agents. Importantly, these strain-specific differences in antibiotic response could not be predicted by genetic variants identified through whole genome sequencing or by gene expression analysis. Our results provide novel insight into the basis of variation among Mtb strains and demonstrate that TnSeq is a scalable method to predict clinically important phenotypic differences among Mtb strains. PMID:29505613

  17. Analysis of membrane protein genes in a Brazilian isolate of Anaplasma marginale.

    PubMed

    G Junior, Daniel S; Araújo, Flábio R; Almeida Junior, Nalvo F; Adi, Said S; Cheung, Luciana M; Fragoso, Stenio P; Ramos, Carlos A N; Oliveira, Renato Henrique M de; Santos, Caroline S; Bacanelli, Gisele; Soares, Cleber O; Rosinha, Grácia M S; Fonseca, Adivaldo H

    2010-11-01

    The sequencing of the complete genome of Anaplasma marginale has enabled the identification of several genes that encode membrane proteins, thereby increasing the chances of identifying candidate immunogens. Little is known regarding the genetic variability of genes that encode membrane proteins in A. marginale isolates. The aim of the present study was to determine the degree of conservation of the predicted amino acid sequences of OMP1, OMP4, OMP5, OMP7, OMP8, OMP10, OMP14, OMP15, SODb, OPAG1, OPAG3, VirB3, VirB9-1, PepA, EF-Tu and AM854 proteins in a Brazilian isolate of A. marginale compared to other isolates. Hence, primers were used to amplify these genes: omp1, omp4, omp5, omp7, omp8, omp10, omp14, omp15, sodb, opag1, opag3, virb3, VirB9-1, pepA, ef-tu and am854. After polymerase chain reaction amplification, the products were cloned and sequenced using the Sanger method and the predicted amino acid sequences were multi-aligned using the CLUSTALW and MEGA 4 programs, comparing the predicted sequences between the Brazilian, Saint Maries, Florida and A. marginale centrale isolates. With the exception of outer membrane protein (OMP) 7, all proteins exhibited 92-100% homology to the other A. marginale isolates. However, only OMP1, OMP5, EF-Tu, VirB3, SODb and VirB9-1 were selected as potential immunogens capable of promoting cross-protection between isolates due to the high degree of homology (over 72%) also found with A. (centrale) marginale.

  18. Inference of Expanded Lrp-Like Feast/Famine Transcription Factor Targets in a Non-Model Organism Using Protein Structure-Based Prediction

    PubMed Central

    Ashworth, Justin; Plaisier, Christopher L.; Lo, Fang Yin; Reiss, David J.; Baliga, Nitin S.

    2014-01-01

    Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer. PMID:25255272

  19. Inference of expanded Lrp-like feast/famine transcription factor targets in a non-model organism using protein structure-based prediction.

    PubMed

    Ashworth, Justin; Plaisier, Christopher L; Lo, Fang Yin; Reiss, David J; Baliga, Nitin S

    2014-01-01

    Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer.

  20. Satellite remote sensing data can be used to model marine microbial metabolite turnover

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Larsen, Peter E.; Scott, Nicole; Post, Anton F.

    Sampling ecosystems, even at a local scale, at the temporal and spatial resolution necessary to capture natural variability in microbial communities is prohibitively expensive. We extrapolated marine surface microbial community structure and metabolic potential from 72 16S rRNA amplicon and 8 metagenomic observations using remotely sensed environmental parameters to create a system-scale model of marine microbial metabolism for 5904 grid cells (49 km²) in the Western English Channel, across 3 years of weekly averages. Thirteen environmental variables predicted the relative abundance of 24 bacterial Orders and 1715 unique enzyme-encoding genes that encode turnover of 2893 metabolites. The genes’ predicted relative abundance was highly correlated (Pearson Correlation 0.72, P-value <10⁻⁶) with their observed relative abundance in sequenced metagenomes. Predictions of the relative turnover (synthesis or consumption) of CO2 were significantly correlated with observed surface CO2 fugacity. The spatial and temporal variation in the predicted relative abundances of genes coding for cyanase, carbon monoxide and malate dehydrogenase were investigated along with the predicted inter-annual variation in relative consumption or production of ~3000 metabolites forming six significant temporal clusters. These spatiotemporal distributions could possibly be explained by the co-occurrence of anaerobic and aerobic metabolisms associated with localized plankton blooms or sediment resuspension, which facilitate the presence of anaerobic micro-niches. This predictive model provides a general framework for focusing future sampling and experimental design to relate biogeochemical turnover to microbial ecology.
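    The agreement between predicted and observed relative abundance is summarized above as a Pearson correlation. As a minimal sketch of how such a coefficient is computed, the function below implements the standard formula in plain Python; the function name and the toy abundance values are hypothetical and are not data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical predicted vs. observed relative abundances for five genes.
    predicted = [0.10, 0.25, 0.05, 0.40, 0.20]
    observed = [0.12, 0.20, 0.08, 0.35, 0.25]
    print(f"r = {pearson_r(predicted, observed):.2f}")
```

    A significance test (the P-value quoted in the abstract) would require an additional step, for example a t-test on r or a permutation test, which is omitted here.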

  1. Genomic instability--an evolving hallmark of cancer.

    PubMed

    Negrini, Simona; Gorgoulis, Vassilis G; Halazonetis, Thanos D

    2010-03-01

    Genomic instability is a characteristic of most cancers. In hereditary cancers, genomic instability results from mutations in DNA repair genes and drives cancer development, as predicted by the mutator hypothesis. In sporadic (non-hereditary) cancers the molecular basis of genomic instability remains unclear, but recent high-throughput sequencing studies suggest that mutations in DNA repair genes are infrequent before therapy, arguing against the mutator hypothesis for these cancers. Instead, the mutation patterns of the tumour suppressor TP53 (which encodes p53), ataxia telangiectasia mutated (ATM) and cyclin-dependent kinase inhibitor 2A (CDKN2A; which encodes p16INK4A and p14ARF) support the oncogene-induced DNA replication stress model, which attributes genomic instability and TP53 and ATM mutations to oncogene-induced DNA damage.

  2. Whole Genome Sequences of Three Treponema pallidum ssp. pertenue Strains: Yaws and Syphilis Treponemes Differ in Less than 0.2% of the Genome Sequence

    PubMed Central

    Chen, Lei; Pospíšilová, Petra; Strouhal, Michal; Qin, Xiang; Mikalová, Lenka; Norris, Steven J.; Muzny, Donna M.; Gibbs, Richard A.; Fulton, Lucinda L.; Sodergren, Erica; Weinstock, George M.; Šmajs, David

    2012-01-01

    Background The yaws treponemes, Treponema pallidum ssp. pertenue (TPE) strains, are closely related to syphilis causing strains of Treponema pallidum ssp. pallidum (TPA). Both yaws and syphilis are distinguished on the basis of epidemiological characteristics, clinical symptoms, and several genetic signatures of the corresponding causative agents. Methodology/Principal Findings To precisely define genetic differences between TPA and TPE, high-quality whole genome sequences of three TPE strains (Samoa D, CDC-2, Gauthier) were determined using next-generation sequencing techniques. TPE genome sequences were compared to four genomes of TPA strains (Nichols, DAL-1, SS14, Chicago). The genome structure was identical in all three TPE strains with similar length ranging between 1,139,330 bp and 1,139,744 bp. No major genome rearrangements were found when compared to the four TPA genomes. The whole genome nucleotide divergence (dA) between TPA and TPE subspecies was 4.7 and 4.8 times higher than the observed nucleotide diversity (π) among TPA and TPE strains, respectively, corresponding to 99.8% identity between TPA and TPE genomes. A set of 97 (9.9%) TPE genes encoded proteins containing two or more amino acid replacements or other major sequence changes. The TPE divergent genes were mostly from the group encoding potential virulence factors and genes encoding proteins with unknown function. Conclusions/Significance Hypothetical genes, with genetic differences, consistently found between TPE and TPA strains are candidates for syphilitic treponemes virulence factors. Seventeen TPE genes were predicted under positive selection, and eleven of them coded either for predicted exported proteins or membrane proteins suggesting their possible association with the cell surface. Sequence changes between TPE and TPA strains and changes specific to individual strains represent suitable targets for subspecies- and strain-specific molecular diagnostics. PMID:22292095
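    The abstract compares within-group nucleotide diversity (π) with between-subspecies divergence. A minimal sketch of both quantities on already-aligned sequences follows; the function names and toy sequences are hypothetical, gaps and ambiguous bases are not handled, and the between-group value here is a raw mean pairwise distance, which may differ from the dA statistic the authors used.

```python
from itertools import combinations


def pairwise_difference(a, b):
    """Proportion of aligned sites at which two sequences differ."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("sequences must be aligned to the same non-zero length")
    return sum(1 for x, y in zip(a, b) if x != y) / len(a)


def nucleotide_diversity(seqs):
    """Mean pairwise per-site difference within one group (pi)."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(pairwise_difference(a, b) for a, b in pairs) / len(pairs)


def mean_between_group_distance(group1, group2):
    """Mean pairwise per-site difference between two groups."""
    if not group1 or not group2:
        raise ValueError("both groups must be non-empty")
    total = sum(pairwise_difference(a, b) for a in group1 for b in group2)
    return total / (len(group1) * len(group2))


if __name__ == "__main__":
    # Toy alignments standing in for two subspecies (not real data).
    group_a = ["ACGTACGTAC", "ACGTACGTAT"]
    group_b = ["ACGAACGTCC", "ACGAACGTCT"]
    print("pi (group A):", nucleotide_diversity(group_a))
    print("between groups:", mean_between_group_distance(group_a, group_b))
```

    On this scale, 99.8% identity between two genomes corresponds to a per-site difference of about 0.002.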

  3. The TORMOZ Gene Encodes a Nucleolar Protein Required for Regulated Division Planes and Embryo Development in Arabidopsis

    PubMed Central

    Griffith, Megan E.; Mayer, Ulrike; Capron, Arnaud; Ngo, Quy A.; Surendrarao, Anandkumar; McClinton, Regina; Jürgens, Gerd; Sundaresan, Venkatesan

    2007-01-01

    Embryogenesis in Arabidopsis thaliana is marked by a predictable sequence of oriented cell divisions, which precede cell fate determination. We show that mutation of the TORMOZ (TOZ) gene yields embryos with aberrant cell division planes and arrested embryos that appear not to have established normal patterning. The defects in toz mutants differ from previously described mutations that affect embryonic cell division patterns. Longitudinal division planes of the proembryo are frequently replaced by transverse divisions and less frequently by oblique divisions, while divisions of the suspensor cells, which divide only transversely, appear generally unaffected. Expression patterns of selected embryo patterning genes are altered in the mutant embryos, implying that the positional cues required for their proper expression are perturbed by the misoriented divisions. The TOZ gene encodes a nucleolar protein containing WD repeats. Putative TOZ orthologs exist in other eukaryotes including Saccharomyces cerevisiae, where the protein is predicted to function in 18S rRNA biogenesis. We find that disruption of the Sp TOZ gene results in cell division defects in Schizosaccharomyces pombe. Previous studies in yeast and animal cells have identified nucleolar proteins that regulate the exit from M phase and cytokinesis, including factors involved in pre-rRNA processing. Our study suggests that in plant cells, nucleolar functions might interact with the processes of regulated cell divisions and influence the selection of longitudinal division planes during embryogenesis. PMID:17616738

  4. Analysis of strain-specific genes in glutamic acid-producing Corynebacterium glutamicum ssp. lactofermentum AJ 1511.

    PubMed

    Nishio, Yousuke; Koseki, Chie; Tonouchi, Naoto; Matsui, Kazuhiko; Sugimoto, Shinichi; Usuda, Yoshihiro

    2017-07-11

    Strains of the bacterium, Corynebacterium glutamicum, are widely used for the industrial production of L-glutamic acid and various other substances. C. glutamicum ssp. lactofermentum AJ 1511, formerly classified as Brevibacterium lactofermentum, and the closely related C. glutamicum ATCC 13032 have been used as industrial strains for more than 50 years. We determined the whole genome sequence of C. glutamicum AJ 1511 and performed genome-wide comparative analysis with C. glutamicum ATCC 13032 to determine strain-specific genetic differences. This analysis revealed that the genomes of the two industrial strains are highly similar despite the phenotypic differences between the two strains. Both strains harbored unique genes but gene transpositions or inversions were not observed. The largest unique region, a 220-kb AT-rich region located between 1.78 and 2.00 Mb position in C. glutamicum ATCC 13032 genome, was missing in the genome of C. glutamicum AJ 1511. The next two largest unique regions were present in C. glutamicum AJ 1511. The first region (413-484 kb position) contains several predicted transport proteins, enzymes involved in sugar metabolism, and transposases. The second region (1.47-1.50 Mb position) encodes restriction modification systems. A gene predicted to encode NADH-dependent glutamate dehydrogenase, which is involved in L-glutamate biosynthesis, is present in C. glutamicum AJ 1511. Strain-specific genes identified in this study are likely to govern phenotypes unique to each strain.

  5. An Interspecies Comparative Analysis of the Predicted Secretomes of the Necrotrophic Plant Pathogens Sclerotinia sclerotiorum and Botrytis cinerea

    PubMed Central

    2015-01-01

    Phytopathogenic fungi form intimate associations with host plant species and cause disease. To be successful, fungal pathogens communicate with a susceptible host through the secretion of proteinaceous effectors, hydrolytic enzymes and metabolites. Sclerotinia sclerotiorum and Botrytis cinerea are economically important necrotrophic fungal pathogens that cause disease on numerous crop species. Here, a powerful bioinformatics pipeline was used to predict the refined S. sclerotiorum and B. cinerea secretomes, identifying 432 and 499 proteins respectively. Analyses focusing on S. sclerotiorum revealed that 16% of the secretome encoding genes resided in small, sequence heterogeneous, gene clusters that were distributed over 13 of the 16 predicted chromosomes. Functional analyses highlighted the importance of plant cell hydrolysis, oxidation-reduction processes and the redox state to the S. sclerotiorum and B. cinerea secretomes and potentially host infection. Only 8% of the predicted proteins were distinct between the two secretomes. In contrast to S. sclerotiorum, the B. cinerea secretome lacked CFEM- or LysM-containing proteins. The 115 fungal and oomycete genome comparison identified 30 proteins specific to S. sclerotiorum and B. cinerea, plus 11 proteins specific to S. sclerotiorum and 32 proteins specific to B. cinerea. Expressed sequence tag (EST) and proteomic analyses showed that 246 S. sclerotiorum secretome encoding genes had EST support, including 101 which were only expressed in vitro and 49 which were only expressed in planta, whilst 42 predicted proteins were experimentally proven to be secreted. These detailed in silico analyses of two important necrotrophic pathogens will permit informed choices to be made when candidate effector proteins are selected for function analyses in planta. PMID:26107498

  6. RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome.

    PubMed

    Wenger, Yvan; Galliot, Brigitte

    2013-03-25

    Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes led to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.

  7. RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome

    PubMed Central

    2013-01-01

    Background Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. Results To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48’909 unique sequences including splice variants, representing approximately 24’450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10’597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11’270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. Conclusions We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes led to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events. PMID:23530871

  8. Acetylcholinesterase of Rhipicephalus (Boophilus) microplus and Phlebotomus papatasi: Gene Identification, Expression, and Biochemical Properties of Recombinant Proteins

    DTIC Science & Technology

    2013-01-01

    predicted amino acid sequences of the three encoded BmAChEs were no more closely related to one another than AChEs from different organisms and their...solely on nucleotide and amino acid sequence similarity; however, the cholinesterase gene family contains a number of related enzymes and structural...acetylcholinesterase of P. papatasi was cloned, sequenced, and expressed in the baculovirus system to generate a recombinant enzyme for biochemical

  9. The BOS1 gene encodes an essential 27-kD putative membrane protein that is required for vesicular transport from the ER to the Golgi complex in yeast

    PubMed Central

    1991-01-01

    We recently described the identification of BOS1 (Newman, A., J. Shim, and S. Ferro-Novick. 1990. Mol. Cell. Biol. 10:3405-3414.). BOS1 is a gene that in multiple copy suppresses the growth and secretion defect of bet1 and sec22, two mutants that disrupt transport from the ER to the Golgi complex in yeast. The ability of BOS1 to specifically suppress mutants blocked at a particular stage of the secretory pathway suggested that this gene encodes a protein that functions in this process. The experiments presented in this study support this hypothesis. Specifically, the BOS1 gene was found to be essential for cellular growth. Furthermore, cells depleted of the Bos1 protein fail to transport pro-alpha-factor and carboxypeptidase Y (CPY) to the Golgi apparatus. This defect in export leads to the accumulation of an extensive network of ER and small vesicles. DNA sequence analysis predicts that Bos1 is a 27-kD protein containing a putative membrane-spanning domain. This prediction is supported by differential centrifugation experiments. Thus, Bos1 appears to be a membrane protein that functions in conjunction with Bet1 and Sec22 to facilitate the transport of proteins at a step subsequent to translocation into the ER but before entry into the Golgi apparatus. PMID:2007627

  10. A novel pathway for the biosynthesis of heme in Archaea: genome-based bioinformatic predictions and experimental evidence.

    PubMed

    Storbeck, Sonja; Rolfes, Sarah; Raux-Deery, Evelyne; Warren, Martin J; Jahn, Dieter; Layer, Gunhild

    2010-12-13

    Heme is an essential prosthetic group for many proteins involved in fundamental biological processes in all three domains of life. In Eukaryota and Bacteria heme is formed via a conserved and well-studied biosynthetic pathway. Surprisingly, in Archaea heme biosynthesis proceeds via an alternative route which is poorly understood. In order to formulate a working hypothesis for this novel pathway, we searched 59 completely sequenced archaeal genomes for the presence of gene clusters consisting of established heme biosynthetic genes and colocalized conserved candidate genes. Within the majority of archaeal genomes it was possible to identify such heme biosynthesis gene clusters. From this analysis we have been able to identify several novel heme biosynthesis genes that are restricted to archaea. Intriguingly, several of the encoded proteins display similarity to enzymes involved in heme d(1) biosynthesis. To initiate an experimental verification of our proposals two Methanosarcina barkeri proteins predicted to catalyze the initial steps of archaeal heme biosynthesis were recombinantly produced, purified, and their predicted enzymatic functions verified.

  11. A Novel Pathway for the Biosynthesis of Heme in Archaea: Genome-Based Bioinformatic Predictions and Experimental Evidence

    PubMed Central

    Storbeck, Sonja; Rolfes, Sarah; Raux-Deery, Evelyne; Warren, Martin J.; Jahn, Dieter; Layer, Gunhild

    2010-01-01

    Heme is an essential prosthetic group for many proteins involved in fundamental biological processes in all three domains of life. In Eukaryota and Bacteria heme is formed via a conserved and well-studied biosynthetic pathway. Surprisingly, in Archaea heme biosynthesis proceeds via an alternative route which is poorly understood. In order to formulate a working hypothesis for this novel pathway, we searched 59 completely sequenced archaeal genomes for the presence of gene clusters consisting of established heme biosynthetic genes and colocalized conserved candidate genes. Within the majority of archaeal genomes it was possible to identify such heme biosynthesis gene clusters. From this analysis we have been able to identify several novel heme biosynthesis genes that are restricted to archaea. Intriguingly, several of the encoded proteins display similarity to enzymes involved in heme d1 biosynthesis. To initiate an experimental verification of our proposals two Methanosarcina barkeri proteins predicted to catalyze the initial steps of archaeal heme biosynthesis were recombinantly produced, purified, and their predicted enzymatic functions verified. PMID:21197080

  12. Mollusk genes encoding lysine tRNA (UUU) contain introns.

    PubMed

    Matsuo, M; Abe, Y; Saruta, Y; Okada, N

    1995-11-20

    New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(UUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.

  13. Functional metagenomics reveals novel β-galactosidases not predictable from gene sequences.

    PubMed

    Cheng, Jiujun; Romantsov, Tatyana; Engel, Katja; Doxey, Andrew C; Rose, David R; Neufeld, Josh D; Charles, Trevor C

    2017-01-01

    The techniques of metagenomics have allowed researchers to access the genomic potential of uncultivated microbes, but there remain significant barriers to determination of gene function based on DNA sequence alone. Functional metagenomics, in which DNA is cloned and expressed in surrogate hosts, can overcome these barriers, and make important contributions to the discovery of novel enzymes. In this study, a soil metagenomic library carried in an IncP cosmid was used for functional complementation for β-galactosidase activity in both Sinorhizobium meliloti (α-Proteobacteria) and Escherichia coli (γ-Proteobacteria) backgrounds. One β-galactosidase, encoded by six overlapping clones that were selected in both hosts, was identified as a member of glycoside hydrolase family 2. We could not identify ORFs obviously encoding possible β-galactosidases in 19 other sequenced clones that were only able to complement S. meliloti. Based on low sequence identity to other known glycoside hydrolases, yet not β-galactosidases, three of these ORFs were examined further. Biochemical analysis confirmed that all three encoded β-galactosidase activity. Lac36W_ORF11 and Lac161_ORF7 had conserved domains, but lacked similarities to known glycoside hydrolases. Lac161_ORF10 had neither conserved domains nor similarity to known glycoside hydrolases. Bioinformatic and structural modeling implied that Lac161_ORF10 protein represented a novel enzyme family with a five-bladed propeller glycoside hydrolase domain. By discovering founding members of three novel β-galactosidase families, we have reinforced the value of functional metagenomics for isolating novel genes that could not have been predicted from DNA sequence analysis alone.

  14. Characterization of 17 chaperone-usher fimbriae encoded by Proteus mirabilis reveals strong conservation

    PubMed Central

    Kuan, Lisa; Schaffer, Jessica N.; Zouzias, Christos D.

    2014-01-01

    Proteus mirabilis is a Gram-negative enteric bacterium that causes complicated urinary tract infections, particularly in patients with indwelling catheters. Sequencing of clinical isolate P. mirabilis HI4320 revealed the presence of 17 predicted chaperone-usher fimbrial operons. We classified these fimbriae into three groups by their genetic relationship to other chaperone-usher fimbriae. Sixteen of these fimbriae are encoded by all seven currently sequenced P. mirabilis genomes. The predicted protein sequence of the major structural subunit for 14 of these fimbriae was highly conserved (≥95 % identity), whereas three other structural subunits (Fim3A, UcaA and Fim6A) were variable. Further examination of 58 clinical isolates showed that 14 of the 17 predicted major structural subunit genes of the fimbriae were present in most strains (>85 %). Transcription of the predicted major structural subunit genes for all 17 fimbriae was measured under different culture conditions designed to mimic conditions in the urinary tract. The majority of the fimbrial genes were induced during stationary phase, static culture or colony growth when compared to exponential-phase aerated culture. Major structural subunit proteins for six of these fimbriae were detected using MS of proteins sheared from the surface of broth-cultured P. mirabilis, demonstrating that this organism may produce multiple fimbriae within a single culture. The high degree of conservation of P. mirabilis fimbriae stands in contrast to uropathogenic Escherichia coli and Salmonella enterica, which exhibit greater variability in their fimbrial repertoires. These findings suggest there may be evolutionary pressure for P. mirabilis to maintain a large fimbrial arsenal. PMID:24809384

  15. Human AZU-1 gene, variants thereof and expressed gene products

    DOEpatents

    Chen, Huei-Mei; Bissell, Mina

    2004-06-22

    A human AZU-1 gene, mutants, variants and fragments thereof. Protein products encoded by the AZU-1 gene and homologs encoded by the variants of the AZU-1 gene acting as tumor suppressors or markers of malignancy progression and tumorigenicity reversion. Identification, isolation and characterization of AZU-1 and AZU-2 genes localized to a tumor suppressive locus at chromosome 10q26, highly expressed in nonmalignant and premalignant cells derived from a human breast tumor progression model. Recombinant full-length protein sequences encoded by the AZU-1 gene, and nucleotide sequences of the AZU-1 and AZU-2 genes and variants and fragments thereof. Monoclonal or polyclonal antibodies specific to the AZU-1 or AZU-2 encoded proteins and to homologs of the AZU-1 or AZU-2 encoded proteins.

  16. Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways.

    PubMed

    Chen, Lei; Zhang, Yu-Hang; Wang, ShaoPeng; Zhang, YunHua; Huang, Tao; Cai, Yu-Dong

    2017-01-01

    Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) method was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes that were predicted to be essential by our prediction model are provided in this study. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems.
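
    The Matthews correlation coefficient quoted in this record summarizes a binary classifier from its confusion-matrix counts. The sketch below shows the standard formula only; the function name and the counts are hypothetical and are not taken from the study, which reports the coefficient (0.951) but not the underlying counts.

```python
import math


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Return the Matthews correlation coefficient for a binary classifier.

    tp/tn/fp/fn are confusion-matrix counts. By a common convention,
    0.0 is returned when the denominator is zero (a row or column of
    the confusion matrix is empty).
    """
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Hypothetical counts for illustration only (essential = positive class).
perfect = matthews_corrcoef(tp=50, tn=50, fp=0, fn=0)     # every gene classified correctly
chance = matthews_corrcoef(tp=25, tn=25, fp=25, fn=25)    # no better than random
inverted = matthews_corrcoef(tp=0, tn=0, fp=50, fn=50)    # every gene classified wrongly
```

    The coefficient ranges from -1 (total disagreement) through 0 (chance level) to +1 (perfect prediction), which is why a value of 0.951 is described as nearly perfect.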

  17. Sequence and Expression Analyses of Ethylene Response Factors Highly Expressed in Latex Cells from Hevea brasiliensis

    PubMed Central

    Piyatrakul, Piyanuch; Yang, Meng; Putranto, Riza-Arief; Pirrello, Julien; Dessailly, Florence; Hu, Songnian; Summo, Marilyne; Theeravatanasuk, Kannikar; Leclercq, Julie; Kuswanhadi; Montoro, Pascal

    2014-01-01

    The AP2/ERF superfamily encodes transcription factors that play a key role in plant development and responses to abiotic and biotic stress. In Hevea brasiliensis, ERF genes have been identified by RNA sequencing. This study set out to validate the number of HbERF genes, and identify ERF genes involved in the regulation of latex cell metabolism. A comprehensive Hevea transcriptome was improved using additional RNA reads from reproductive tissues. Newly assembled contigs were annotated in the Gene Ontology database and were assigned to 3 main categories. The AP2/ERF superfamily is the third most represented compared with other transcription factor families. A comparison with genomic scaffolds led to an estimation of 114 AP2/ERF genes and 1 soloist in Hevea brasiliensis. Based on a phylogenetic analysis, functions were predicted for 26 HbERF genes. A relative transcript abundance analysis was performed by real-time RT-PCR in various tissues. Transcripts of ERFs from group I and VIII were very abundant in all tissues while those of group VII were highly accumulated in latex cells. Seven of the thirty-five ERF expression marker genes were highly expressed in latex. Subcellular localization and transactivation analyses suggested that HbERF-VII candidate genes encoded functional transcription factors. PMID:24971876

  18. Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement

    PubMed Central

    Blazier, J. Chris; Ruhlman, Tracey A.; Weng, Mao-Lun; Rehman, Sumaiyah K.; Sabir, Jamal S. M.; Jansen, Robert K.

    2016-01-01

    Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA. PMID:27087667
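
    The "30% nucleotide sequence identity" figure in this record is a pairwise identity over an alignment. As a minimal sketch, assuming two already-aligned sequences of equal length with "-" as the gap character, identity can be computed as below; the function name, the gap-handling convention (gapped columns are skipped) and the toy sequences are illustrative assumptions, not the authors' procedure.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of identical positions between two aligned sequences.

    Columns in which either sequence has a gap ('-') are skipped, so the
    denominator is the number of columns where both sequences have a base.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for base_a, base_b in zip(aligned_a.upper(), aligned_b.upper()):
        if base_a == "-" or base_b == "-":
            continue  # ignore gapped columns
        compared += 1
        if base_a == base_b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy alignment (hypothetical): 5 ungapped columns, 4 of them identical.
identity = percent_identity("ATGC-A", "ATGAAA")
```

    Other conventions (for example, counting gapped columns in the denominator) give lower values for the same alignment, so the convention should be stated whenever identities are compared.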

  19. Sequence and expression analyses of ethylene response factors highly expressed in latex cells from Hevea brasiliensis.

    PubMed

    Piyatrakul, Piyanuch; Yang, Meng; Putranto, Riza-Arief; Pirrello, Julien; Dessailly, Florence; Hu, Songnian; Summo, Marilyne; Theeravatanasuk, Kannikar; Leclercq, Julie; Kuswanhadi; Montoro, Pascal

    2014-01-01

    The AP2/ERF superfamily encodes transcription factors that play a key role in plant development and responses to abiotic and biotic stress. In Hevea brasiliensis, ERF genes have been identified by RNA sequencing. This study set out to validate the number of HbERF genes, and identify ERF genes involved in the regulation of latex cell metabolism. A comprehensive Hevea transcriptome was improved using additional RNA reads from reproductive tissues. Newly assembled contigs were annotated in the Gene Ontology database and were assigned to 3 main categories. The AP2/ERF superfamily is the third most represented compared with other transcription factor families. A comparison with genomic scaffolds led to an estimation of 114 AP2/ERF genes and 1 soloist in Hevea brasiliensis. Based on a phylogenetic analysis, functions were predicted for 26 HbERF genes. A relative transcript abundance analysis was performed by real-time RT-PCR in various tissues. Transcripts of ERFs from group I and VIII were very abundant in all tissues while those of group VII were highly accumulated in latex cells. Seven of the thirty-five ERF expression marker genes were highly expressed in latex. Subcellular localization and transactivation analyses suggested that HbERF-VII candidate genes encoded functional transcription factors.

  20. Evolution of Homospermidine Synthase in the Convolvulaceae: A Story of Gene Duplication, Gene Loss, and Periods of Various Selection Pressures

    PubMed Central

    Kaltenegger, Elisabeth; Eich, Eckart; Ober, Dietrich

    2013-01-01

    Homospermidine synthase (HSS), the first pathway-specific enzyme of pyrrolizidine alkaloid biosynthesis, is known to have its origin in the duplication of a gene encoding deoxyhypusine synthase. To study the processes that followed this gene duplication event and gave rise to HSS, we identified sequences encoding HSS and deoxyhypusine synthase from various species of the Convolvulaceae. We show that HSS evolved only once in this lineage. This duplication event was followed by several losses of a functional gene copy attributable to gene loss or pseudogenization. Statistical analyses of sequence data suggest that, in those lineages in which the gene copy was successfully recruited as HSS, the gene duplication event was followed by phases of various selection pressures, including purifying selection, relaxed functional constraints, and possibly positive Darwinian selection. Site-specific mutagenesis experiments have confirmed that the substitution of sites predicted to be under positive Darwinian selection is sufficient to convert a deoxyhypusine synthase into a HSS. In addition, analyses of transcript levels have shown that HSS and deoxyhypusine synthase have also diverged with respect to their regulation. The impact of protein–protein interaction on the evolution of HSS is discussed with respect to current models of enzyme evolution. PMID:23572540

  1. The mitochondrial gene encoding ribosomal protein S12 has been translocated to the nuclear genome in Oenothera.

    PubMed Central

    Grohmann, L; Brennicke, A; Schuster, W

    1992-01-01

    The Oenothera mitochondrial genome contains only a gene fragment for ribosomal protein S12 (rps12), while other plants encode a functional gene in the mitochondrion. The complete Oenothera rps12 gene is located in the nucleus. The transit sequence necessary to target this protein to the mitochondrion is encoded by a 5'-extension of the open reading frame. Comparison of the amino acid sequence encoded by the nuclear gene with the polypeptides encoded by edited mitochondrial cDNA and genomic sequences of other plants suggests that gene transfer between mitochondrion and nucleus started from edited mitochondrial RNA molecules. Mechanisms and requirements of gene transfer and activation are discussed. PMID:1454526

  2. Computational analyses of mammalian lactate dehydrogenases: human, mouse, opossum and platypus LDHs.

    PubMed

    Holmes, Roger S; Goldberg, Erwin

    2009-10-01

    Computational methods were used to predict the amino acid sequences and gene locations for mammalian lactate dehydrogenase (LDH) genes and proteins using genome sequence databanks. Human LDHA, LDHC and LDH6A genes were located in tandem on chromosome 11, while LDH6B and LDH6C genes were on chromosomes 15 and 12, respectively. Opossum LDHC and LDH6B genes were located in tandem with the opossum LDHA gene on chromosome 5 and contained 7 (LDHA and LDHC) or 8 (LDH6B) exons. An amino acid sequence prediction for the opossum LDH6B subunit gave an extended N-terminal sequence, similar to the human and mouse LDH6B sequences, which may support the export of this enzyme into mitochondria. The platypus genome contained at least 3 LDH genes encoding LDHA, LDHB and LDH6B subunits. Phylogenetic studies and sequence analyses indicated that LDHA, LDHB and LDH6B genes are present in all mammalian genomes examined, including a monotreme species (platypus), whereas the LDHC gene may have arisen more recently in marsupial mammals.

  3. Computational analyses of mammalian lactate dehydrogenases: human, mouse, opossum and platypus LDHs

    PubMed Central

    Holmes, Roger S; Goldberg, Erwin

    2009-01-01

    Computational methods were used to predict the amino acid sequences and gene locations for mammalian lactate dehydrogenase (LDH) genes and proteins using genome sequence databanks. Human LDHA, LDHC and LDH6A genes were located in tandem on chromosome 11, while LDH6B and LDH6C genes were on chromosomes 15 and 12, respectively. Opossum LDHC and LDH6B genes were located in tandem with the opossum LDHA gene on chromosome 5 and contained 7 (LDHA and LDHC) or 8 (LDH6B) exons. An amino acid sequence prediction for the opossum LDH6B subunit gave an extended N-terminal sequence, similar to the human and mouse LDH6B sequences, which may support the export of this enzyme into mitochondria. The platypus genome contained at least 3 LDH genes encoding LDHA, LDHB and LDH6B subunits. Phylogenetic studies and sequence analyses indicated that LDHA, LDHB and LDH6B genes are present in all mammalian genomes examined, including a monotreme species (platypus), whereas the LDHC gene may have arisen more recently in marsupial mammals. PMID:19679512

  4. Consistency of gene starts among Burkholderia genomes

    PubMed Central

    2011-01-01

    Background Evolutionary divergence in the position of the translational start site among orthologous genes can have significant functional impacts. Divergence can alter the translation rate, degradation rate, subcellular location, and function of the encoded proteins. Results Existing GenBank gene maps for Burkholderia genomes suggest that extensive divergence has occurred: 53% of ortholog sets based on GenBank gene maps had inconsistent gene start sites. However, most of these inconsistencies appear to be gene-calling errors. Evolutionary divergence was the most plausible explanation for only 17% of the ortholog sets. Correcting probable errors in the GenBank gene maps decreased the percentage of ortholog sets with inconsistent starts by 68%, increased the percentage of ortholog sets with extractable upstream intergenic regions by 32%, increased the sequence similarity of intergenic regions and predicted proteins, and increased the number of proteins with identifiable signal peptides. Conclusions Our findings highlight an emerging problem in comparative genomics: single-digit percent errors in gene predictions can lead to double-digit percentages of inconsistent ortholog sets. The work demonstrates a simple approach to evaluate and improve the quality of gene maps. PMID:21342528
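
    The statistic in this record (the share of ortholog sets with inconsistent start sites) can be illustrated with a small sketch. It assumes each gene's annotated start has already been mapped to a column of a shared alignment, so that equal column numbers mean equivalent starts; the data layout, names and values are hypothetical and do not reproduce the authors' pipeline.

```python
from typing import Dict


def inconsistent_fraction(ortholog_sets: Dict[str, Dict[str, int]]) -> float:
    """Fraction of ortholog sets whose members do not share one start column.

    ortholog_sets maps a set identifier to {genome name: start column},
    where the start column is the position of the annotated start codon
    in a shared multiple alignment (an assumed preprocessing step).
    """
    if not ortholog_sets:
        return 0.0
    inconsistent = sum(
        1 for starts in ortholog_sets.values() if len(set(starts.values())) > 1
    )
    return inconsistent / len(ortholog_sets)


# Hypothetical example with three ortholog sets across three genomes.
example_sets = {
    "set_1": {"genome_A": 10, "genome_B": 10, "genome_C": 10},  # consistent
    "set_2": {"genome_A": 4, "genome_B": 25, "genome_C": 4},    # one divergent start
    "set_3": {"genome_A": 1, "genome_B": 7, "genome_C": 13},    # all starts differ
}
fraction = inconsistent_fraction(example_sets)
```

    A single mis-called start is enough to mark a whole set as inconsistent, which is one way a small per-gene error rate can produce a much larger per-set inconsistency rate, as the abstract notes.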

  5. Digital Gene Expression Analysis Provides Insight into the Transcript Profile of the Genes Involved in Aporphine Alkaloid Biosynthesis in Lotus (Nelumbo nucifera)

    PubMed Central

    Yang, Mei; Zhu, Lingping; Li, Ling; Li, Juanjuan; Xu, Liming; Feng, Ji; Liu, Yanling

    2017-01-01

    The predominant alkaloids in lotus leaves are aporphine alkaloids. These are the most important active components and have many pharmacological properties, but little is known about their biosynthesis. We used digital gene expression (DGE) technology to identify differentially-expressed genes (DEGs) between two lotus cultivars with different alkaloid contents at four leaf development stages. We also predicted potential genes involved in aporphine alkaloid biosynthesis by weighted gene co-expression network analysis (WGCNA). Approximately 335 billion nucleotides were generated, 94% of which were aligned against the reference genome. Of 22 thousand expressed genes, 19,000 were differentially expressed between the two cultivars at the four stages. Gene Ontology (GO) enrichment analysis revealed that catalytic activity and oxidoreductase activity were enriched significantly in most pairwise comparisons. In Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis, dozens of DEGs were assigned to the categories of biosynthesis of secondary metabolites, isoquinoline alkaloid biosynthesis, and flavonoid biosynthesis. The genes encoding norcoclaurine synthase (NCS), norcoclaurine 6-O-methyltransferase (6OMT), coclaurine N-methyltransferase (CNMT), N-methylcoclaurine 3′-hydroxylase (NMCH), and 3′-hydroxy-N-methylcoclaurine 4′-O-methyltransferase (4′OMT) in the common pathways of benzylisoquinoline alkaloid biosynthesis and the ones encoding corytuberine synthase (CTS) in aporphine alkaloid biosynthetic pathway, which have been characterized in other plants, were identified in lotus. These genes had positive effects on alkaloid content, albeit with phenotypic lag. The WGCNA of DEGs revealed that one network module was associated with the dynamic change of alkaloid content. Eleven genes encoding proteins with methyltransferase, oxidoreductase and CYP450 activities were identified. These were surmised to be genes involved in aporphine alkaloid biosynthesis. This transcriptomic database provides new directions for future studies on clarifying the aporphine alkaloid pathway. PMID:28197160

  6. The clpB gene of Bifidobacterium breve UCC 2003: transcriptional analysis and first insights into stress induction.

    PubMed

    Ventura, Marco; Kenny, John G; Zhang, Ziding; Fitzgerald, Gerald F; van Sinderen, Douwe

    2005-09-01

    The so-called clp genes, which encode components of the Clp proteolytic complex, are widespread among bacteria. The Bifidobacterium breve UCC 2003 genome contains a clpB gene with significant homology to predicted clpB genes from other members of the Actinobacteridae group. The heat- and osmotic-inducibility of the B. breve UCC 2003 clpB homologue was verified by slot-blot analysis, while Northern blot and primer extension analyses showed that the clpB gene is transcribed as a monocistronic unit with a single promoter. The role of a hspR homologue, known to control the regulation of clpB and dnaK gene expression in other high G+C content bacteria, was investigated by gel mobility shift assays. Moreover, the predicted 3D structure of HspR provides further insight into the binding mode of this protein to the clpB promoter region, and highlights the key amino acid residues believed to be involved in the protein-DNA interaction.

  7. Three reasons protein disorder analysis makes more sense in the light of collagen

    PubMed Central

    Oates, Matt E.; Tompa, Peter; Gough, Julian

    2016-01-01

    We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen-encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that exons encoding disordered regions are more likely to be symmetric, are due to the abundance of the collagen helix. Other related results, showing increased levels of alternative splicing in disorder-encoding exons, still hold after considering collagen-containing proteins. Aside from analyses of exons, we find that the set of proteins that contain collagen significantly alters the amino acid composition of regions predicted as disordered. We conclude that research in this area should be conducted in the light of the collagen helix. PMID:26941008

  8. Unbiased View of Synaptic and Neuronal Gene Complement in Ctenophores: Are There Pan-neuronal and Pan-synaptic Genes across Metazoa?

    PubMed Central

    Moroz, Leonid L.; Kohn, Andrea B.

    2015-01-01

    Hypotheses of origins and evolution of neurons and synapses are controversial, mostly due to limited comparative data. Here, we investigated the genome-wide distribution of the bilaterian “synaptic” and “neuronal” protein-coding genes in non-bilaterian basal metazoans (Ctenophora, Porifera, Placozoa, and Cnidaria). First, there are no recognized genes uniquely expressed in neurons across all metazoan lineages. None of the so-called pan-neuronal genes such as embryonic lethal abnormal vision (ELAV), Musashi, or Neuroglobin are expressed exclusively in neurons of the ctenophore Pleurobrachia. Second, our comparative analysis of about 200 genes encoding canonical presynaptic and postsynaptic proteins in bilaterians suggests that there are no true “pan-synaptic” genes or genes uniquely and specifically attributed to all classes of synapses. The majority of these genes encode receptive and secretory complexes in a broad spectrum of eukaryotes. Trichoplax (Placozoa), an organism without neurons and synapses, has more orthologs of bilaterian synapse-related/neuron-related genes than do ctenophores, the group with well-developed neuronal and synaptic organization. Third, the majority of genes encoding ion channels and ionotropic receptors are broadly expressed in unicellular eukaryotes and non-neuronal tissues in metazoans. Therefore, they cannot be viewed as neuronal markers. Nevertheless, the co-expression of multiple types of ion channels and receptors does correlate with the presence of neural and synaptic organization. As an illustrative example, the ctenophore genomes encode a greater diversity of ion channels and ionotropic receptors compared with the genomes of the placozoan Trichoplax and the demosponge Amphimedon. Surprisingly, both placozoans and sponges have a similar number of orthologs of “synaptic” proteins as we identified in the genomes of two ctenophores. Ctenophores have a distinct synaptic organization compared with other animals. Our analysis of transcriptomes from 10 different ctenophores did not detect recognized orthologs of synthetic enzymes encoding several classical, low-molecular-weight (neuro)transmitters; glutamate signaling machinery is one of the few exceptions. Novel peptidergic signaling molecules were predicted for ctenophores, together with the diversity of putative receptors including SCNN1/amiloride-sensitive sodium channel-like channels, many of which could be examples of a lineage-specific expansion within this group. In summary, our analysis supports the hypothesis of independent evolution of neurons and, as corollary, a parallel evolution of synapses. We suggest that the formation of synaptic machinery might occur more than once over 600 million years of animal evolution. PMID:26454853

  9. Protein encoding genes in an ancient plant: analysis of codon usage, retained genes and splice sites in a moss, Physcomitrella patens

    PubMed Central

    Rensing, Stefan A; Fritzowsky, Dana; Lang, Daniel; Reski, Ralf

    2005-01-01

    Background: The moss Physcomitrella patens is an emerging plant model system due to its high rate of homologous recombination, haploidy, simple body plan, physiological properties as well as phylogenetic position. Available EST data was clustered and assembled, and provided the basis for a genome-wide analysis of protein encoding genes. Results: We have clustered and assembled Physcomitrella patens EST and CDS data in order to represent the transcriptome of this non-seed plant. Clustering of the publicly available data and subsequent prediction resulted in a total of 19,081 non-redundant ORFs. Of these putative transcripts, approximately 30% have a homolog in both the rice and Arabidopsis transcriptomes. More than 130 transcripts are not present in seed plants but can be found in other kingdoms. These potential "retained genes" might have been lost during seed plant evolution. Functional annotation of these genes reveals unequal distribution among taxonomic groups and intriguing putative functions such as cytotoxicity and nucleic acid repair. Whereas introns in the moss are larger on average than in the seed plant Arabidopsis thaliana, the position and number of introns are approximately the same. Contrary to Arabidopsis, where CDS contain on average 44% G/C, in Physcomitrella the average G/C content is 50%. Interestingly, moss orthologs of Arabidopsis genes show a significant drift of codon fraction usage, towards the seed plant. While averaged codon bias is the same in Physcomitrella and Arabidopsis, the distribution pattern is different, with 15% of moss genes being unbiased. Species-specific, sensitive and selective splice site prediction for Physcomitrella has been developed using a dataset of 368 donor and acceptor sites, utilizing a support vector machine. The prediction accuracy is better than that achieved with tools trained on Arabidopsis data.
Conclusion: Analysis of the moss transcriptome displays differences in gene structure, codon and splice site usage in comparison with the seed plant Arabidopsis. Putative retained genes exhibit possible functions that might explain the peculiar physiological properties of mosses. Both the transcriptome representation (including a BLAST and retrieval service) and splice site prediction have been made available on , setting the basis for assembly and annotation of the Physcomitrella genome, of which draft shotgun sequences will become available in 2005. PMID:15784153
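
The average G/C content of coding sequences quoted in the record above (44% versus 50%) is a simple per-sequence statistic. The following is a minimal sketch of how such a figure could be computed, not the authors' pipeline; the function names `gc_fraction` and `mean_gc` and the toy sequences are hypothetical and chosen only for illustration.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T) of one sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return gc / (gc + at)


def mean_gc(cds_list):
    """Unweighted mean of per-sequence G/C fractions (each CDS counts once)."""
    if not cds_list:
        raise ValueError("no sequences given")
    return sum(gc_fraction(s) for s in cds_list) / len(cds_list)


if __name__ == "__main__":
    # Toy sequences, invented for the example; real input would be predicted CDS.
    toy_cds = ["ATGC", "GGCC"]
    print(f"mean G/C content: {mean_gc(toy_cds):.2%}")
```

Averaging per sequence, as here, weights every CDS equally; pooling all bases before dividing would instead weight long sequences more heavily. The record does not say which convention was used.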

  10. Cytochrome P460 Genes from the Methanotroph Methylococcus capsulatus Bath

    PubMed Central

    Bergmann, David J.; Zahn, James A.; Hooper, Alan B.; DiSpirito, Alan A.

    1998-01-01

    P460 cytochromes catalyze the oxidation of hydroxylamine to nitrite. They have been isolated from the ammonia-oxidizing bacterium Nitrosomonas europaea (R. H. Erickson and A. B. Hooper, Biochim. Biophys. Acta 275:231–244, 1972) and the methane-oxidizing bacterium Methylococcus capsulatus Bath (J. A. Zahn et al., J. Bacteriol. 176:5879–5887, 1994). A degenerate oligonucleotide probe was synthesized based on the N-terminal amino acid sequence of cytochrome P460 and used to identify a DNA fragment from M. capsulatus Bath that contains cyp, the gene encoding cytochrome P460. cyp is part of a gene cluster that contains three open reading frames (ORFs), the first predicted to encode a 59,000-Da membrane-bound polypeptide, the second predicted to encode a 12,000-Da periplasmic protein, and the third (cyp) encoding cytochrome P460. The products of the first two ORFs have no apparent similarity to any proteins in the GenBank database. The overall sequence similarity of the P460 cytochromes from M. capsulatus Bath and N. europaea was low (24.3% of residues identical), although short regions of conserved residues are present in the two proteins. Both cytochromes have a C-terminal, c-heme binding motif (CXXCH) and a conserved lysine residue (K61) that may provide an additional covalent cross-link to the heme (D. M. Arciero and A. B. Hooper, FEBS Lett. 410:457–460, 1997). Gene probing using cyp indicated that a cytochrome P460 similar to that from M. capsulatus Bath may be present in the type II methanotrophs Methylosinus trichosporium OB3b and Methylocystis parvus OBBP but not in the type I methanotrophs Methylobacter marinus A45, Methylomicrobium albus BG8, and Methylomonas sp. strains MN and MM2. Immunoblot analysis with antibodies against cytochrome P460 from M. capsulatus Bath indicated that the expression level of cytochrome P460 was not affected either by expression of the two different methane monooxygenases or by addition of ammonia to the culture medium. 
PMID:9851984
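
The c-heme binding motif mentioned in the record above (CXXCH, where X is any residue) can be located in a protein sequence with a short pattern search. This is a minimal sketch under stated assumptions: the function name `find_cxxch` and the example peptide are hypothetical, and the scan reports every occurrence rather than only C-terminal ones.

```python
import re

# C, any two residues, C, H. The lookahead lets overlapping motifs be reported too.
HEME_C_MOTIF = re.compile(r"(?=(C[A-Z]{2}CH))")


def find_cxxch(protein: str):
    """Return (1-based start position, matched motif) for each CXXCH occurrence."""
    protein = protein.upper()
    return [(m.start() + 1, m.group(1)) for m in HEME_C_MOTIF.finditer(protein)]


if __name__ == "__main__":
    # Invented peptide for illustration; not a real cytochrome P460 sequence.
    print(find_cxxch("MKACAACHGG"))
```

Positions are reported 1-based to match the usual residue numbering convention.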

  11. Identification of a hybrid PKS-NRPS required for the biosynthesis of NG-391 in Metarhizium anisopliae var. anisopliae

    USDA-ARS's Scientific Manuscript database

    A 19,818 kb genomic region harboring six predicted ORFs was identified in M. anisopliae ARSEF 2575. ORF4, putatively encoding a hybrid polyketide synthase-nonribosomal peptide synthetase (PKS-NRPS) was targeted using Agrobacterium-mediated gene knockout. Homologous recombinants failed to produce det...

  12. H2O2 Production in Species of the Lactobacillus acidophilus Group: a Central Role for a Novel NADH-Dependent Flavin Reductase

    PubMed Central

    Hertzberger, Rosanne; Arents, Jos; Dekker, Henk L.; Pridmore, R. David; Gysler, Christof; Kleerebezem, Michiel

    2014-01-01

    Hydrogen peroxide production is a well-known trait of many bacterial species associated with the human body. In the presence of oxygen, the probiotic lactic acid bacterium Lactobacillus johnsonii NCC 533 excretes up to 1 mM H2O2, inducing growth stagnation and cell death. Disruption of genes commonly assumed to be involved in H2O2 production (e.g., pyruvate oxidase, NADH oxidase, and lactate oxidase) did not affect this. Here we describe the purification of a novel NADH-dependent flavin reductase encoded by two highly similar genes (LJ_0548 and LJ_0549) that are conserved in lactobacilli belonging to the Lactobacillus acidophilus group. The genes are predicted to encode two 20-kDa proteins containing flavin mononucleotide (FMN) reductase conserved domains. Reductase activity requires FMN, flavin adenine dinucleotide (FAD), or riboflavin and is specific for NADH and not NADPH. The Km for FMN is 30 ± 8 μM, in accordance with its proposed in vivo role in H2O2 production. Deletion of the encoding genes in L. johnsonii led to a 40-fold reduction of hydrogen peroxide formation. H2O2 production in this mutant could only be restored by in trans complementation of both genes. Our work identifies a novel, conserved NADH-dependent flavin reductase that is prominently involved in H2O2 production in L. johnsonii. PMID:24487531

  13. CYC2 encodes a factor involved in mitochondrial import of yeast cytochrome c.

    PubMed Central

    Dumont, M E; Schlichter, J B; Cardillo, T S; Hayes, M K; Bethlendy, G; Sherman, F

    1993-01-01

    The gene CYC2 from the yeast Saccharomyces cerevisiae was previously shown to affect levels of mitochondrial cytochrome c by acting at a posttranslational step in cytochrome c biosynthesis. We report here the cloning and identification of the CYC2 gene product as a protein involved in import of cytochrome c into mitochondria. CYC2 encodes a 168-amino-acid open reading frame with at least two potential transmembrane segments. Antibodies against a synthetic peptide corresponding to the carboxyl terminus of the predicted sequence were raised. These antibodies recognize multiple bands on immunoblots of mitochondrial extracts. The intensities of these bands vary according to the gene dosage of CYC2 in various isogenic strains. Immunoblotting of subcellular fractions suggests that the CYC2 gene product is a mitochondrial protein. Deletion of CYC2 leads to accumulation of apocytochrome c in the cytoplasm. However, strains with deletions of this gene still import low levels of cytochrome c into mitochondria. The effects of cyc2 mutations are more pronounced in rho- strains than in rho+ strains, even though rho- strains that are CYC2+ contain normal levels of holocytochrome c. cyc2 mutations affect levels of iso-1-cytochrome c more than they do levels of iso-2-cytochrome c, apparently because of the greater susceptibility of apo-iso-1-cytochrome c to degradation in the cytoplasm. We propose that CYC2 encodes a factor that increases the efficiency of cytochrome c import into mitochondria. PMID:8413243

  14. Complete genome sequence of Streptococcus troglodytae TKU31 isolated from the oral cavity of a chimpanzee (Pan troglodytes).

    PubMed

    Okamoto, Masaaki; Naito, Mariko; Miyanohara, Mayu; Imai, Susumu; Nomura, Yoshiaki; Saito, Wataru; Momoi, Yasuko; Takada, Kazuko; Miyabe-Nishiwaki, Takako; Tomonaga, Masaki; Hanada, Nobuhiro

    2016-12-01

    Streptococcus troglodytae TKU31 was isolated from the oral cavity of a chimpanzee (Pan troglodytes) and was found to be the most closely related species of the mutans group streptococci to Streptococcus mutans. The complete sequence of TKU31 genome consists of a single circular chromosome that is 2,097,874 base pairs long and has a G + C content of 37.18%. It possesses 2082 coding sequences (CDSs), 65 tRNAs and five rRNA operons (15 rRNAs). Two clustered regularly interspaced short palindromic repeats, six insertion sequences and two predicted prophage elements were identified. The genome of TKU31 harbors some putative virulence associated genes, including gtfB, gtfC and gtfD genes encoding glucosyltransferase and gbpA, gbpB, gbpC and gbpD genes encoding glucan-binding cell wall-anchored protein. The deduced amino acid identity of the rhamnose-glucose polysaccharide F gene (rgpF), which is one of the serotype determinants, is 91% identical with that of S. mutans LJ23 (serotype k) strain. However, two other virulence-associated genes cnm and cbm, which encode the collagen-binding proteins, were not found in the TKU31 genome. The complete genome sequence of S. troglodytae TKU31 has been deposited at DDBJ/European Nucleotide Archive/GenBank under the accession no. AP014612. © 2016 The Societies and John Wiley & Sons Australia, Ltd.

  15. Identification and Characterization of LFD-2, a Predicted Fringe Protein Required for Membrane Integrity during Cell Fusion in Neurospora crassa

    PubMed Central

    Palma-Guerrero, Javier; Zhao, Jiuhai; Gonçalves, A. Pedro; Starr, Trevor L.

    2015-01-01

    The molecular mechanisms of membrane merger during somatic cell fusion in eukaryotic species are poorly understood. In the filamentous fungus Neurospora crassa, somatic cell fusion occurs between genetically identical germinated asexual spores (germlings) and between hyphae to form the interconnected network characteristic of a filamentous fungal colony. In N. crassa, two proteins have been identified to function at the step of membrane fusion during somatic cell fusion: PRM1 and LFD-1. The absence of either one of these two proteins results in an increase of germling pairs arrested during cell fusion with tightly appressed plasma membranes and an increase in the frequency of cell lysis of adhered germlings. The level of cell lysis in ΔPrm1 or Δlfd-1 germlings is dependent on the extracellular calcium concentration. An available transcriptional profile data set was used to identify genes encoding predicted transmembrane proteins that showed reduced expression levels in germlings cultured in the absence of extracellular calcium. From these analyses, we identified a mutant (lfd-2, for late fusion defect-2) that showed a calcium-dependent cell lysis phenotype. lfd-2 encodes a protein with a Fringe domain and showed endoplasmic reticulum and Golgi membrane localization. The deletion of an additional gene predicted to encode a low-affinity calcium transporter, fig1, also resulted in a strain that showed a calcium-dependent cell lysis phenotype. Genetic analyses showed that LFD-2 and FIG1 likely function in separate pathways to regulate aspects of membrane merger and repair during cell fusion. PMID:25595444

  16. A Global Overview of the Genetic and Functional Diversity in the Helicobacter pylori cag Pathogenicity Island

    PubMed Central

    Moodley, Yoshan; Uhr, Markus; Stamer, Christiana; Vauterin, Marc; Suerbaum, Sebastian; Achtman, Mark

    2010-01-01

    The Helicobacter pylori cag pathogenicity island (cagPAI) encodes a type IV secretion system. Humans infected with cagPAI–carrying H. pylori are at increased risk for sequelae such as gastric cancer. Housekeeping genes in H. pylori show considerable genetic diversity; but the diversity of virulence factors such as the cagPAI, which transports the bacterial oncogene CagA into host cells, has not been systematically investigated. Here we compared the complete cagPAI sequences for 38 representative isolates from all known H. pylori biogeographic populations. Their gene content and gene order were highly conserved. The phylogeny of most cagPAI genes was similar to that of housekeeping genes, indicating that the cagPAI was probably acquired only once by H. pylori, and its genetic diversity reflects the isolation by distance that has shaped this bacterial species since modern humans migrated out of Africa. Most isolates induced IL-8 release in gastric epithelial cells, indicating that the function of the Cag secretion system has been conserved despite some genetic rearrangements. More than one third of cagPAI genes, in particular those encoding cell-surface exposed proteins, showed signatures of diversifying (Darwinian) selection at more than 5% of codons. Several unknown gene products predicted to be under Darwinian selection are also likely to be secreted proteins (e.g. HP0522, HP0535). One of these, HP0535, is predicted to code for either a new secreted candidate effector protein or a protein which interacts with CagA because it contains two genetic lineages, similar to cagA. Our study provides a resource that can guide future research on the biological roles and host interactions of cagPAI proteins, including several whose function is still unknown. PMID:20808891

  17. Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

    PubMed Central

    Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

    1994-01-01

    To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378

  18. A global overview of the genetic and functional diversity in the Helicobacter pylori cag pathogenicity island.

    PubMed

    Olbermann, Patrick; Josenhans, Christine; Moodley, Yoshan; Uhr, Markus; Stamer, Christiana; Vauterin, Marc; Suerbaum, Sebastian; Achtman, Mark; Linz, Bodo

    2010-08-19

    The Helicobacter pylori cag pathogenicity island (cagPAI) encodes a type IV secretion system. Humans infected with cagPAI-carrying H. pylori are at increased risk for sequelae such as gastric cancer. Housekeeping genes in H. pylori show considerable genetic diversity; but the diversity of virulence factors such as the cagPAI, which transports the bacterial oncogene CagA into host cells, has not been systematically investigated. Here we compared the complete cagPAI sequences for 38 representative isolates from all known H. pylori biogeographic populations. Their gene content and gene order were highly conserved. The phylogeny of most cagPAI genes was similar to that of housekeeping genes, indicating that the cagPAI was probably acquired only once by H. pylori, and its genetic diversity reflects the isolation by distance that has shaped this bacterial species since modern humans migrated out of Africa. Most isolates induced IL-8 release in gastric epithelial cells, indicating that the function of the Cag secretion system has been conserved despite some genetic rearrangements. More than one third of cagPAI genes, in particular those encoding cell-surface exposed proteins, showed signatures of diversifying (Darwinian) selection at more than 5% of codons. Several unknown gene products predicted to be under Darwinian selection are also likely to be secreted proteins (e.g. HP0522, HP0535). One of these, HP0535, is predicted to code for either a new secreted candidate effector protein or a protein which interacts with CagA because it contains two genetic lineages, similar to cagA. Our study provides a resource that can guide future research on the biological roles and host interactions of cagPAI proteins, including several whose function is still unknown.

  19. Isolation and characterization of the genes for two small RNAs of herpesvirus papio and their comparison with Epstein-Barr virus-encoded EBER RNAs.

    PubMed Central

    Howe, J G; Shu, M D

    1988-01-01

    Genes for the Epstein-Barr virus-encoded RNAs (EBERs), two low-molecular-weight RNAs encoded by the human gammaherpesvirus Epstein-Barr virus (EBV), hybridize to two small RNAs in a baboon cell line that contains a similar virus, herpesvirus papio (HVP). The genes for the HVP RNAs (HVP-1 and HVP-2) are located together in the small unique region at the left end of the viral genome and are transcribed by RNA polymerase III in a rightward direction, similar to the EBERs. There is significant similarity between EBER1 and HVP-1 RNA, except for an insert of 22 nucleotides which increases the length of HVP-1 RNA to 190 nucleotides. There is less similarity between the sequences of EBER2 and HVP-2 RNA, but both have a length of about 170 nucleotides. The predicted secondary structure of each HVP RNA is remarkably similar to that of the respective EBER, implying that the secondary structures are important for function. Upstream from the initiation sites of all four RNA genes are several highly conserved sequences which may function in the regulation of transcription. The HVP RNAs, together with the EBERs, are highly abundant in transformed cells and are efficiently bound by the cellular La protein. PMID:2839701

  20. Distribution of genetic polymorphisms of genes encoding drug metabolizing enzymes & drug transporters - a review with Indian perspective.

    PubMed

    Umamaheswaran, Gurusamy; Kumar, Dhakchinamoorthi Krishna; Adithan, Chandrasekaran

    2014-01-01

    Phase I and II drug metabolizing enzymes (DME) and drug transporters are involved in the absorption, distribution, metabolism as well as elimination of many therapeutic agents, toxins and various pollutants. Presence of genetic polymorphisms in genes encoding these proteins has been associated with marked inter-individual variability in their activity that could result in variation in drug response, toxicity as well as in disease predisposition. The emergent field of pharmacogenetics and pharmacogenomics (PGx) is a promising discipline, as it supports prediction of disease risk, selection of proper medication with regard to response and toxicity, and appropriate drug dosage guidance based on an individual's genetic make-up. Consequently, genetic variations are essential to understand the ethnic differences in disease occurrence, development, prognosis, therapeutic response and toxicity. For that reason, it is necessary to establish the normative frequency of these genes in a particular population before unraveling the genotype-phenotype associations. Although a fair amount of allele frequency data are available in Indian populations, the existing pharmacogenetic data have not been compiled into a database. This review was intended to compile the normative frequency distribution of the variants of genes encoding DMEs (CYP450s, TPMT, GSTs, COMT, SULT1A1, NAT2 and UGTs) and transporter proteins (MDR1, OCT1 and SLCO1B1) from an Indian perspective.

  1. Export of l-Isoleucine from Corynebacterium glutamicum: a Two-Gene-Encoded Member of a New Translocator Family

    PubMed Central

    Kennerknecht, Nicole; Sahm, Hermann; Yen, Ming-Ren; Pátek, Miroslav; Saier, Jr., Milton H.; Eggeling, Lothar

    2002-01-01

    Bacteria possess amino acid export systems, and Corynebacterium glutamicum excretes l-isoleucine in a process dependent on the proton motive force. In order to identify the system responsible for l-isoleucine export, we have used transposon mutagenesis to isolate mutants of C. glutamicum sensitive to the peptide isoleucyl-isoleucine. In one such mutant, strong peptide sensitivity resulted from insertion into a gene designated brnF encoding a hydrophobic protein predicted to possess seven transmembrane spanning helices. brnE is located downstream of brnF and encodes a second hydrophobic protein with four putative membrane-spanning helices. A mutant deleted of both genes no longer exports l-isoleucine, whereas an overexpressing strain exports this amino acid at an increased rate. BrnF and BrnE together are also required for the export of l-leucine and l-valine. BrnFE is thus a two-component export permease specific for aliphatic hydrophobic amino acids. Upstream of brnFE and transcribed divergently is an Lrp-like regulatory gene required for active export. Searches for homologues of BrnFE show that this type of exporter is widespread in prokaryotes but lacking in eukaryotes and that both gene products which together comprise the members of a novel family, the LIV-E family, generally map together within a single operon. Comparisons of the BrnF and BrnE phylogenetic trees show that gene duplication events in the early bacterial lineage gave rise to multiple paralogues that have been retained in α-proteobacteria but not in other prokaryotes analyzed. PMID:12081967

  2. Cyclomodulins in Urosepsis Strains of Escherichia coli

    PubMed Central

    Dubois, Damien; Delmas, Julien; Cady, Anne; Robin, Frédéric; Sivignon, Adeline; Oswald, Eric; Bonnet, Richard

    2010-01-01

    Determinants of urosepsis in Escherichia coli remain incompletely defined. Cyclomodulins (CMs) are a growing functional family of toxins that hijack the eukaryotic cell cycle. Four cyclomodulin types are currently known in E. coli: cytotoxic necrotizing factors (CNFs), cycle-inhibiting factor (Cif), cytolethal distending toxins (CDTs), and the pks-encoded toxin. In the present study, the distribution of CM-encoding genes and the functionality of these toxins were investigated in 197 E. coli strains isolated from patients with community-acquired urosepsis (n = 146) and from uninfected subjects (n = 51). This distribution was analyzed in relation to the phylogenetic background, clinical origin, and antibiotic resistance of the strains. It emerged from this study that strains harboring the pks island and the cnf1 gene (i) were strongly associated with the B2 phylogroup (P < 0.001), (ii) frequently harbored both toxin-encoding genes in phylogroup B2 (33%), and (iii) were predictive of a urosepsis origin (P < 0.001 to 0.005). However, the prevalence of the pks island among phylogroup B2 strains, in contrast to that of the cnf1 gene, was not significantly different between fecal and urosepsis groups, suggesting that the pks island is more important for the colonization process and the cnf1 gene for virulence. pks- or cnf1-harboring strains were significantly associated with susceptibility to antibiotics (amoxicillin, cotrimoxazole, and quinolones [P < 0.001 to 0.043]). Otherwise, only 6% and 1% of all strains harbored the cdtB and cif genes, respectively, with no particular distribution by phylogenetic background, antimicrobial susceptibility, or clinical origin. PMID:20375237

  3. The Ether-Cleaving Methyltransferase System of the Strict Anaerobe Acetobacterium dehalogenans: Analysis and Expression of the Encoding Genes

    PubMed Central

    Schilhabel, Anke; Studenik, Sandra; Vödisch, Martin; Kreher, Sandra; Schlott, Bernhard; Pierik, Antonio Y.; Diekert, Gabriele

    2009-01-01

    Anaerobic O-demethylases are inducible multicomponent enzymes which mediate the cleavage of the ether bond of phenyl methyl ethers and the transfer of the methyl group to tetrahydrofolate. The genes of all components (methyltransferases I and II, CP, and activating enzyme [AE]) of the vanillate- and veratrol-O-demethylases of Acetobacterium dehalogenans were sequenced and analyzed. In A. dehalogenans, the genes for methyltransferase I, CP, and methyltransferase II of both O-demethylases are clustered. The single-copy gene for AE is not included in the O-demethylase gene clusters. It was found that AE grouped with COG3894 proteins, the function of which was unknown so far. Genes encoding COG3894 proteins with 20 to 41% amino acid sequence identity with AE are present in numerous genomes of anaerobic microorganisms. Inspection of the domain structure and genetic context of these orthologs predicts that these are also reductive activases for corrinoid enzymes (RACEs), such as carbon monoxide dehydrogenase/acetyl coenzyme A synthases or anaerobic methyltransferases. The genes encoding the O-demethylase components were heterologously expressed with a C-terminal Strep-tag in Escherichia coli, and the recombinant proteins methyltransferase I, CP, and AE were characterized. Gel shift experiments showed that the AE comigrated with the CP. The formation of other protein complexes with the O-demethylase components was not observed under the conditions used. The results point to a strong interaction of the AE with the CP. This is the first report on the functional heterologous expression of acetogenic phenyl methyl ether-cleaving O-demethylases. PMID:19011025

  4. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    PubMed

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or by neighboring positions in the immediate vicinity of one another on the chromosome. A comprehensive analysis of the expression of neighboring genes is therefore a crucial step toward revealing the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and also provide novel insight into the evolutionary aspects of acquiring certain biological functions. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of E < 10^-5) are included in 27 clusters. Five clusters are associated with metabolism, containing P450 genes restricted to the Brassica family and predicted to be involved in secondary metabolism.
Operon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.
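
The comparison described in the record above (Pearson correlation of neighboring genes against randomly selected gene pairs, with FDR control) can be sketched as follows. This is an illustration under explicit assumptions, not the authors' method: the record does not name its FDR procedure, so Benjamini-Hochberg is swapped in here, and the empirical p-value, the function names, and the input layout (a list of expression profiles ordered by chromosomal position) are all hypothetical.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant profile carries no co-expression signal
    return sxy / math.sqrt(sxx * syy)


def neighbor_correlations(expr):
    """Correlation of each gene with its immediate downstream neighbor."""
    return [pearson(expr[i], expr[i + 1]) for i in range(len(expr) - 1)]


def random_pair_correlations(expr, n_pairs, seed=0):
    """Null distribution: correlations of randomly drawn non-adjacent gene pairs."""
    if len(expr) < 3:
        raise ValueError("need at least 3 genes to draw non-adjacent pairs")
    rng = random.Random(seed)
    null = []
    while len(null) < n_pairs:
        i, j = rng.sample(range(len(expr)), 2)
        if abs(i - j) > 1:
            null.append(pearson(expr[i], expr[j]))
    return null


def empirical_p(observed, null):
    """Share of null correlations at least as large as the observed one."""
    return (1 + sum(r >= observed for r in null)) / (1 + len(null))


def benjamini_hochberg(pvals, alpha=0.1):
    """Indices of hypotheses rejected at FDR level alpha (assumed procedure)."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    cutoff = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= alpha * rank / m:
            cutoff = rank
    return sorted(order[:cutoff])
```

A typical use would compute `neighbor_correlations`, convert each value to a p-value with `empirical_p` against the output of `random_pair_correlations`, and pass the p-values to `benjamini_hochberg`. Runs of consecutive significant neighbor pairs would then be candidate clusters.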

  5. Characterisation of single domain ATP-binding cassette protein homologues of Theileria parva.

    PubMed

    Kibe, M K; Macklin, M; Gobright, E; Bishop, R; Urakawa, T; ole-MoiYoi, O K

    2001-09-01

    Two distinct genes encoding single domain, ATP-binding cassette transport protein homologues of Theileria parva were cloned and sequenced. Neither of the genes is tandemly duplicated. One gene, TpABC1, encodes a predicted protein of 593 amino acids with an N-terminal hydrophobic domain containing six potential membrane-spanning segments. A single discontinuous ATP-binding element was located in the C-terminal region of TpABC1. The second gene, TpABC2, also contains a single C-terminal ATP-binding motif. Copies of TpABC2 were present at four loci in the T. parva genome on three different chromosomes. TpABC1 exhibited allelic polymorphism between stocks of the parasite. Comparison of cDNA and genomic sequences revealed that TpABC1 contained seven short introns, between 29 and 84 bp in length. The full-length TpABC1 protein was expressed in insect cells using the baculovirus system. Application of antibodies raised against the recombinant antigen to western blots of T. parva piroplasm lysates detected an 85 kDa protein in this life-cycle stage.

  6. Current Understanding of Usher Syndrome Type II

    PubMed Central

    Yang, Jun; Wang, Le; Song, Hongman; Sokolov, Maxim

    2012-01-01

    Usher syndrome is the most common form of deaf-blindness caused by genetic mutations. To date, three genes have been identified underlying the most prevalent form of Usher syndrome, the type II form (USH2). The proteins encoded by these genes have been shown to form a complex in vivo. This complex is localized mainly at the periciliary membrane complex in photoreceptors and the ankle-link of the stereocilia in hair cells. Many proteins have been found to interact with USH2 proteins in vitro, suggesting that they are potential additional components of this USH2 complex and that the genes encoding these proteins may be candidate USH2 genes. However, further investigations are critical to establish their existence in the USH2 complex in vivo. Based on the predicted functional domains in USH2 proteins, their cellular localizations in photoreceptors and hair cells, the observed phenotypes in USH2 mutant mice, and current knowledge about diseases similar to USH2, putative biological functions of the USH2 complex have been proposed. Finally, therapeutic approaches for this group of diseases are now being actively explored. PMID:22201796

  7. A conserved gene family encodes transmembrane proteins with fibronectin, immunoglobulin and leucine-rich repeat domains (FIGLER)

    PubMed Central

    Munfus, Delicia L; Haga, Christopher L; Burrows, Peter D; Cooper, Max D

    2007-01-01

    Background: In mouse the cytokine interleukin-7 (IL-7) is required for generation of B lymphocytes, but human IL-7 does not appear to have this function. A bioinformatics approach was therefore used to identify IL-7 receptor related genes in the hope of identifying the elusive human cytokine. Results: Our database search identified a family of nine gene candidates, which we have provisionally named fibronectin immunoglobulin leucine-rich repeat (FIGLER). The FIGLER 1–9 genes are predicted to encode type I transmembrane glycoproteins with 6–12 leucine-rich repeats (LRR), a C2 type Ig domain, a fibronectin type III domain, a hydrophobic transmembrane domain, and a cytoplasmic domain containing one to four tyrosine residues. Members of this multichromosomal gene family possess 20–47% overall amino acid identity and are differentially expressed in cell lines and primary hematopoietic lineage cells. Genes for FIGLER homologs were identified in macaque, orangutan, chimpanzee, mouse, rat, dog, chicken, toad, and puffer fish databases. The non-human FIGLER homologs share 38–99% overall amino acid identity with their human counterpart. Conclusion: The extracellular domain structure and absence of recognizable cytoplasmic signaling motifs in members of the highly conserved FIGLER gene family suggest a trophic or cell adhesion function for these molecules. PMID:17854505

  8. Genome sequence comparison reveals a candidate gene involved in male-hermaphrodite differentiation in papaya (Carica papaya) trees.

    PubMed

    Ueno, Hiroki; Urasaki, Naoya; Natsume, Satoshi; Yoshida, Kentaro; Tarora, Kazuhiko; Shudo, Ayano; Terauchi, Ryohei; Matsumura, Hideo

    2015-04-01

    The sex type of papaya (Carica papaya) is determined by the pair of sex chromosomes (XX, female; XY, male; and XY(h), hermaphrodite), in which there is a non-recombining genomic region in the Y and Y(h) chromosomes. This region is presumed to be involved in determination of males and hermaphrodites; it is designated as the male-specific region in the Y chromosome (MSY) and the hermaphrodite-specific region in the Y(h) chromosome (HSY). Here, we identified the genes determining male and hermaphrodite sex types by comparing MSY and HSY genomic sequences. In the MSY and HSY genomic regions, we identified 14,528 nucleotide substitutions and 965 short indels with a large gap and two highly diverged regions. In the predicted genes expressed in flower buds, we found no nucleotide differences leading to amino acid changes between the MSY and HSY. However, we found an HSY-specific transposon insertion in a gene (SVP like) showing a similarity to the Short Vegetative Phase (SVP) gene. Study of SVP-like transcripts revealed that the MSY allele encoded an intact protein, while the HSY allele encoded a truncated protein. Our findings demonstrated that the SVP-like gene is a candidate gene for male-hermaphrodite determination in papaya.

  9. A Network of Chromatin Factors Is Regulating the Transition to Postembryonic Development in Caenorhabditis elegans

    PubMed Central

    Erdelyi, Peter; Wang, Xing; Suleski, Marina; Wicky, Chantal

    2016-01-01

    Mi2 proteins are evolutionarily conserved, ATP-dependent chromatin remodelers of the CHD family that play key roles in stem cell differentiation and reprogramming. In Caenorhabditis elegans, the let-418 gene encodes one of the two Mi2 homologs, which is part of at least two chromatin complexes, namely the Nucleosome Remodeling and histone Deacetylase (NuRD) complex and the MEC complex, and functions in larval development, vulval morphogenesis, lifespan regulation, and cell fate determination. To explore the mechanisms involved in the action of LET-418/Mi2, we performed a genome-wide RNA interference (RNAi) screen for suppressors of early larval arrest associated with let-418 mutations. We identified 29 suppressor genes, of which 24 encode chromatin regulators, mostly orthologs of proteins present in transcriptional activator complexes. The remaining five genes vary broadly in their predicted functions. All suppressor genes could suppress multiple aspects of the let-418 phenotype, including developmental arrest and ectopic expression of germline genes in the soma. Analysis of available transcriptomic data and quantitative PCR revealed that LET-418 and the suppressors of early larval arrest are regulating common target genes. These suppressors might represent direct competitors of LET-418 complexes for chromatin regulation of crucial genes involved in the transition to postembryonic development. PMID:28007841

  10. A Network of Chromatin Factors Is Regulating the Transition to Postembryonic Development in Caenorhabditis elegans.

    PubMed

    Erdelyi, Peter; Wang, Xing; Suleski, Marina; Wicky, Chantal

    2017-02-09

    Mi2 proteins are evolutionarily conserved, ATP-dependent chromatin remodelers of the CHD family that play key roles in stem cell differentiation and reprogramming. In Caenorhabditis elegans, the let-418 gene encodes one of the two Mi2 homologs, which is part of at least two chromatin complexes, namely the Nucleosome Remodeling and histone Deacetylase (NuRD) complex and the MEC complex, and functions in larval development, vulval morphogenesis, lifespan regulation, and cell fate determination. To explore the mechanisms involved in the action of LET-418/Mi2, we performed a genome-wide RNA interference (RNAi) screen for suppressors of early larval arrest associated with let-418 mutations. We identified 29 suppressor genes, of which 24 encode chromatin regulators, mostly orthologs of proteins present in transcriptional activator complexes. The remaining five genes vary broadly in their predicted functions. All suppressor genes could suppress multiple aspects of the let-418 phenotype, including developmental arrest and ectopic expression of germline genes in the soma. Analysis of available transcriptomic data and quantitative PCR revealed that LET-418 and the suppressors of early larval arrest are regulating common target genes. These suppressors might represent direct competitors of LET-418 complexes for chromatin regulation of crucial genes involved in the transition to postembryonic development. Copyright © 2017 Erdelyi et al.

  11. Molecular cloning of allelopathy related genes and their relation to HHO in Eupatorium adenophorum.

    PubMed

    Guo, Huiming; Pei, Xixiang; Wan, Fanghao; Cheng, Hongmei

    2011-10-01

    In this study, conserved sequence regions of HMGR, DXR, and CHS (encoding 3-hydroxy-3-methylglutaryl-CoA reductase, 1-deoxyxylulose-5-phosphate reductoisomerase and chalcone synthase, respectively) were amplified by reverse transcriptase (RT)-PCR from Eupatorium adenophorum. Quantitative real-time PCR showed that the expression of CHS was related to the level of HHO, an allelochemical isolated from E. adenophorum. Semi-quantitative RT-PCR showed that there was no significant difference in expression of genes among three different tissues, except for CHS. Southern blotting indicated that at least three CHS genes are present in the E. adenophorum genome. A full-length cDNA from CHS genes (named EaCHS1, GenBank ID: FJ913888) was cloned. The 1,455 bp cDNA contained an open reading frame (1,206 bp) encoding a protein of 401 amino acids. Preliminary bioinformatics analysis revealed that EaCHS1 was a member of the CHS family, and subcellular localization prediction indicated that EaCHS1 was a cytoplasmic protein. To the best of our knowledge, this is the first report of conserved sequences of these genes and of a full-length EaCHS1 gene in E. adenophorum. The results indicated that the CHS gene is related to allelopathy of E. adenophorum.

  12. Nucleotide sequence analysis reveals linked N-acetyl hydrolase, thioesterase, transport, and regulatory genes encoded by the bialaphos biosynthetic gene cluster of Streptomyces hygroscopicus.

    PubMed Central

    Raibaud, A; Zalacain, M; Holt, T G; Tizard, R; Thompson, C J

    1991-01-01

    Nucleotide sequence analysis of a 5,000-bp region of the bialaphos antibiotic production (bap) gene cluster defined five open reading frames (ORFs) which predicted structural genes in the order bah, ORF1, ORF2, and ORF3 followed by the regulatory gene, brpA (H. Anzai, T. Murakami, S. Imai, A. Satoh, K. Nagaoka, and C.J. Thompson, J. Bacteriol. 169:3482-3488, 1987). The four structural genes were translationally coupled and apparently cotranscribed from an undefined promoter(s) under the positive control of the brpA gene product. S1 mapping experiments indicated that brpA was transcribed by two promoters (brpAp1 and brpAp2) which initiate transcription 150 and 157 bp upstream of brpA within an intergenic region and at least one promoter further upstream within the bap gene cluster (brpAp3). All three transcripts were present at low levels during exponential growth and increased just before the stationary phase. The levels of the brpAp3 band continued to increase at the onset of stationary phase, whereas brpAp1- and brpAp2-protected fragments showed no further change. BrpA contained a possible helix-turn-helix motif at its C terminus which was similar to the C-terminal regulatory motif found in the receiver component of a family of two-component transcriptional activator proteins. This motif was not associated with the N-terminal domain conserved in other members of the family. The structural gene cluster sequenced began with bah, encoding a bialaphos acetylhydrolase which removes the N-acetyl group from bialaphos as one of the final steps in the biosynthetic pathway. The observation that Bah was similar to a rat and to a bacterial (Acinetobacter calcoaceticus) lipase probably reflects the fact that the ester bonds of triglycerides and the amide bond linking acetate to phosphinothricin are similar and hydrolysis is catalyzed by structurally related enzymes. This was followed by two regions encoding ORF1 and ORF2 which were similar to each other (48% nucleotide identity, 31% amino acid identity), as well as to GrsT, a protein encoded by a gene located adjacent to gramicidin S synthetase in Bacillus brevis, and to vertebrate (mallard duck and rat) thioesterases. The amino acid sequence and hydrophobicity profile of ORF3 indicated that it was related to a family of membrane transport proteins. It was strikingly similar to the citrate uptake protein encoded by the transposon Tn3411. PMID:2066341

  13. Positive selection on human gamete-recognition genes

    PubMed Central

    Stover, Daryn A.; Guerra, Vanessa; Mozaffari, Sahar V.; Ober, Carole; Mugal, Carina F.; Kaj, Ingemar

    2018-01-01

    Coevolution of genes that encode interacting proteins expressed on the surfaces of sperm and eggs can lead to variation in reproductive compatibility between mates and reproductive isolation between members of different species. Previous studies in mice and other mammals have focused in particular on evidence for positive or diversifying selection that shapes the evolution of genes that encode sperm-binding proteins expressed in the egg coat or zona pellucida (ZP). By fitting phylogenetic models of codon evolution to data from the 1000 Genomes Project, we identified candidate sites evolving under diversifying selection in the human genes ZP3 and ZP2. We also identified one candidate site under positive selection in C4BPA, which encodes a repetitive protein similar to the mouse protein ZP3R that is expressed in the sperm head and binds to the ZP at fertilization. Results from several additional analyses that applied population genetic models to the same data were consistent with the hypothesis of selection on those candidate sites leading to coevolution of sperm- and egg-expressed genes. By contrast, we found no candidate sites under selection in a fourth gene (ZP1) that encodes an egg coat structural protein not directly involved in sperm binding. Finally, we found that two of the candidate sites (in C4BPA and ZP2) were correlated with variation in family size and birth rate among Hutterite couples, and those two candidate sites were also in linkage disequilibrium in the same Hutterite study population. All of these lines of evidence are consistent with predictions from a previously proposed hypothesis of balancing selection on epistatic interactions between C4BPA and ZP3 at fertilization that lead to the evolution of co-adapted allele pairs. Such patterns also suggest specific molecular traits that may be associated with both natural reproductive variation and clinical infertility. PMID:29340252

  14. Genome sequence of the Fleming strain of Micrococcus luteus, a simple free-living actinobacterium

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Young, Michael; Artsatbanov, Vladislav; Beller, Harry R.

    Micrococcus luteus (NCTC2665, Fleming strain) has one of the smallest genomes of free living actinobacteria sequenced to date, comprising a single circular chromosome of 2,501,097 bp (G+C content 73%) predicted to encode 2403 proteins. The genome shows extensive synteny with that of the closely related organism, Kocuria rhizophila, from which it was taxonomically separated relatively recently. Despite its small size, the genome harbors 73 IS elements, almost all of which are closely related to elements found in other actinobacteria. An IS element is inserted into the rrs gene of one of only two rrn operons found in M. luteus. The genome encodes only four sigma factors and fourteen response regulators, indicative of adaptation to a rather strict ecological niche (mammalian skin). The high sensitivity of M. luteus to β-lactam antibiotics may result from the presence of a reduced set of penicillin binding proteins and the absence of a wblC gene, which plays an important role in antibiotic resistance in other actinobacteria. Consistent with the restricted range of compounds it can use as a sole source of carbon for energy and growth, M. luteus has a minimal complement of genes concerned with carbohydrate transport and metabolism and its inability to utilize glucose as a sole carbon source may be due to the apparent absence of a gene encoding glucokinase. Uniquely among characterized bacteria, M. luteus appears to be able to metabolize glycogen only via trehalose, and to make trehalose only via glycogen. It has very few genes associated with secondary metabolism. In contrast to other actinobacteria, M. luteus encodes only one resuscitation-promoting factor (Rpf) required for emergence from dormancy and its complement of other dormancy-related proteins is also much reduced. M. luteus is capable of long-chain alkene biosynthesis, which is of interest for advanced biofuel production; a three gene cluster essential for this metabolism has been identified in the genome.

  15. A maize gene encoding an NADPH binding enzyme highly homologous to isoflavone reductases is activated in response to sulfur starvation.

    PubMed Central

    Petrucco, S; Bolchi, A; Foroni, C; Percudani, R; Rossi, G L; Ottonello, S

    1996-01-01

    We isolated a novel gene that is selectively induced both in roots and shoots in response to sulfur starvation. This gene encodes a cytosolic, monomeric protein of 33 kD that selectively binds NADPH. The predicted polypeptide is highly homologous (>70%) to leguminous isoflavone reductases (IFRs), but the maize protein (IRL for isoflavone reductase-like) belongs to a novel family of proteins present in a variety of plants. Anti-IRL antibodies specifically recognize IFR polypeptides, yet the maize protein is unable to use various isoflavonoids as substrates. IRL expression is correlated closely to glutathione availability: it is persistently induced in seedlings whose glutathione content is about fourfold lower than controls, and it is down-regulated rapidly when control levels of glutathione are restored. This glutathione-dependent regulation indicates that maize IRL may play a crucial role in the establishment of a thiol-independent response to oxidative stress under glutathione shortage conditions. PMID:8597660

  16. A maize gene encoding an NADPH binding enzyme highly homologous to isoflavone reductases is activated in response to sulfur starvation.

    PubMed

    Petrucco, S; Bolchi, A; Foroni, C; Percudani, R; Rossi, G L; Ottonello, S

    1996-01-01

    We isolated a novel gene that is selectively induced both in roots and shoots in response to sulfur starvation. This gene encodes a cytosolic, monomeric protein of 33 kD that selectively binds NADPH. The predicted polypeptide is highly homologous (>70%) to leguminous isoflavone reductases (IFRs), but the maize protein (IRL for isoflavone reductase-like) belongs to a novel family of proteins present in a variety of plants. Anti-IRL antibodies specifically recognize IFR polypeptides, yet the maize protein is unable to use various isoflavonoids as substrates. IRL expression is correlated closely to glutathione availability: it is persistently induced in seedlings whose glutathione content is about fourfold lower than controls, and it is down-regulated rapidly when control levels of glutathione are restored. This glutathione-dependent regulation indicates that maize IRL may play a crucial role in the establishment of a thiol-independent response to oxidative stress under glutathione shortage conditions.

  17. Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins

    PubMed Central

    Delcourt, Vivian; Lucier, Jean-François; Gagnon, Jules; Beaudoin, Maxime C; Vanderperre, Benoît; Breton, Marc-André; Motard, Julie; Jacques, Jean-François; Brunelle, Mylène; Gagnon-Arsenault, Isabelle; Fournier, Isabelle; Ouangraoua, Aida; Hunting, Darel J; Cohen, Alan A; Landry, Christian R; Scott, Michelle S

    2017-01-01

    Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins. PMID:29083303

  18. Novel 2,4-Dichlorophenoxyacetic Acid Degradation Genes from Oligotrophic Bradyrhizobium sp. Strain HW13 Isolated from a Pristine Environment

    PubMed Central

    Kitagawa, Wataru; Takami, Sachiko; Miyauchi, Keisuke; Masai, Eiji; Kamagata, Yoichi; Tiedje, James M.; Fukuda, Masao

    2002-01-01

    The tfd genes of Ralstonia eutropha JMP134 are the only well-characterized set of genes responsible for 2,4-dichlorophenoxyacetic acid (2,4-D) degradation among 2,4-D-degrading bacteria. A new family of 2,4-D degradation genes, cadRABKC, was cloned and characterized from Bradyrhizobium sp. strain HW13, a strain that was isolated from a buried Hawaiian soil that has never experienced anthropogenic chemicals. The cadR gene was inferred to encode an AraC/XylS type of transcriptional regulator from its deduced amino acid sequence. The cadABC genes were predicted to encode 2,4-D oxygenase subunits from their deduced amino acid sequences that showed 46, 44, and 37% identities with the TftA and TftB subunits of 2,4,5-trichlorophenoxyacetic acid (2,4,5-T) oxygenase of Burkholderia cepacia AC1100 and with a putative ferredoxin, ThcC, of Rhodococcus erythropolis NI86/21, respectively. They are thoroughly different from the 2,4-D dioxygenase gene, tfdA, of R. eutropha JMP134. The cadK gene was presumed to encode a 2,4-D transport protein from its deduced amino acid sequence that showed 60% identity with the 2,4-D transporter, TfdK, of strain JMP134. Sinorhizobium meliloti Rm1021 cells containing cadRABKC transformed several phenoxyacetic acids, including 2,4-D and 2,4,5-T, to corresponding phenol derivatives. Frameshift mutations indicated that each of the cadRABC genes was essential for 2,4-D conversion in strain Rm1021 but that cadK was not. Five 2,4-D degraders, including Bradyrhizobium and Sphingomonas strains, were found to have cadA gene homologs, suggesting that these 2,4-D degraders share 2,4-D degradation genes similar to those of strain HW13 cadABC. PMID:11751829

  19. The organization of the fuc regulon specifying L-fucose dissimilation in Escherichia coli K12 as determined by gene cloning.

    PubMed

    Chen, Y M; Zhu, Y; Lin, E C

    1987-12-01

    In Escherichia coli the six known genes specifying the utilization of L-fucose as carbon and energy source cluster at 60.2 min and constitute a regulon. These genes include fucP (encoding L-fucose permease), fucI (encoding L-fucose isomerase), fucK (encoding L-fuculose kinase), fucA (encoding L-fuculose 1-phosphate aldolase), fucO (encoding L-1,2-propanediol oxidoreductase), and fucR (encoding the regulatory protein). In this study the fuc genes were cloned and their positions on the chromosome were established by restriction endonuclease and complementation analyses. Clockwise, the gene order is: fucO-fucA-fucP-fucI-fucK-fucR. The operons comprising the structural genes and the direction of transcription were determined by complementation analysis and Southern blot hybridization. The fucPIK and fucA operons are transcribed clockwise. The fucO operon is transcribed counterclockwise. The fucR gene product activates the three structural operons in trans.

  20. Functional analysis of the Brassica napus L. phytoene synthase (PSY) gene family.

    PubMed

    López-Emparán, Ada; Quezada-Martinez, Daniela; Zúñiga-Bustos, Matías; Cifuentes, Víctor; Iñiguez-Luy, Federico; Federico, María Laura

    2014-01-01

    Phytoene synthase (PSY) has been shown to catalyze the first committed and rate-limiting step of carotenogenesis in several crop species, including Brassica napus L. Due to its pivotal role, PSY has been a prime target for breeding and metabolic engineering the carotenoid content of seeds, tubers, fruits and flowers. In Arabidopsis thaliana, PSY is encoded by a single copy gene but small PSY gene families have been described in monocot and dicotyledonous species. We have recently shown that PSY genes have been retained in a triplicated state in the A- and C-Brassica genomes, with each paralogue mapping to syntenic locations in each of the three "Arabidopsis-like" subgenomes. Most importantly, we have shown that in B. napus all six members are expressed, exhibiting overlapping redundancy and signs of subfunctionalization among photosynthetic and non photosynthetic tissues. The question of whether this large PSY family actually encodes six functional enzymes remained to be answered. Therefore, the objectives of this study were to: (i) isolate, characterize and compare the complete protein coding sequences (CDS) of the six B. napus PSY genes; (ii) model their predicted tridimensional enzyme structures; (iii) test their phytoene synthase activity in a heterologous complementation system and (iv) evaluate their individual expression patterns during seed development. This study further confirmed that the six B. napus PSY genes encode proteins with high sequence identity, which have evolved under functional constraint. Structural modeling demonstrated that they share similar tridimensional protein structures with a putative PSY active site. Significantly, all six B. napus PSY enzymes were found to be functional. Taking into account the specific patterns of expression exhibited by these PSY genes during seed development and recent knowledge of PSY suborganellar localization, the selection of transgene candidates for metabolic engineering the carotenoid content of oilseeds is discussed.

  1. Identification of a transcriptional activation domain in yeast repressor activator protein 1 (Rap1) using an altered DNA-binding specificity variant

    PubMed Central

    Johnson, Amanda N.; Weil, P. Anthony

    2017-01-01

    Repressor activator protein 1 (Rap1) performs multiple vital cellular functions in the budding yeast Saccharomyces cerevisiae. These include regulation of telomere length, transcriptional repression of both telomere-proximal genes and the silent mating type loci, and transcriptional activation of hundreds of mRNA-encoding genes, including the highly transcribed ribosomal protein- and glycolytic enzyme-encoding genes. Studies of the contributions of Rap1 to telomere length regulation and transcriptional repression have yielded significant mechanistic insights. However, the mechanism of Rap1 transcriptional activation remains poorly understood because Rap1 is encoded by a single copy essential gene and is involved in many disparate and essential cellular functions, preventing easy interpretation of attempts to directly dissect Rap1 structure-function relationships. Moreover, conflicting reports on the ability of Rap1-heterologous DNA-binding domain fusion proteins to serve as chimeric transcriptional activators challenge use of this approach to study Rap1. Described here is the development of an altered DNA-binding specificity variant of Rap1 (Rap1AS). We used Rap1AS to map and characterize a 41-amino acid activation domain (AD) within the Rap1 C terminus. We found that this AD is required for transcription of both chimeric reporter genes and authentic chromosomal Rap1 enhancer-containing target genes. Finally, as predicted for a bona fide AD, mutation of this newly identified AD reduced the efficiency of Rap1 binding to a known transcriptional coactivator TFIID-binding target, Taf5. In summary, we show here that Rap1 contains an AD required for Rap1-dependent gene transcription. The Rap1AS variant will likely also be useful for studies of the functions of Rap1 in other biological pathways. PMID:28196871

  2. Bacterial infection as assessed by in vivo gene expression

    PubMed Central

    Heithoff, Douglas M.; Conner, Christopher P.; Hanna, Philip C.; Julio, Steven M.; Hentschel, Ute; Mahan, Michael J.

    1997-01-01

    In vivo expression technology (IVET) has been used to identify >100 Salmonella typhimurium genes that are specifically expressed during infection of BALB/c mice and/or murine cultured macrophages. Induction of these genes is shown to be required for survival in the animal under conditions of the IVET selection. One class of in vivo induced (ivi) genes, iviVI-A and iviVI-B, constitute an operon that resides in a region of the Salmonella genome with low G+C content and presumably has been acquired by horizontal transfer. These ivi genes encode predicted proteins that are similar to adhesins and invasins from prokaryotic and eukaryotic pathogens (Escherichia coli [tia], Plasmodium falciparum [PfEMP1]) and have coopted the PhoPQ regulatory circuitry of Salmonella virulence genes. Examination of the in vivo induction profile indicates (i) many ivi genes encode regulatory functions (e.g., phoPQ and pmrAB) that serve to enhance the sensitivity and amplitude of virulence gene expression (e.g., spvB); (ii) the biochemical function of many metabolic genes may not represent their sole contribution to virulence; (iii) the host ecology can be inferred from the biochemical functions of ivi genes; and (iv) nutrient limitation plays a dual signaling role in pathogenesis: to induce metabolic functions that complement host nutritional deficiencies and to induce virulence functions required for immediate survival and spread to subsequent host sites. PMID:9023360

  3. Expression of uncharacterized male germ cell-specific genes and discovery of novel sperm-tail proteins in mice.

    PubMed

    Kwon, Jun Tae; Ham, Sera; Jeon, Suyeon; Kim, Youil; Oh, Seungmin; Cho, Chunghee

    2017-01-01

    The identification and characterization of germ cell-specific genes are essential if we hope to comprehensively understand the mechanisms of spermatogenesis and fertilization. Here, we searched the mouse UniGene databases and identified 13 novel genes as being putatively testis-specific or -predominant. Our in silico and in vitro analyses revealed that the expressions of these genes are testis- and germ cell-specific, and that they are regulated in a stage-specific manner during spermatogenesis. We generated antibodies against the proteins encoded by seven of the genes to facilitate their characterization in male germ cells. Immunoblotting and immunofluorescence analyses revealed that one of these proteins was expressed only in testicular germ cells, three were expressed in both testicular germ cells and testicular sperm, and the remaining three were expressed in sperm of the testicular stages and in mature sperm from the epididymis. Further analysis of the latter three proteins showed that they were all associated with cytoskeletal structures in the sperm flagellum. Among them, MORN5, which is predicted to contain three MORN motifs, is conserved between mouse and human sperm. In conclusion, we herein identify 13 authentic genes with male germ cell-specific expression, and provide comprehensive information about these genes and their encoded products. Our finding will facilitate future investigations into the functional roles of these novel genes in spermatogenesis and sperm functions.

  4. A novel immediate-early response gene of endothelium is induced by cytokines and encodes a secreted protein.

    PubMed

    Holzman, L B; Marks, R M; Dixit, V M

    1990-11-01

    We have previously described the cloning of a group of novel cellular immediate-early response genes whose expression in human umbilical vein endothelial cells is induced by tumor necrosis factor alpha in the presence of cycloheximide. These genes are likely to participate in mediating the response of the vascular endothelium to proinflammatory cytokines. In this study, we further characterized one of these novel gene products named B61. Sequence analysis of cDNA clones encoding B61 revealed that its protein product has no significant homology to previously described proteins. Southern analysis suggested that B61 is an evolutionarily conserved single-copy gene. B61 is primarily a hydrophilic molecule but contains both a hydrophobic N-terminal and a hydrophobic C-terminal region. The N-terminal region is typical of a signal peptide, which is consistent with the secreted nature of the protein. The mature form of the predicted protein consists of 187 amino acid residues and has a molecular weight of 22,000. Immunoprecipitation of metabolically labeled human umbilical vein endothelial cell preparations revealed that B61 is a 25-kilodalton secreted protein which is markedly induced by tumor necrosis factor.

  5. A novel immediate-early response gene of endothelium is induced by cytokines and encodes a secreted protein.

    PubMed Central

    Holzman, L B; Marks, R M; Dixit, V M

    1990-01-01

    We have previously described the cloning of a group of novel cellular immediate-early response genes whose expression in human umbilical vein endothelial cells is induced by tumor necrosis factor alpha in the presence of cycloheximide. These genes are likely to participate in mediating the response of the vascular endothelium to proinflammatory cytokines. In this study, we further characterized one of these novel gene products named B61. Sequence analysis of cDNA clones encoding B61 revealed that its protein product has no significant homology to previously described proteins. Southern analysis suggested that B61 is an evolutionarily conserved single-copy gene. B61 is primarily a hydrophilic molecule but contains both a hydrophobic N-terminal and a hydrophobic C-terminal region. The N-terminal region is typical of a signal peptide, which is consistent with the secreted nature of the protein. The mature form of the predicted protein consists of 187 amino acid residues and has a molecular weight of 22,000. Immunoprecipitation of metabolically labeled human umbilical vein endothelial cell preparations revealed that B61 is a 25-kilodalton secreted protein which is markedly induced by tumor necrosis factor. PMID:2233719

  6. Fission yeast cdc24(+) encodes a novel replication factor required for chromosome integrity.

    PubMed

    Gould, K L; Burns, C G; Feoktistova, A; Hu, C P; Pasion, S G; Forsburg, S L

    1998-07-01

    A mutation within the Schizosaccharomyces pombe cdc24(+) gene was identified previously in a screen for cell division cycle mutants and the cdc24(+) gene was determined to be essential for S phase in this yeast. We have isolated the cdc24(+) gene by complementation of a new temperature-sensitive allele of the gene, cdc24-G1. The DNA sequence predicts the presence of an open reading frame punctuated by six introns which encodes a pioneer protein of 58 kD. A cdc24 null mutant was generated by homologous recombination. Haploid cells lacking cdc24(+) are inviable, indicating that cdc24(+) is an essential gene. The transcript of cdc24(+) is present at constant levels throughout the cell cycle. Cells lacking cdc24(+) function show a checkpoint-dependent arrest with a 2N DNA content, indicating a block late in S phase. Arrest is accompanied by a rapid loss of viability and chromosome breakage. An S. pombe homolog of the replicative DNA helicase DNA2 of S. cerevisiae suppresses cdc24. These results suggest that Cdc24p plays a role in the progression of normal DNA replication and is required to maintain genomic integrity.

  7. Fission yeast cdc24(+) encodes a novel replication factor required for chromosome integrity.

    PubMed Central

    Gould, K L; Burns, C G; Feoktistova, A; Hu, C P; Pasion, S G; Forsburg, S L

    1998-01-01

    A mutation within the Schizosaccharomyces pombe cdc24(+) gene was identified previously in a screen for cell division cycle mutants and the cdc24(+) gene was determined to be essential for S phase in this yeast. We have isolated the cdc24(+) gene by complementation of a new temperature-sensitive allele of the gene, cdc24-G1. The DNA sequence predicts the presence of an open reading frame punctuated by six introns which encodes a pioneer protein of 58 kD. A cdc24 null mutant was generated by homologous recombination. Haploid cells lacking cdc24(+) are inviable, indicating that cdc24(+) is an essential gene. The transcript of cdc24(+) is present at constant levels throughout the cell cycle. Cells lacking cdc24(+) function show a checkpoint-dependent arrest with a 2N DNA content, indicating a block late in S phase. Arrest is accompanied by a rapid loss of viability and chromosome breakage. An S. pombe homolog of the replicative DNA helicase DNA2 of S. cerevisiae suppresses cdc24. These results suggest that Cdc24p plays a role in the progression of normal DNA replication and is required to maintain genomic integrity. PMID:9649516

  8. The non-essential UL50 gene of avian infectious laryngotracheitis virus encodes a functional dUTPase which is not a virulence factor.

    PubMed

    Fuchs, W; Ziemann, K; Teifke, J P; Werner, O; Mettenleiter, T C

    2000-03-01

    The DNA sequence of the infectious laryngotracheitis virus (ILTV) UL50, UL51 and UL52 gene homologues was determined. Although the deduced UL50 protein lacks the first of five conserved domains of the corresponding proteins of mammalian alphaherpesviruses, the ILTV gene product was also shown to possess dUTPase activity. The generation of UL50-negative ILTV mutants was facilitated by recombination plasmids encoding green fluorescent protein (GFP), and expression constructs of predicted transactivator proteins of ILTV (alphaTIF, ICP4) were successfully used to increase the infectivity of viral genomic DNA. A GFP-expressing UL50-deletion mutant of ILTV showed reduced cell-to-cell spread in vitro, and was attenuated in vivo. A similar deletion mutant without the foreign gene, however, propagated like wild-type ILTV in cell culture and was pathogenic in chickens. We conclude that the viral dUTPase is not required for efficient replication of ILTV in the respiratory tract of infected animals. The replication defect of the GFP-expressing ILTV recombinant is most likely caused by toxic effects of the reporter gene product, since spontaneously occurring inactivation mutants exhibited wild-type-like growth.

  9. Structure and expression of genes for a class of cysteine-rich proteins of the cuticle layers of differentiating wool and hair follicles

    PubMed Central

    1990-01-01

    The major histological components of the hair follicle are the hair cortex and cuticle. The hair cuticle cells encase and protect the cortex and undergo a different developmental program to that of the cortex. We report the molecular characterization of a set of evolutionarily conserved hair genes which are transcribed in the hair cuticle late in follicle development. Two genes were isolated and characterized, one expressed in the human follicle and one in the sheep follicle. Each gene encodes a small protein of 16 kD, containing greater than 50 cysteine residues, ranging from 31 to 36 mol% cysteine. Their high cysteine content and in vitro expression data identify them as ultra-high-sulfur (UHS) keratin proteins. The predicted proteins are composed almost entirely of cysteine-rich and glycine-rich repeats. Genomic blots reveal that the UHS keratin proteins are encoded by related multigene families in both the human and sheep genomes. Tissue in situ hybridization demonstrates that the expression of both genes is localized to the hair fiber cuticle and occurs at a late stage in fiber morphogenesis. PMID:1703541

  10. Evolutionary Characteristics of Missing Proteins: Insights into the Evolution of Human Chromosomes Related to Missing-Protein-Encoding Genes.

    PubMed

    Xu, Aishi; Li, Guang; Yang, Dong; Wu, Songfeng; Ouyang, Hongsheng; Xu, Ping; He, Fuchu

    2015-12-04

    Although the "missing protein" is a temporary concept in C-HPP, the biological information for their "missing" could be an important clue in evolutionary studies. Here we classified missing-protein-encoding genes into two groups, the genes encoding PE2 proteins (with transcript evidence) and the genes encoding PE3/4 proteins (with no transcript evidence). These missing-protein-encoding genes distribute unevenly among different chromosomes, chromosomal regions, or gene clusters. In the view of evolutionary features, PE3/4 genes tend to be young, spreading at the nonhomology chromosomal regions and evolving at higher rates. Interestingly, there is a higher proportion of singletons in PE3/4 genes than the proportion of singletons in all genes (background) and OTCSGs (organ, tissue, cell type-specific genes). More importantly, most of the paralogous PE3/4 genes belong to the newly duplicated members of the paralogous gene groups, which mainly contribute to special biological functions, such as "smell perception". These functions are heavily restricted into specific type of cells, tissues, or specific developmental stages, acting as the new functional requirements that facilitated the emergence of the missing-protein-encoding genes during evolution. In addition, the criteria for the extremely special physical-chemical proteins were first set up based on the properties of PE2 proteins, and the evolutionary characteristics of those proteins were explored. Overall, the evolutionary analyses of missing-protein-encoding genes are expected to be highly instructive for proteomics and functional studies in the future.

  11. [Genetic instability of probiotic characteristics in the Bifidobacterium longum subsp. longum B379M strain during cultivation and maintenance].

    PubMed

    Averina, O V; Nezametdinova, V Z; Alekseeva, M G; Danilenko, V N

    2012-11-01

    The stability of inheriting several genes in the Russian commercial strain Bifidobacterium longum subsp. longum B379M during cultivation and maintenance under laboratory conditions has been studied. The examined genes code for probiotic characteristics, such as utilization of several sugars (lacA2 gene, encoding beta-galactosidase; ara gene, encoding arabinosidase; and galA gene, encoding arabinogalactan endo-beta-galactosidase); synthesis of bacteriocins (lans gene, encoding lanthionine synthetase); and mobile gene tet(W), conferring resistance to the antibiotic tetracycline. The other gene families studied include the genes responsible for signal transduction and adaptation to stress conditions in the majority of bacteria (serine/threonine protein kinases and the toxin-antitoxin systems of MazEF and RelBE types) and transcription regulators (genes encoding WhiB family proteins). Genomic DNA was analyzed by PCR using specially selected primers. A loss of the genes galA and tet(W) has been shown. It is proposed to expand the requirements on probiotic strains, namely, to control retention of the key probiotic genes using molecular biological methods.

  12. De novo Transcriptome Analysis of Rhizoctonia solani AG1 IA Strain Early Invasion in Zoysia japonica Root.

    PubMed

    Zhu, Chen; Ai, Lin; Wang, Li; Yin, Pingping; Liu, Chenglan; Li, Shanshan; Zeng, Huiming

    2016-01-01

    Zoysia japonica brown spot was caused by necrotrophic fungus Rhizoctonia solani invasion, which led to severe financial loss in city lawn and golf ground maintenance. However, little was known about the molecular mechanism of R. solani pathogenicity in Z. japonica. In this study we examined early stage interaction between R. solani AG1 IA strain and Z. japonica cultivar "Zenith" root by cell ultra-structure analysis, pathogenesis-related proteins assay and transcriptome analysis to explore molecular clues for AG1 IA strain pathogenicity in Z. japonica. No obvious cell structure damage was found in infected roots and most pathogenesis-related protein activities showed a downward trend especially in 36 h post inoculation, which exhibits AG1 IA strain stealthy invasion characteristic. According to Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database classification, most DEGs in infected "Zenith" roots dynamically changed especially in three aspects, signal transduction, gene translation, and protein synthesis. A total of 3422 unigenes of "Zenith" root were predicted into 14 kinds of resistance (R) gene class. Potential fungal resistance related unigenes of "Zenith" root were involved in lignin biosynthesis, phytoalexin synthesis, oxidative burst, wax biosynthesis, while two down-regulated unigenes encoding leucine-rich repeat receptor protein kinase and subtilisin-like protease might be important for host-derived signal perception to AG1 IA strain invasion. According to Pathogen Host Interaction (PHI) database annotation, 1508 unigenes of AG1 IA strain were predicted and classified into 37 known pathogen species, in addition, unigenes encoding virulence, signaling, host stress tolerance, and potential effector were also predicted. This research uncovered transcriptional profiling during the early phase interaction between R. solani AG1 IA strain and Z. japonica, and will greatly help identify key pathogenicity of AG1 IA strain.

  13. Identification of a Novel Mucin Gene HCG22 Associated With Steroid-Induced Ocular Hypertension

    PubMed Central

    Jeong, Shinwu; Patel, Nitin; Edlund, Christopher K.; Hartiala, Jaana; Hazelett, Dennis J.; Itakura, Tatsuo; Wu, Pei-Chang; Avery, Robert L.; Davis, Janet L.; Flynn, Harry W.; Lalwani, Geeta; Puliafito, Carmen A.; Wafapoor, Hussein; Hijikata, Minako; Keicho, Naoto; Gao, Xiaoyi; Argüeso, Pablo; Allayee, Hooman; Coetzee, Gerhard A.; Pletcher, Mathew T.; Conti, David V.; Schwartz, Stephen G.; Eaton, Alexander M.; Fini, M. Elizabeth

    2015-01-01

    Purpose. The pathophysiology of ocular hypertension (OH) leading to primary open-angle glaucoma shares many features with a secondary form of OH caused by treatment with glucocorticoids, but also exhibits distinct differences. In this study, a pharmacogenomics approach was taken to discover candidate genes for this disorder. Methods. A genome-wide association study was performed, followed by an independent candidate gene study, using a cohort enrolled from patients treated with off-label intravitreal triamcinolone, and handling change in IOP as a quantitative trait. Results. An intergenic quantitative trait locus (QTL) was identified at chromosome 6p21.33 near the 5′ end of HCG22 that attained the accepted statistical threshold for genome-level significance. The HCG22 transcript, encoding a novel mucin protein, was expressed in trabecular meshwork cells, and expression was stimulated by IL-1, and inhibited by triamcinolone acetate and TGF-β. Bioinformatic analysis defined the QTL as an approximately 4 kilobase (kb) linkage disequilibrium block containing 10 common single nucleotide polymorphisms (SNPs). Four of these SNPs were identified in the National Center for Biotechnology Information (NCBI) GTEx eQTL browser as modifiers of HCG22 expression. Most are predicted to disrupt or improve motifs for transcription factor binding, the most relevant being disruption of the glucocorticoid receptor binding motif. A second QTL was identified within the predicted signal peptide of the HCG22 encoded protein that could affect its secretion. Translation, O-glycosylation, and secretion of the predicted HCG22 protein was verified in cultured trabecular meshwork cells. Conclusions. Identification of two independent QTLs that could affect expression of the HCG22 mucin gene product via two different mechanisms (transcription or secretion) is highly suggestive of a role in steroid-induced OH. PMID:25813999

  14. Isolated gene encoding an enzyme with UDP-glucose pyrophosphorylase and phosphoglucomutase activities from Cyclotella cryptica

    DOEpatents

    Jarvis, Eric E.; Roessler, Paul G.

    1999-01-01

    The present invention relates to a cloned gene which encodes an enzyme, the purified enzyme, and the applications and products resulting from the use of the gene and enzyme. The gene, isolated from Cyclotella cryptica, encodes a multifunctional enzyme that has both UDP-glucose pyrophosphorylase and phosphoglucomutase activities.

  15. Human Genomic Signatures of Brain Oscillations During Memory Encoding.

    PubMed

    Berto, Stefano; Wang, Guang-Zhong; Germi, James; Lega, Bradley C; Konopka, Genevieve

    2018-05-01

    Memory encoding is an essential step for all learning. However, the genetic and molecular mechanisms underlying human memory encoding remain poorly understood, and how this molecular framework permits the emergence of specific patterns of brain oscillations observed during mnemonic processing is unknown. Here, we directly compare intracranial electroencephalography recordings from the neocortex in individuals performing an episodic memory task with human gene expression from the same areas. We identify genes correlated with oscillatory memory effects across 6 frequency bands. These genes are enriched for autism-related genes and have preferential expression in neurons, in particular genes encoding synaptic proteins and ion channels, supporting the idea that the genes regulating voltage gradients are involved in the modulation of oscillatory patterns during successful memory encoding across brain areas. Memory-related genes are distinct from those correlated with other forms of cognitive processing and resting state fMRI. These data are the first to identify correlations between gene expression and active human brain states as well as provide a molecular window into memory encoding oscillations in the human brain.

  16. Mapping Antigenic Sites of an Immunodominant Surface Lipoprotein of Mycoplasma agalactiae, AvgC, with the Use of Synthetic Peptides

    PubMed Central

    Santona, Antonella; Carta, Franco; Fraghí, Peppinetta; Turrini, Franco

    2002-01-01

    As a first step toward the design of an epitope vaccine to prevent contagious agalactia, the strongly immunogenic 55-kDa protein of Mycoplasma agalactiae was studied and found to correspond to the AvgC protein encoded by the avgC gene. The avg genes of M. agalactiae, which encode four variable surface lipoproteins, display a significant homology to the vsp (variable membrane surface lipoproteins) genes of the bovine pathogen Mycoplasma bovis at their promoter region as well as their N-terminus-encoding regions. Some members of the Vsp family are known to be involved in cytoadhesion to host cells. In order to localize immunogenic peptides in the AvgC antigen, the protein sequence was submitted to epitope prediction analysis, and five sets of overlapping peptides, corresponding to five selected regions, were synthesized by Spot synthesis. Reactive peptides were selected by immunobinding assay with sera from infected sheep. The three most immunogenic epitopes were shown to be surface exposed by immunoprecipitation assays, and one of these was specifically recognized by all tested sera. Our study indicates that selected epitopes of the AvgC lipoprotein may be used to develop a peptide-based vaccine which is effective against M. agalactiae infection. PMID:11748179

  17. Regression Analysis of Combined Gene Expression Regulation in Acute Myeloid Leukemia

    PubMed Central

    Li, Yue; Liang, Minggao; Zhang, Zhaolei

    2014-01-01

    Gene expression is a combinatorial function of genetic/epigenetic factors such as copy number variation (CNV), DNA methylation (DM), transcription factors (TF) occupancy, and microRNA (miRNA) post-transcriptional regulation. At the maturity of microarray/sequencing technologies, large amounts of data measuring the genome-wide signals of those factors became available from Encyclopedia of DNA Elements (ENCODE) and The Cancer Genome Atlas (TCGA). However, there is a lack of an integrative model to take full advantage of these rich yet heterogeneous data. To this end, we developed RACER (Regression Analysis of Combined Expression Regulation), which fits the mRNA expression as response using as explanatory variables, the TF data from ENCODE, and CNV, DM, miRNA expression signals from TCGA. Briefly, RACER first infers the sample-specific regulatory activities by TFs and miRNAs, which are then used as inputs to infer specific TF/miRNA-gene interactions. Such a two-stage regression framework circumvents a common difficulty in integrating ENCODE data measured in generic cell-line with the sample-specific TCGA measurements. As a case study, we integrated Acute Myeloid Leukemia (AML) data from TCGA and the related TF binding data measured in K562 from ENCODE. As a proof-of-concept, we first verified our model formalism by 10-fold cross-validation on predicting gene expression. We next evaluated RACER on recovering known regulatory interactions, and demonstrated its superior statistical power over existing methods in detecting known miRNA/TF targets. Additionally, we developed a feature selection procedure, which identified 18 regulators, whose activities clustered consistently with cytogenetic risk groups. One of the selected regulators is miR-548p, whose inferred targets were significantly enriched for leukemia-related pathway, implicating its novel role in AML pathogenesis. Moreover, survival analysis using the inferred activities identified C-Fos as a potential AML prognostic marker. Together, we provided a novel framework that successfully integrated the TCGA and ENCODE data in revealing AML-specific regulatory program at global level. PMID:25340776

  18. Virulence factors encoded by Legionella longbeachae identified on the basis of the genome sequence analysis of clinical isolate D-4968.

    PubMed

    Kozak, Natalia A; Buss, Meghan; Lucas, Claressa E; Frace, Michael; Govil, Dhwani; Travis, Tatiana; Olsen-Rasmussen, Melissa; Benson, Robert F; Fields, Barry S

    2010-02-01

    Legionella longbeachae causes most cases of legionellosis in Australia and may be underreported worldwide due to the lack of L. longbeachae-specific diagnostic tests. L. longbeachae displays distinctive differences in intracellular trafficking, caspase 1 activation, and infection in mouse models compared to Legionella pneumophila, yet these two species have indistinguishable clinical presentations in humans. Unlike other legionellae, which inhabit freshwater systems, L. longbeachae is found predominantly in moist soil. In this study, we sequenced and annotated the genome of an L. longbeachae clinical isolate from Oregon, isolate D-4968, and compared it to the previously published genomes of L. pneumophila. The results revealed that the D-4968 genome is larger than the L. pneumophila genome and has a gene order that is different from that of the L. pneumophila genome. Genes encoding structural components of type II, type IV Lvh, and type IV Icm/Dot secretion systems are conserved. In contrast, only 42/140 homologs of genes encoding L. pneumophila Icm/Dot substrates have been found in the D-4968 genome. L. longbeachae encodes numerous proteins with eukaryotic motifs and eukaryote-like proteins unique to this species, including 16 ankyrin repeat-containing proteins and a novel U-box protein. We predict that these proteins are secreted by the L. longbeachae Icm/Dot secretion system. In contrast to the L. pneumophila genome, the L. longbeachae D-4968 genome does not contain flagellar biosynthesis genes, yet it contains a chemotaxis operon. The lack of a flagellum explains the failure of L. longbeachae to activate caspase 1 and trigger pyroptosis in murine macrophages. These unique features of L. longbeachae may reflect adaptation of this species to life in soil.

  19. Identification of a hybrid PKS-NRPS required for the biosynthesis of NG-391 and NG-393 metabolites in Metarhizium anisopliae

    USDA-ARS's Scientific Manuscript database

    A 19,818 kb genomic region harboring six predicted ORFs was identified in M. anisopliae ARSEF 2575. The ORF4 CDS, putatively encoding a hybrid polyketide synthase/nonribosomal peptide synthetase (PKS-NRPS) was targeted using Agrobacterium-mediated gene knockout. Homologous, but not heterolog...

  20. The nop gene from Phanerochaete chrysosporium encodes a peroxidase with novel structural features

    Treesearch

    Luis F. Larrondo; Angel Gonzalez; Tomas Perez-Acle; Dan Cullen; Rafael Vicuna

    2005-01-01

    Inspection of the genome of the ligninolytic basidiomycete Phanerochaete chrysosporium revealed an unusual peroxidase-like sequence. The corresponding full length cDNA was sequenced and an archetypal secretion signal predicted. The deduced mature protein (NoP, novel peroxidase) contains 295 aa residues and is therefore considerably shorter than other Class II (fungal)...

  1. A draft genome sequence of “Candidatus Liberibacter asiaticus” from California, USA

    USDA-ARS's Scientific Manuscript database

    The draft genome sequence of “Candidatus Liberibacter asiaticus” strain HHCA, collected from a lemon tree in California, USA, is reported. The HHCA strain has a genome size of 1,118,244 bp, with G+C content of 36.6%. The HHCA genome encodes 1,191 predicted open reading frames and 51 RNA genes....

  2. Genomic approach to therapeutic target validation identifies a glucose-lowering GLP1R variant protective for coronary heart disease

    PubMed Central

    Scott, Robert A.; Freitag, Daniel F.; Li, Li; Chu, Audrey Y.; Surendran, Praveen; Young, Robin; Grarup, Niels; Stancáková, Alena; Chen, Yuning; V.Varga, Tibor; Yaghootkar, Hanieh; Luan, Jian'an; Zhao, Jing Hua; Willems, Sara M.; Wessel, Jennifer; Wang, Shuai; Maruthur, Nisa; Michailidou, Kyriaki; Pirie, Ailith; van der Lee, Sven J.; Gillson, Christopher; Olama, Ali Amin Al; Amouyel, Philippe; Arriola, Larraitz; Arveiler, Dominique; Aviles-Olmos, Iciar; Balkau, Beverley; Barricarte, Aurelio; Barroso, Inês; Garcia, Sara Benlloch; Bis, Joshua C.; Blankenberg, Stefan; Boehnke, Michael; Boeing, Heiner; Boerwinkle, Eric; Borecki, Ingrid B.; Bork-Jensen, Jette; Bowden, Sarah; Caldas, Carlos; Caslake, Muriel; Cupples, L. Adrienne; Cruchaga, Carlos; Czajkowski, Jacek; den Hoed, Marcel; Dunn, Janet A.; Earl, Helena M.; Ehret, Georg B.; Ferrannini, Ele; Ferrieres, Jean; Foltynie, Thomas; Ford, Ian; Forouhi, Nita G.; Gianfagna, Francesco; Gonzalez, Carlos; Grioni, Sara; Hiller, Louise; Jansson, Jan-Håkan; Jørgensen, Marit E.; Jukema, J. Wouter; Kaaks, Rudolf; Kee, Frank; Kerrison, Nicola D.; Key, Timothy J.; Kontto, Jukka; Kote-Jarai, Zsofia; Kraja, Aldi T.; Kuulasmaa, Kari; Kuusisto, Johanna; Linneberg, Allan; Liu, Chunyu; Marenne, Gaëlle; Mohlke, Karen L.; Morris, Andrew P.; Muir, Kenneth; Müller-Nurasyid, Martina; Munroe, Patricia B.; Navarro, Carmen; Nielsen, Sune F.; Nilsson, Peter M.; Nordestgaard, Børge G.; Packard, Chris J.; Palli, Domenico; Panico, Salvatore; Peloso, Gina M.; Perola, Markus; Peters, Annette; Poole, Christopher J.; Quirós, J. Ramón; Rolandsson, Olov; Sacerdote, Carlotta; Salomaa, Veikko; Sánchez, María-José; Sattar, Naveed; Sharp, Stephen J.; Sims, Rebecca; Slimani, Nadia; Smith, Jennifer A.; Thompson, Deborah J.; Trompet, Stella; Tumino, Rosario; van der A, Daphne L.; van der Schouw, Yvonne T.; Virtamo, Jarmo; Walker, Mark; Walter, Klaudia; Abraham, Jean E.; Amundadottir, Laufey T.; Aponte, Jennifer L.; Butterworth, Adam S.; Dupuis, Josée; Easton, Douglas F.; Eeles, Rosalind A.; Erdmann, Jeanette; Franks, Paul W.; Frayling, Timothy M.; Hansen, Torben; Howson, Joanna M. M.; Jørgensen, Torben; Kooner, Jaspal; Laakso, Markku; Langenberg, Claudia; McCarthy, Mark I.; Pankow, James S.; Pedersen, Oluf; Riboli, Elio; Rotter, Jerome I.; Saleheen, Danish; Samani, Nilesh J.; Schunkert, Heribert; Vollenweider, Peter; O'Rahilly, Stephen; Deloukas, Panos; Danesh, John; Goodarzi, Mark O.; Kathiresan, Sekar; Meigs, James B.; Ehm, Margaret G.; Wareham, Nicholas J.; Waterworth, Dawn M.

    2016-01-01

    Regulatory authorities have indicated that new drugs to treat type 2 diabetes (T2D) should not be associated with an unacceptable increase in cardiovascular risk. Human genetics may be able to inform development of antidiabetic therapies by predicting cardiovascular and other health endpoints. We therefore investigated the association of variants in 6 genes that encode drug targets for obesity or T2D with a range of metabolic traits in up to 11,806 individuals by targeted exome sequencing, and follow-up in 39,979 individuals by targeted genotyping, with additional in silico follow-up in consortia. We used these data to first compare associations of variants in genes encoding drug targets with the effects of pharmacological manipulation of those targets in clinical trials. We then tested the association of those variants with disease outcomes, including coronary heart disease, to predict cardiovascular safety of these agents. A low-frequency missense variant (Ala316Thr;rs10305492) in the gene encoding glucagon-like peptide-1 receptor (GLP1R), the target of GLP1R agonists, was associated with lower fasting glucose and lower T2D risk, consistent with GLP1R agonist therapies. The minor allele was also associated with protection against heart disease, thus providing evidence that GLP1R agonists are not likely to be associated with an unacceptable increase in cardiovascular risk. Our results provide an encouraging signal that these agents may be associated with benefit, a question currently being addressed in randomised controlled trials. Genetic variants associated with metabolic traits and multiple disease outcomes can be used to validate therapeutic targets at an early stage in the drug development process. PMID:27252175

  3. Genomic sequence of mandarin fish rhabdovirus with an unusual small non-transcriptional ORF.

    PubMed

    Tao, Jian-Jun; Zhou, Guang-Zhou; Gui, Jian-Fang; Zhang, Qi-Ya

    2008-03-01

    The complete genome of mandarin fish Siniperca chuatsi rhabdovirus (SCRV) was cloned and sequenced. It comprises 11,545 nucleotides and contains five genes encoding the nucleoprotein N, the phosphoprotein P, the matrix protein M, the glycoprotein G, and the RNA-dependent RNA polymerase protein L. At the 3' and 5' termini of SCRV genome, leader and trailer sequences show inverse complementarity. The N, P, M and G proteins share the highest sequence identities (ranging from 14.8 to 41.5%) with the respective proteins of rhabdovirus 903/87, the L protein has the highest identity with those of vesiculoviruses, especially with Chandipura virus (44.7%). Phylogenetic analysis of L proteins showed that SCRV clustered with spring viraemia of carp virus (SVCV) and was most closely related to viruses in the genus Vesiculovirus. In addition, an overlapping open reading frame (ORF) predicted to encode a protein similar to vesicular stomatitis virus C protein is present within the P gene of SCRV. Furthermore, an unoverlapping small ORF downstream of M ORF within M gene is predicted (tentatively called orf4). Therefore, the genomic organization of SCRV can be proposed as 3' leader-N-P/C-M-(orf4)-G-L-trailer 5'. Orf4 transcription or translation products could not be detected by northern or Western blot, respectively, though one similar mRNA band to M mRNA was found. This is the first report on one small unoverlapping ORF in M gene of a fish rhabdovirus.

  4. Isolated gene encoding an enzyme with UDP-glucose pyrophosphorylase and phosphoglucomutase activities from Cyclotella cryptica

    DOEpatents

    Jarvis, E.E.; Roessler, P.G.

    1999-07-27

    The present invention relates to a cloned gene which encodes an enzyme, the purified enzyme, and the applications and products resulting from the use of the gene and enzyme. The gene, isolated from Cyclotella cryptica, encodes a multifunctional enzyme that has both UDP-glucose pyrophosphorylase and phosphoglucomutase activities. 8 figs.

  5. Genetic Determinants Influencing Human Serum Metabolome among African Americans

    PubMed Central

    Yu, Bing; Zheng, Yan; Alexander, Danny; Morrison, Alanna C.; Coresh, Josef; Boerwinkle, Eric

    2014-01-01

    Phenotypes proximal to gene action generally reflect larger genetic effect sizes than those that are distant. The human metabolome, a result of multiple cellular and biological processes, are functional intermediate phenotypes proximal to gene action. Here, we present a genome-wide association study of 308 untargeted metabolite levels among African Americans from the Atherosclerosis Risk in Communities (ARIC) Study. Nineteen significant common variant-metabolite associations were identified, including 13 novel loci (p<1.6×10^−10). These loci were associated with 7–50% of the difference in metabolite levels per allele, and the variance explained ranged from 4% to 20%. Fourteen genes were identified within the nineteen loci, and four of them contained non-synonymous substitutions in four enzyme-encoding genes (KLKB1, SIAE, CPS1, and NAT8); the other significant loci consist of eight other enzyme-encoding genes (ACE, GATM, ACY3, ACSM2B, THEM4, ADH4, UGT1A, TREH), a transporter gene (SLC6A13) and a polycystin protein gene (PKD2L1). In addition, four potential disease-associated paths were identified, including two direct longitudinal predictive relationships: NAT8 with N-acetylornithine, N-acetyl-1-methylhistidine and incident chronic kidney disease, and TREH with trehalose and incident diabetes. These results highlight the value of using endophenotypes proximal to gene function to discover new insights into biology and disease pathology. PMID:24625756

  6. Identification and characterization of the steroid 15α-hydroxylase gene from Penicillium raistrickii.

    PubMed

    Jia, Longgang; Dong, Jianzhang; Wang, Ruijie; Mao, Shuhong; Lu, Fuping; Singh, Suren; Wang, Zhengxiang; Liu, Xiaoguang

    2017-08-01

    Penicillium raistrickii ATCC 10490 is used for the commercial preparation of 15α-13-methy-estr-4-ene-3,17-dione, a key intermediate in the synthesis of gestodene, which is a major component of third-generation contraceptive pills. Although it was previously shown that a cytochrome P450 enzyme in P. raistrickii is involved in steroid 15α-hydroxylation, the gene encoding the steroid 15α-hydroxylase remained unknown. In this study, we report the cloning and characterization of the 15α-hydroxylase gene from P. raistrickii ATCC 10490 by combining transcriptomic profiling with functional heterologous expression in Saccharomyces cerevisiae. The full-length open reading frame (ORF) of the 15α-hydroxylase gene P450pra is 1563 bp and predicted to encode a cytochrome P450 protein of 520 amino acids. Targeted gene deletion revealed that P450pra is solely responsible for 15α-hydroxylation activity on 13-methy-estr-4-ene-3,17-dione in P. raistrickii ATCC 10490. The identification of the 15α-hydroxylase gene from P. raistrickii should help elucidate the molecular basis of regio- and stereo-specificity of steroid 15α-hydroxylation and aid in the engineering of more efficient industrial strains for useful steroid 15α-hydroxylation reactions.

  7. Evolutionary analysis of hydrophobin gene family in two wood-degrading basidiomycetes, Phlebia brevispora and Heterobasidion annosum s.l.

    PubMed Central

    2013-01-01

    Background: Hydrophobins are small secreted cysteine-rich proteins that play diverse roles during different phases of fungal life cycle. In basidiomycetes, hydrophobin-encoding genes often form large multigene families with up to 40 members. The evolutionary forces driving hydrophobin gene expansion and diversification in basidiomycetes are poorly understood. The functional roles of individual genes within such gene families also remain unclear. The relationship between the hydrophobin gene number, the genome size and the lifestyle of respective fungal species has not yet been thoroughly investigated. Here, we present results of our survey of hydrophobin gene families in two species of wood-degrading basidiomycetes, Phlebia brevispora and Heterobasidion annosum s.l. We have also investigated the regulatory pattern of hydrophobin-encoding genes from H. annosum s.s. during saprotrophic growth on pine wood as well as on culture filtrate from Phlebiopsis gigantea using micro-arrays. These data are supplemented by results of the protein structure modeling for a representative set of hydrophobins. Results: We have identified hydrophobin genes from the genomes of two wood-degrading species of basidiomycetes, Heterobasidion irregulare, representing one of the microspecies within the aggregate H. annosum s.l., and Phlebia brevispora. Although a high number of hydrophobin-encoding genes were observed in H. irregulare (16 copies), a remarkable expansion of these genes was recorded in P. brevispora (26 copies). A significant expansion of hydrophobin-encoding genes in other analyzed basidiomycetes was also documented (1–40 copies), whereas contraction through gene loss was observed among the analyzed ascomycetes (1–11 copies). Our phylogenetic analysis confirmed the important role of gene duplication events in the evolution of hydrophobins in basidiomycetes. Increased number of hydrophobin-encoding genes appears to have been linked to the species’ ecological strategy, with the non-pathogenic fungi having increased numbers of hydrophobins compared with their pathogenic counterparts. However, there was no significant relationship between the number of hydrophobin-encoding genes and genome size. Furthermore, our results revealed significant differences in the expression levels of the 16 H. annosum s.s. hydrophobin-encoding genes which suggest possible differences in their regulatory patterns. Conclusions: A considerable expansion of the hydrophobin-encoding genes in basidiomycetes has been observed. The distribution and number of hydrophobin-encoding genes in the analyzed species may be connected to their ecological preferences. Results of our analysis also have shown that H. annosum s.l. hydrophobin-encoding genes may be under positive selection. Our gene expression analysis revealed differential expression of H. annosum s.s. hydrophobin genes under different growth conditions, indicating their possible functional diversification. PMID:24188142

  8. Glutathione S-transferase-encoding gene as a potential probe for environmental bacterial isolates capable of degrading polycyclic aromatic hydrocarbons.

    PubMed Central

    Lloyd-Jones, G; Lau, P C

    1997-01-01

    Homologs of the glutathione S-transferase (GST)-encoding gene were identified in a collection of aromatic hydrocarbon-degrading Sphingomonas spp. isolated from New Zealand, Antarctica, and the United States by using PCR primers designed from the GST-encoding gene of Sphingomonas paucimobilis EPA505. Sequence analysis of PCR fragments generated from these isolates and of the GST gene amplified from DNA extracted from polycyclic aromatic hydrocarbon (PAH)-contaminated soil revealed a high degree of conservation, which may make the GST-encoding gene a potentially useful marker for PAH-degrading bacteria. PMID:9251217

  9. Enterotoxin-encoding genes in Staphylococcus spp. from bulk goat milk.

    PubMed

    Lyra, Daniele G; Sousa, Francisca G C; Borges, Maria F; Givisiez, Patrícia E N; Queiroga, Rita C R E; Souza, Evandro L; Gebreyes, Wondwossen A; Oliveira, Celso J B

    2013-02-01

    Although Staphylococcus aureus has been implicated as the main Staphylococcus species causing human food poisoning, recent studies have shown that coagulase-negative Staphylococcus could also harbor enterotoxin-encoding genes. Such organisms are often present in goat milk and are the most important mastitis-causing agents. Therefore, this study aimed to investigate the occurrence of enterotoxin-encoding genes among coagulase-positive (CoPS) and coagulase-negative (CoNS) staphylococci isolated from raw goat milk produced in the semi-arid region of Paraiba, the most important region for goat milk production in Brazil. Enterotoxin-encoding genes were screened in 74 staphylococci isolates (30 CoPS and 44 CoNS) by polymerase chain reaction targeting the genes sea, seb, sec, sed, see, seg, seh, and sei. Enterotoxin-encoding genes were found in nine (12.2%) isolates, and four different genes (sea, sec, seg, and sei) were identified amongst the isolates. The most frequent genes were seg and sei, which were often found simultaneously in 44.5% of the isolates. The gene sec was the most frequent among the classical genes, and sea was found only in one isolate. All CoPS isolates (n=7) harboring enterotoxigenic genes were identified as S. aureus. The two coagulase-negative isolates were S. haemolyticus and S. hominis subsp. hominis and they harbored sei and sec genes, respectively. A higher frequency of enterotoxin-encoding genes was observed amongst CoPS (23.3%) than CoNS (4.5%) isolates (p<0.05), reinforcing the importance of S. aureus as a potential foodborne agent. However, the potential risk posed by CoNS in goat milk should not be ignored because it has a higher occurrence in goat milk and enterotoxin-encoding genes were detected in some isolates.
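
    The prevalence figures quoted above follow directly from the isolate counts. The sketch below recomputes them from the numbers given in the abstract (74 isolates, 30 CoPS and 44 CoNS; 9 positive isolates, 7 CoPS and 2 CoNS). The helper name `percent` is illustrative, and the snippet does not reproduce the statistical test behind the reported p<0.05.

```python
def percent(part: int, total: int) -> float:
    """Percentage of `part` in `total`, rounded to one decimal place."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * part / total, 1)


# Counts taken from the abstract.
isolates = {"CoPS": 30, "CoNS": 44}
positives = {"CoPS": 7, "CoNS": 2}

total_isolates = sum(isolates.values())    # 74
total_positives = sum(positives.values())  # 9

if __name__ == "__main__":
    print("overall:", percent(total_positives, total_isolates), "%")
    for group in isolates:
        print(group + ":", percent(positives[group], isolates[group]), "%")
```

    The group totals are internally consistent: 30 + 44 isolates give the 74 screened, and 7 + 2 positives give the nine enterotoxin-positive isolates.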

  10. Molecular characterization of two serine proteases expressed in gut tissue of the African trypanosome vector, Glossina morsitans morsitans.

    PubMed

    Yan, J; Cheng, Q; Li, C B; Aksoy, S

    2001-02-01

    Serine proteases are major insect gut enzymes involved in digestion of dietary proteins, and in addition they have been implicated in the process of pathogen establishment in several vector insects. The medically important vector, tsetse fly (Diptera: Glossinidae), is involved in the transmission of African trypanosomes, which cause devastating diseases in animals and humans. Both the male and female tsetse can transmit trypanosomes and both are strict bloodfeeders throughout all stages of their development. Here, we describe the characterization of two putative serine protease-encoding genes, Glossina serine protease-1 (Gsp1) and Glossina serine protease-2 (Gsp2) from gut tissue. Both putative cDNA products represent prepro peptides with hydrophobic signal peptide sequences associated with their 5' terminus. The Gsp1 cDNA encodes a putative mature protein of 245 amino acids with a molecular mass of 26 428 Da, while the predicted size of the 228 amino acid mature peptide encoded by Gsp2 cDNA is 24 573 Da. Both deduced peptides contain the Asp/His/Ser catalytic triad and the conserved residues surrounding it which are characteristic of serine proteases. In addition, both proteins have the six conserved cysteine residues that form the three cysteine bonds typically present in invertebrate serine proteases. Based on the presence of substrate-specific residues, the Gsp1 gene encodes a chymotrypsin-like protease while the Gsp2 gene encodes a protein with trypsin-like activity. Both proteins are encoded by few loci in the tsetse genome, being present in one or two copies only. The mRNA expression levels for the genes do not vary extensively throughout the digestive cycle, and high levels of mRNAs can be readily detected in the gut tissue of newly emerged flies. The levels of trypsin and chymotrypsin activities in the gut lumen increase following blood feeding and change significantly in the gut cells throughout the digestion cycle.
Hence, the regulation of expression for trypsin and chymotrypsin occurs at the post-transcriptional level in tsetse. Both the coding sequences and patterns of expression of Gsp1 and Gsp2 genes are similar to the serine proteases that have been reported from the bloodfeeding insect Stomoxys calcitrans.

  11. The SdiA-Regulated Gene srgE Encodes a Type III Secreted Effector

    PubMed Central

    Habyarimana, Fabien; Sabag-Daigle, Anice

    2014-01-01

    Salmonella enterica serovar Typhimurium is a food-borne pathogen that causes severe gastroenteritis. The ability of Salmonella to cause disease depends on two type III secretion systems (T3SSs) encoded in two distinct Salmonella pathogenicity islands, 1 and 2 (SPI1 and SPI2, respectively). S. Typhimurium encodes a solo LuxR homolog, SdiA, which can detect the acyl-homoserine lactones (AHLs) produced by other bacteria and upregulate the rck operon and the srgE gene. The srgE gene is predicted to encode a protein of 488 residues with a coiled-coil domain between residues 345 and 382. In silico studies have provided conflicting predictions as to whether SrgE is a T3SS substrate. Therefore, in this work, we tested the hypothesis that SrgE is a T3SS effector by two methods, a β-lactamase activity assay and a split green fluorescent protein (GFP) complementation assay. SrgE with β-lactamase fused to residue 40, 100, 150, or 300 was indeed expressed and translocated into host cells, but SrgE with β-lactamase fused to residue 400 or 488 was not expressed, suggesting interference by the coiled-coil domain. Similarly, SrgE with GFP S11 fused to residue 300, but not to residue 488, was expressed and translocated into host cells. With both systems, translocation into host cells was dependent upon SPI2. A phylogenetic analysis indicated that srgE is found only within Salmonella enterica subspecies. It is found sporadically within both typhoidal and nontyphoidal serovars, although the SrgE protein sequences found within typhoidal serovars tend to cluster separately from those found in nontyphoidal serovars, suggesting functional diversification. PMID:24727228

  12. Evolution and Structural Organization of the C Proteins of Paramyxovirinae

    PubMed Central

    Karlin, David G.

    2014-01-01

    The phosphoprotein (P) gene of most Paramyxovirinae encodes several proteins in overlapping frames: P and V, which share a common N-terminus (PNT), and C, which overlaps PNT. Overlapping genes are of particular interest because they encode proteins originated de novo, some of which have unknown structural folds, challenging the notion that nature utilizes only a limited, well-mapped area of fold space. The C proteins cluster in three groups, comprising measles, Nipah, and Sendai virus. We predicted that all C proteins have a similar organization: a variable, disordered N-terminus and a conserved, α-helical C-terminus. We confirmed this predicted organization by biophysically characterizing recombinant C proteins from Tupaia paramyxovirus (measles group) and human parainfluenza virus 1 (Sendai group). We also found that the C of the measles and Nipah groups have statistically significant sequence similarity, indicating a common origin. Although the C of the Sendai group lack sequence similarity with them, we speculate that they also have a common origin, given their similar genomic location and structural organization. Since C is dispensable for viral replication, unlike PNT, we hypothesize that C may have originated de novo by overprinting PNT in the ancestor of Paramyxovirinae. Intriguingly, in measles virus and Nipah virus, PNT encodes STAT1-binding sites that overlap different regions of the C-terminus of C, indicating they have probably originated independently. This arrangement, in which the same genetic region encodes simultaneously a crucial functional motif (a STAT1-binding site) and a highly constrained region (the C-terminus of C), seems paradoxical, since it should severely reduce the ability of the virus to adapt. The fact that it originated twice suggests that it must be balanced by an evolutionary advantage, perhaps from reducing the size of the genetic region vulnerable to mutations. PMID:24587180

  13. Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.

    PubMed Central

    Borodovsky, M; Rudd, K E; Koonin, E V

    1994-01-01

    The unannotated regions of the Escherichia coli genome DNA sequence from the EcoSeq6 database, totaling 1,278 'intergenic' sequences of the combined length of 359,279 basepairs, were analyzed using computer-assisted methods with the aim of identifying putative unknown genes. The proposed strategy for finding new genes includes two key elements: i) prediction of expressed open reading frames (ORFs) using the GeneMark method based on Markov chain models for coding and non-coding regions of Escherichia coli DNA, and ii) search for protein sequence similarities using programs based on the BLAST algorithm and programs for motif identification. A total of 354 putative expressed ORFs were predicted by GeneMark. Using the BLASTX and TBLASTN programs, it was shown that 208 ORFs located in the unannotated regions of the E. coli chromosome are significantly similar to other protein sequences. Identification of 182 ORFs as probable genes was supported by GeneMark and BLAST, comprising 51.4% of the GeneMark 'hits' and 87.5% of the BLAST 'hits'. 73 putative new genes, comprising 20.6% of the GeneMark predictions, belong to ancient conserved protein families that include both eubacterial and eukaryotic members. This value is close to the overall proportion of highly conserved sequences among eubacterial proteins, indicating that the majority of the putative expressed ORFs that are predicted by GeneMark, but have no significant BLAST hits, nevertheless are likely to be real genes. The majority of the putative genes identified by BLAST search have been described since the release of the EcoSeq6 database, but about 70 genes have not been detected so far. Among these new identifications are genes encoding proteins with a variety of predicted functions including dehydrogenases, kinases, several other metabolic enzymes, ATPases, rRNA methyltransferases, membrane proteins, and different types of regulatory proteins. PMID:7984428
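
    The overlap percentages in this abstract can be reproduced from the three counts it reports. The sketch below assumes that the 182 ORFs are exactly the intersection of the 354 GeneMark predictions and the 208 BLAST hits, which is how the abstract reads but is not spelled out; the variable and function names are illustrative.

```python
def percent(part: int, total: int) -> float:
    """Percentage of `part` in `total`, rounded to one decimal place."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * part / total, 1)


# Counts taken from the abstract.
genemark_orfs = 354          # ORFs predicted by GeneMark
blast_orfs = 208             # ORFs with significant BLAST similarity
supported_by_both = 182      # assumed to be the intersection of the two sets
conserved_family_genes = 73  # members of ancient conserved protein families

# Derived quantities (valid only under the intersection assumption above).
genemark_only = genemark_orfs - supported_by_both
blast_only = blast_orfs - supported_by_both

if __name__ == "__main__":
    print("share of GeneMark hits:", percent(supported_by_both, genemark_orfs))
    print("share of BLAST hits:", percent(supported_by_both, blast_orfs))
    print("conserved families:", percent(conserved_family_genes, genemark_orfs))
    print("GeneMark only:", genemark_only, "BLAST only:", blast_only)
```

    The derived GeneMark-only count corresponds to the ORFs that the abstract argues are still likely to be real genes despite having no significant BLAST hit.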

  14. γ-PGA Hydrolases of Phage Origin in Bacillus subtilis and Other Microbial Genomes.

    PubMed

    Mamberti, Stefania; Prati, Paola; Cremaschi, Paolo; Seppi, Claudio; Morelli, Carlo F; Galizzi, Alessandro; Fabbi, Massimo; Calvio, Cinzia

    2015-01-01

    Poly-γ-glutamate (γ-PGA) is an industrially interesting polymer secreted mainly by members of the class Bacilli which forms a shield able to protect bacteria from phagocytosis and phages. Few enzymes are known to degrade γ-PGA; among them is a phage-encoded γ-PGA hydrolase, PghP. The supposed role of PghP in phages is to ensure access to the surface of bacterial cells by dismantling the γ-PGA barrier. We identified four unannotated B. subtilis genes through similarity of their encoded products to PghP; in fact these genes reside in prophage elements of B. subtilis genome. The recombinant products of two of them demonstrate efficient polymer degradation, confirming that sequence similarity reflects functional homology. Genes encoding similar γ-PGA hydrolases were identified in phages specific for the order Bacillales and in numerous microbial genomes, not only belonging to that order. The distribution of the γ-PGA biosynthesis operon was also investigated with a bioinformatics approach; it was found that the list of organisms endowed with γ-PGA biosynthetic functions is larger than expected and includes several pathogenic species. Moreover in non-Bacillales bacteria the predicted γ-PGA hydrolase genes are preferentially found in species that do not have the genetic asset for polymer production. Our findings suggest that γ-PGA hydrolase genes might have spread across microbial genomes via horizontal exchanges rather than via phage infection. We hypothesize that, in natural habitats rich in γ-PGA supplied by producer organisms, the availability of hydrolases that release glutamate oligomers from γ-PGA might be a beneficial trait under positive selection.

  15. Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

    PubMed

    van der Ley, P

    1988-11-01

    Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.

  16. Unbiased View of Synaptic and Neuronal Gene Complement in Ctenophores: Are There Pan-neuronal and Pan-synaptic Genes across Metazoa?

    PubMed

    Moroz, Leonid L; Kohn, Andrea B

    2015-12-01

    Hypotheses of origins and evolution of neurons and synapses are controversial, mostly due to limited comparative data. Here, we investigated the genome-wide distribution of the bilaterian "synaptic" and "neuronal" protein-coding genes in non-bilaterian basal metazoans (Ctenophora, Porifera, Placozoa, and Cnidaria). First, there are no recognized genes uniquely expressed in neurons across all metazoan lineages. None of the so-called pan-neuronal genes such as embryonic lethal abnormal vision (ELAV), Musashi, or Neuroglobin are expressed exclusively in neurons of the ctenophore Pleurobrachia. Second, our comparative analysis of about 200 genes encoding canonical presynaptic and postsynaptic proteins in bilaterians suggests that there are no true "pan-synaptic" genes or genes uniquely and specifically attributed to all classes of synapses. The majority of these genes encode receptive and secretory complexes in a broad spectrum of eukaryotes. Trichoplax (Placozoa), an organism without neurons and synapses, has more orthologs of bilaterian synapse-related/neuron-related genes than do ctenophores, the group with well-developed neuronal and synaptic organization. Third, the majority of genes encoding ion channels and ionotropic receptors are broadly expressed in unicellular eukaryotes and non-neuronal tissues in metazoans. Therefore, they cannot be viewed as neuronal markers. Nevertheless, the co-expression of multiple types of ion channels and receptors does correlate with the presence of neural and synaptic organization. As an illustrative example, the ctenophore genomes encode a greater diversity of ion channels and ionotropic receptors compared with the genomes of the placozoan Trichoplax and the demosponge Amphimedon. Surprisingly, both placozoans and sponges have a similar number of orthologs of "synaptic" proteins as we identified in the genomes of two ctenophores. Ctenophores have a distinct synaptic organization compared with other animals.
Our analysis of transcriptomes from 10 different ctenophores did not detect recognized orthologs of synthetic enzymes encoding several classical, low-molecular-weight (neuro)transmitters; glutamate signaling machinery is one of the few exceptions. Novel peptidergic signaling molecules were predicted for ctenophores, together with the diversity of putative receptors including SCNN1/amiloride-sensitive sodium channel-like channels, many of which could be examples of a lineage-specific expansion within this group. In summary, our analysis supports the hypothesis of independent evolution of neurons and, as corollary, a parallel evolution of synapses. We suggest that the formation of synaptic machinery might occur more than once over 600 million years of animal evolution. © The Author 2015. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology.

  17. A Genomic View of the Sea Urchin Nervous System

    PubMed Central

    Burke, RD; Angerer, LM; Elphick, MR; Humphrey, GW; Yaguchi, S; Kiyama, T; Liang, S; Mu, X; Agca, C; Klein, WH; Brandhorst, BP; Rowe, M; Wilson, K; Churcher, AM; Taylor, JS; Chen, N; Murray, G; Wang, D; Mellott, D; Olinski, R; Hallböök, F; Thorndyke, MC

    2007-01-01

    The sequencing of the Strongylocentrotus purpuratus genome provides a unique opportunity to investigate the function and evolution of neural genes. The neurobiology of sea urchins is of particular interest because they have a close phylogenetic relationship with chordates, yet a distinctive pentaradiate body plan and unusual neural organization. Orthologues of transcription factors that regulate neurogenesis in other animals have been identified and several are expressed in neurogenic domains before gastrulation indicating that they may operate near the top of a conserved neural gene regulatory network. A family of genes encoding voltage-gated ion channels is present but, surprisingly, genes encoding gap junction proteins (connexins and pannexins) appear to be absent. Genes required for synapse formation and function have been identified and genes for synthesis and transport of neurotransmitters are present. There is a large family of G-protein-coupled receptors, including 874 rhodopsin-type receptors, 28 metabotropic glutamate-like receptors and a remarkably expanded group of 161 secretin receptor-like proteins. Absence of cannabinoid, lysophospholipid and melanocortin receptors indicates that this group may be unique to chordates. There are at least 37 putative G-protein coupled peptide receptors and precursors for several neuropeptides and peptide hormones have been identified, including SALMFamides, NGFFFamide, a vasotocin-like peptide, glycoprotein hormones, and insulin/insulin-like growth factors. Identification of a neurotrophin-like gene and Trk receptor in sea urchin indicates that this neural signaling system is not unique to chordates. Several hundred chemoreceptor genes have been predicted using several approaches, a number similar to that for other animals. 
Intriguingly, genes encoding homologues of rhodopsin, Pax6 and several other key mammalian retinal transcription factors are expressed in tube feet, suggesting tube feet function as photosensory organs. Analysis of the sea urchin genome presents a unique perspective on the evolutionary history of deuterostome nervous systems and reveals new approaches to investigate the development and neurobiology of sea urchins. PMID:16965768

  18. Transcription Factors Encoded on Core and Accessory Chromosomes of Fusarium oxysporum Induce Expression of Effector Genes

    PubMed Central

    van der Does, H. Charlotte; Schmidt, Sarah M.; Langereis, Léon; Hughes, Timothy R.

    2016-01-01

    Proteins secreted by pathogens during host colonization largely determine the outcome of pathogen-host interactions and are commonly called ‘effectors’. In fungal plant pathogens, coordinated transcriptional up-regulation of effector genes is a key feature of pathogenesis and effectors are often encoded in genomic regions with distinct repeat content, histone code and rate of evolution. In the tomato pathogen Fusarium oxysporum f. sp. lycopersici (Fol), effector genes reside on one of four accessory chromosomes, known as the ‘pathogenicity’ chromosome, which can be exchanged between strains through horizontal transfer. The three other accessory chromosomes in the Fol reference strain may also be important for virulence towards tomato. Expression of effector genes in Fol is highly up-regulated upon infection and requires Sge1, a transcription factor encoded on the core genome. Interestingly, the pathogenicity chromosome itself contains 13 predicted transcription factor genes and for all except one, there is a homolog on the core genome. We determined DNA binding specificity for nine transcription factors using oligonucleotide arrays. The binding sites for homologous transcription factors were highly similar, suggesting that extensive neofunctionalization of DNA binding specificity has not occurred. Several DNA binding sites are enriched on accessory chromosomes, and expression of FTF1, its core homolog FTF2 and SGE1 from a constitutive promoter can induce expression of effector genes. The DNA binding sites of only these three transcription factors are enriched among genes up-regulated during infection. We further show that Ftf1, Ftf2 and Sge1 can activate transcription from their binding sites in yeast. RNAseq analysis revealed that in strains with constitutive expression of FTF1, FTF2 or SGE1, expression of a similar set of plant-responsive genes on the pathogenicity chromosome is induced, including most effector genes. 
We conclude that the Fol pathogenicity chromosome may be partially transcriptionally autonomous, but there are also extensive transcriptional connections between core and accessory chromosomes. PMID:27855160

  19. Predicting essential genes for identifying potential drug targets in Aspergillus fumigatus.

    PubMed

    Lu, Yao; Deng, Jingyuan; Rhodes, Judith C; Lu, Hui; Lu, Long Jason

    2014-06-01

    Aspergillus fumigatus (Af) is a ubiquitous and opportunistic pathogen capable of causing acute, invasive pulmonary disease in susceptible hosts. Despite current therapeutic options, mortality associated with invasive Af infections remains unacceptably high, increasing 357% since 1980. Therefore, there is an urgent need for the development of novel therapeutic strategies, including more efficacious drugs acting on new targets. Thus, as noted in a recent review, "the identification of essential genes in fungi represents a crucial step in the development of new antifungal drugs". Expanding the target space by rapidly identifying new essential genes has thus been described as "the most important task of genomics-based target validation". In previous research, we were the first to show that essential gene annotation can be reliably transferred among four distantly related prokaryotic species. In this study, we extend our machine learning approach to the much more complex eukaryotic fungal species. A compendium of essential genes is predicted in Af by transferring known essential gene annotations from another filamentous fungus, Neurospora crassa. This approach predicts essential genes by integrating diverse types of intrinsic and context-dependent genomic features encoded in microbial genomes. The predicted essential datasets contained 1674 genes. We validated our results by comparing our predictions with known essential genes in Af, comparing our predictions with those predicted by homology mapping, and conducting conditional expressed alleles. We applied several layers of filters and selected a set of potential drug targets from the predicted essential genes. Finally, we have conducted wet lab knockout experiments to verify our predictions, which further validates the accuracy and wide applicability of the machine learning approach.
The approach presented here significantly extended our ability to predict essential genes beyond orthologs and made it possible to predict an inventory of essential genes in Eukaryotic fungal species, amongst which a preferred subset of suitable drug targets may be selected. By selecting the best new targets, we believe that resultant drugs would exhibit an unparalleled clinical impact against a naive pathogen population. Additional benefits that a compendium of essential genes can provide are important information on cell function and evolutionary biology. Furthermore, mapping essential genes to pathways may also reveal critical check points in the pathogen's metabolism. Finally, this approach is highly reproducible and portable, and can be easily applied to predict essential genes in many more pathogenic microbes, especially those unculturable. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Using co-expression analysis and stress-based screens to uncover Arabidopsis peroxisomal proteins involved in drought response

    DOE PAGES

    Li, Jiying; Hu, Jianping; Bassham, Diane

    2015-09-14

    Peroxisomes are essential organelles that house a wide array of metabolic reactions important for plant growth and development. However, our knowledge regarding the role of peroxisomal proteins in various biological processes, including plant stress response, is still incomplete. Recent proteomic studies of plant peroxisomes significantly increased the number of known peroxisomal proteins and greatly facilitated the study of peroxisomes at the systems level. The objectives of this study were to determine whether genes that encode peroxisomal proteins with related functions are co-expressed in Arabidopsis and identify peroxisomal proteins involved in stress response using in silico analysis and mutant screens. Using microarray data from online databases, we performed hierarchical clustering analysis to generate a comprehensive view of transcript level changes for Arabidopsis peroxisomal genes during development and under abiotic and biotic stress conditions. Many genes involved in the same metabolic pathways exhibited co-expression, some genes known to be involved in stress response are regulated by the corresponding stress conditions, and function of some peroxisomal proteins could be predicted based on their coexpression pattern. Since drought caused expression changes to the highest number of genes that encode peroxisomal proteins, we subjected a subset of Arabidopsis peroxisomal mutants to a drought stress assay. Mutants of the LON2 protease and the photorespiratory enzyme hydroxypyruvate reductase 1 (HPR1) showed enhanced susceptibility to drought, suggesting the involvement of peroxisomal quality control and photorespiration in drought resistance.
Lastly, our study provided a global view of how genes that encode peroxisomal proteins respond to developmental and environmental cues and began to reveal additional peroxisomal proteins involved in stress response, thus opening up new avenues to investigate the role of peroxisomes in plant adaptation to environmental stresses.

  1. Oncoprotein AEG-1 is an endoplasmic reticulum RNA-binding protein whose interactome is enriched in organelle resident protein-encoding mRNAs.

    PubMed

    Hsu, Jack C-C; Reid, David W; Hoffman, Alyson M; Sarkar, Devanand; Nicchitta, Christopher V

    2018-05-01

    Astrocyte elevated gene-1 (AEG-1), an oncogene whose overexpression promotes tumor cell proliferation, angiogenesis, invasion, and enhanced chemoresistance, is thought to function primarily as a scaffolding protein, regulating PI3K/Akt and Wnt/β-catenin signaling pathways. Here we report that AEG-1 is an endoplasmic reticulum (ER) resident integral membrane RNA-binding protein (RBP). Examination of the AEG-1 RNA interactome by HITS-CLIP and PAR-CLIP methodologies revealed a high enrichment for endomembrane organelle-encoding transcripts, most prominently those encoding ER resident proteins, and within this cohort, for integral membrane protein-encoding RNAs. Cluster mapping of the AEG-1/RNA interaction sites demonstrated a normalized rank order interaction of coding sequence >5' untranslated region, with 3' untranslated region interactions only weakly represented. Intriguingly, AEG-1/membrane protein mRNA interaction sites clustered downstream from encoded transmembrane domains, suggestive of a role in membrane protein biogenesis. Secretory and cytosolic protein-encoding mRNAs were also represented in the AEG-1 RNA interactome, with the latter category notably enriched in genes functioning in mRNA localization, translational regulation, and RNA quality control. Bioinformatic analyses of RNA-binding motifs and predicted secondary structure characteristics indicate that AEG-1 lacks established RNA-binding sites though shares the property of high intrinsic disorder commonly seen in RBPs. These data implicate AEG-1 in the localization and regulation of secretory and membrane protein-encoding mRNAs and provide a framework for understanding AEG-1 function in health and disease. © 2018 Hsu et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  2. Aberrant RNA splicing in cancer; expression changes and driver mutations of splicing factor genes.

    PubMed

    Sveen, A; Kilpinen, S; Ruusulehto, A; Lothe, R A; Skotheim, R I

    2016-05-12

    Alternative splicing is a widespread process contributing to structural transcript variation and proteome diversity. In cancer, the splicing process is commonly disrupted, resulting in both functional and non-functional end-products. Cancer-specific splicing events are known to contribute to disease progression; however, the dysregulated splicing patterns found on a genome-wide scale have until recently been less well-studied. In this review, we provide an overview of aberrant RNA splicing and its regulation in cancer. We then focus on the executors of the splicing process. Based on a comprehensive catalog of splicing factor encoding genes and analyses of available gene expression and somatic mutation data, we identify cancer-associated patterns of dysregulation. Splicing factor genes are shown to be significantly differentially expressed between cancer and corresponding normal samples, and to have reduced inter-individual expression variation in cancer. Furthermore, we identify enrichment of predicted cancer-critical genes among the splicing factors. In addition to previously described oncogenic splicing factor genes, we propose 24 novel cancer-critical splicing factors predicted from somatic mutations.

  3. Identification of a β-glucosidase from the Mucor circinelloides genome by peptide pattern recognition.

    PubMed

    Huang, Yuhong; Busk, Peter Kamp; Grell, Morten Nedergaard; Zhao, Hai; Lange, Lene

    2014-12-01

    Mucor circinelloides produces plant cell wall degrading enzymes that allow it to grow on complex polysaccharides. Although the genome of M. circinelloides has been sequenced, only a few plant cell wall degrading enzymes are annotated in this species. We applied peptide pattern recognition, a non-alignment-based method for sequence analysis, to map conserved sequences in glycoside hydrolase families. The conserved sequences were used to identify similar genes in the M. circinelloides genome. We found 12 different novel genes encoding members of the GH3, GH5, GH9, GH16, GH38, GH47 and GH125 families in M. circinelloides. One of the two GH3-encoding genes was predicted to encode a β-glucosidase (EC 3.2.1.21). We expressed this gene in Pichia pastoris KM71H and found that the purified recombinant protein had relatively high β-glucosidase activity (1.73 U/mg) at pH 5 and 50°C. The Km and Vmax with p-nitrophenyl-β-d-glucopyranoside as substrate were 0.20 mM and 2.41 U/mg, respectively. The enzyme was not inhibited by glucose and retained 84% activity at glucose concentrations up to 140 mM. Although zygomycetes are not considered to be important degraders of lignocellulosic biomass in nature, the present finding of an active β-glucosidase in M. circinelloides demonstrates that enzymes from this group of fungi have a potential for cellulose degradation. Copyright © 2014 Elsevier Inc. All rights reserved.

  4. Evidence for a Ustilago maydis Steroid 5α-Reductase by Functional Expression in Arabidopsis det2-1 Mutants

    PubMed Central

    Basse, Christoph W.; Kerschbamer, Christine; Brustmann, Markus; Altmann, Thomas; Kahmann, Regine

    2002-01-01

    We have identified a gene (udh1) in the basidiomycete Ustilago maydis that is induced during the parasitic interaction with its host plant maize (Zea mays). udh1 encodes a protein with high similarity to mammalian and plant 5α-steroid reductases. Udh1 differs from those of known 5α-steroid reductases by six additional domains, partially predicted to be membrane-spanning. A fusion protein of Udh1 and the green fluorescent protein provided evidence for endoplasmic reticulum localization in U. maydis. The function of the Udh1 protein was demonstrated by complementing Arabidopsis det2-1 mutants, which display a dwarf phenotype due to a mutation in the 5α-steroid reductase encoding DET2 gene. det2-1 mutant plants expressing either the udh1 or the DET2 gene controlled by the cauliflower mosaic virus 35S promoter differed from wild-type Columbia plants by accelerated stem growth, flower and seed development and a reduction in size and number of rosette leaves. The accelerated growth phenotype of udh1 transgenic plants was stably inherited and was favored under reduced light conditions. Truncation of the N-terminal 70 amino acids of the Udh1 protein abolished the ability to restore growth in det2-1 plants. Our results demonstrate the existence of a 5α-steroid reductase encoding gene in fungi and suggest a common ancestor between fungal, plant, and mammalian proteins. PMID:12068114

  5. Evidence for a Ustilago maydis steroid 5alpha-reductase by functional expression in Arabidopsis det2-1 mutants.

    PubMed

    Basse, Christoph W; Kerschbamer, Christine; Brustmann, Markus; Altmann, Thomas; Kahmann, Regine

    2002-06-01

    We have identified a gene (udh1) in the basidiomycete Ustilago maydis that is induced during the parasitic interaction with its host plant maize (Zea mays). udh1 encodes a protein with high similarity to mammalian and plant 5alpha-steroid reductases. Udh1 differs from those of known 5alpha-steroid reductases by six additional domains, partially predicted to be membrane-spanning. A fusion protein of Udh1 and the green fluorescent protein provided evidence for endoplasmic reticulum localization in U. maydis. The function of the Udh1 protein was demonstrated by complementing Arabidopsis det2-1 mutants, which display a dwarf phenotype due to a mutation in the 5alpha-steroid reductase encoding DET2 gene. det2-1 mutant plants expressing either the udh1 or the DET2 gene controlled by the cauliflower mosaic virus 35S promoter differed from wild-type Columbia plants by accelerated stem growth, flower and seed development and a reduction in size and number of rosette leaves. The accelerated growth phenotype of udh1 transgenic plants was stably inherited and was favored under reduced light conditions. Truncation of the N-terminal 70 amino acids of the Udh1 protein abolished the ability to restore growth in det2-1 plants. Our results demonstrate the existence of a 5alpha-steroid reductase encoding gene in fungi and suggest a common ancestor between fungal, plant, and mammalian proteins.

  6. Proteogenomic characterization of human colon and rectal cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Bing; Wang, Jing; Wang, Xiaojing

    2014-09-18

    We analyzed proteomes of colon and rectal tumors previously characterized by the Cancer Genome Atlas (TCGA) and performed integrated proteogenomic analyses. Protein sequence variants encoded by somatic genomic variations displayed reduced expression compared to protein variants encoded by germline variations. mRNA transcript abundance did not reliably predict protein expression differences between tumors. Proteomics identified five protein expression subtypes, two of which were associated with the TCGA "MSI/CIMP" transcriptional subtype, but had distinct mutation and methylation patterns and associated with different clinical outcomes. Although CNAs showed strong cis- and trans-effects on mRNA expression, relatively few of these extend to the protein level. Thus, proteomics data enabled prioritization of candidate driver genes. Our analyses identified HNF4A, a novel candidate driver gene in tumors with chromosome 20q amplifications. Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords novel insights into cancer biology.

  7. Investigating sesquiterpene biosynthesis in Ginkgo biloba: molecular cloning and functional characterization of (E,E)-farnesol and α-bisabolene synthases.

    PubMed

    Parveen, Iffat; Wang, Mei; Zhao, Jianping; Chittiboyina, Amar G; Tabanca, Nurhayat; Ali, Abbas; Baerson, Scott R; Techen, Natascha; Chappell, Joe; Khan, Ikhlas A; Pan, Zhiqiang

    2015-11-01

    Ginkgo biloba is one of the oldest living tree species and has been extensively investigated as a source of bioactive natural compounds, including bioactive flavonoids, diterpene lactones, terpenoids and polysaccharides which accumulate in foliar tissues. Despite this chemical diversity, relatively few enzymes associated with any biosynthetic pathway from ginkgo have been characterized to date. In the present work, predicted transcripts potentially encoding enzymes associated with the biosynthesis of diterpenoid and terpenoid compounds, including putative terpene synthases, were first identified by mining publicly-available G. biloba RNA-seq data sets. Recombinant enzyme studies with two of the TPS-like sequences led to the identification of GbTPS1 and GbTPS2, encoding farnesol and bisabolene synthases, respectively. Additionally, the phylogenetic analysis revealed the two terpene synthase genes as primitive genes that might have evolved from an ancestral diterpene synthase.

  8. Cellular and molecular biology of orphan G protein-coupled receptors.

    PubMed

    Oh, Da Young; Kim, Kyungjin; Kwon, Hyuk Bang; Seong, Jae Young

    2006-01-01

    The superfamily of G protein-coupled receptors (GPCRs) is the largest and most diverse group of membrane-spanning proteins. It plays a variety of roles in pathophysiological processes by transmitting extracellular signals to cells via heterotrimeric G proteins. Completion of the human genome project revealed the presence of approximately 168 genes encoding established nonsensory GPCRs, as well as 207 genes predicted to encode novel GPCRs for which the natural ligands remained to be identified, the so-called orphan GPCRs. Eighty-six of these orphans have now been paired to novel or previously known molecules, and 121 remain to be deorphaned. A better understanding of the GPCR structures and classification; knowledge of the receptor activation mechanism, either dependent on or independent of an agonist; increased understanding of the control of GPCR-mediated signal transduction; and development of appropriate ligand screening systems may improve the probability of discovering novel ligands for the remaining orphan GPCRs.

  9. A Metagenome-Wide Association Study and Arrayed Mutant Library Confirm Acetobacter Lipopolysaccharide Genes Are Necessary for Association with Drosophila melanogaster.

    PubMed

    White, K Makay; Matthews, Melinda K; Hughes, Rachel C; Sommer, Andrew J; Griffitts, Joel S; Newell, Peter D; Chaston, John M

    2018-03-28

    A metagenome-wide association (MGWA) study of bacterial host association determinants in Drosophila predicted that LPS biosynthesis genes are significantly associated with host colonization. We were unable to create site-directed mutants for each of the predicted genes in Acetobacter, so we created an arrayed transposon insertion library using Acetobacter fabarum DsW_054 isolated from Drosophila. Creation of the A. fabarum DsW_054 gene knock-out library was performed by combinatorial mapping and Illumina sequencing of random transposon insertion mutants. Transposon insertion locations for 6,418 mutants were successfully mapped, including hits within 63% of annotated genes in the A. fabarum DsW_054 genome. For 45/45 members of the library, insertion sites were verified by arbitrary PCR and Sanger sequencing. Mutants with insertions in four different LPS biosynthesis genes were selected from the library to validate the MGWA predictions. Insertion mutations in two genes biosynthetically upstream of Lipid-A formation, lpxC and lpxB, show significant differences in host association, whereas mutations in two genes encoding LPS biosynthesis functions downstream of Lipid-A biosynthesis had no effect. These results suggest an impact of bacterial cell surface molecules on the bacterial capacity for host association. Also, the transposon insertion mutant library will be a useful resource for ongoing research on the genetic basis for Acetobacter traits. Copyright © 2018 White et al.

  10. Regulatory gene networks and the properties of the developmental process

    NASA Technical Reports Server (NTRS)

    Davidson, Eric H.; McClay, David R.; Hood, Leroy

    2003-01-01

    Genomic instructions for development are encoded in arrays of regulatory DNA. These specify large networks of interactions among genes producing transcription factors and signaling components. The architecture of such networks both explains and predicts developmental phenomenology. Although network analysis is yet in its early stages, some fundamental commonalities are already emerging. Two such are the use of multigenic feedback loops to ensure the progressivity of developmental regulatory states and the prevalence of repressive regulatory interactions in spatial control processes. Gene regulatory networks make it possible to explain the process of development in causal terms and eventually will enable the redesign of developmental regulatory circuitry to achieve different outcomes.

  11. Comparative analysis of amino acid composition in the active site of nirk gene encoding copper-containing nitrite reductase (CuNiR) in bacterial spp.

    PubMed

    Adhikari, Utpal Kumar; Rahman, M Mizanur

    2017-04-01

    The nirk gene encodes the copper-containing nitrite reductase (CuNiR), a key catalytic enzyme in the environmental denitrification process that helps to produce nitric oxide from nitrite. The molecular mechanism of the denitrification process is complex, and here a theoretical investigation was conducted to determine the sequence information and amino acid composition of the active site of the CuNiR enzyme using various bioinformatics tools. Ten FASTA-formatted sequences were retrieved from the NCBI database, and domain and disordered-region identification and phylogenetic analyses were performed on these sequences. Comparative modeling of the proteins was performed with the Modeller 9v14 program and visualized with PyMOL. Validated protein models were deposited in the Protein Model Database (PMDB) (PMDB id: PM0080150 to PM0080159). Active sites of the nirk-encoded CuNiR enzyme were identified with the CASTp server. PROCHECK showed significant scores for four protein models in the most favored regions of the Ramachandran plot. Active-site and cavity prediction showed that the amino acids glycine, alanine, histidine, aspartic acid, glutamic acid, threonine, and glutamine were common to the four predicted protein models. The present in silico study anticipates that the active-site analysis results will pave the way for further experimental research on the complex denitrification mechanism of the selected species. Copyright © 2016. Published by Elsevier Ltd.

  12. Two host cytoplasmic effectors are required for pathogenesis of Phytophthora sojae by suppression of host defenses.

    PubMed

    Liu, Tingli; Ye, Wenwu; Ru, Yanyan; Yang, Xinyu; Gu, Biao; Tao, Kai; Lu, Shan; Dong, Suomeng; Zheng, Xiaobo; Shan, Weixing; Wang, Yuanchao; Dou, Daolong

    2011-01-01

    Phytophthora sojae encodes hundreds of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling- and necrosis-inducing proteins (CRN) or Crinkler. Their functions and mechanisms in pathogenesis are mostly unknown. Here, we identify a group of five P. sojae-specific CRN-like genes with high levels of sequence similarity, of which three are putative pseudogenes. Functional analysis shows that the two functional genes encode proteins with predicted nuclear localization signals that induce contrasting responses when expressed in Nicotiana benthamiana and soybean (Glycine max). PsCRN63 induces cell death, while PsCRN115 suppresses cell death elicited by the P. sojae necrosis-inducing protein (PsojNIP) or PsCRN63. Expression of CRN fragments with deleted signal peptides and FLAK motifs demonstrates that the carboxyl-terminal portions of PsCRN63 or PsCRN115 are sufficient for their activities. However, the predicted nuclear localization signal is required for PsCRN63 to induce cell death but not for PsCRN115 to suppress cell death. Furthermore, silencing of the PsCRN63 and PsCRN115 genes in P. sojae stable transformants leads to a reduction of virulence on soybean. Intriguingly, the silenced transformants lose the ability to suppress host cell death and callose deposition on inoculated plants. These results suggest a role for CRN effectors in the suppression of host defense responses.

  13. Functional requirements for bacteriophage growth: gene essentiality and expression in mycobacteriophage Giles.

    PubMed

    Dedrick, Rebekah M; Marinelli, Laura J; Newton, Gerald L; Pogliano, Kit; Pogliano, Joseph; Hatfull, Graham F

    2013-05-01

    Bacteriophages represent a majority of all life forms, and the vast, dynamic population with early origins is reflected in their enormous genetic diversity. A large number of bacteriophage genomes have been sequenced. They are replete with novel genes without known relatives. We know little about their functions, which genes are required for lytic growth, and how they are expressed. Furthermore, the diversity is such that even genes with required functions - such as virion proteins and repressors - cannot always be recognized. Here we describe a functional genomic dissection of mycobacteriophage Giles, in which the virion proteins are identified, genes required for lytic growth are determined, the repressor is identified, and the transcription patterns determined. We find that although all of the predicted phage genes are expressed either in lysogeny or in lytic growth, 45% of the predicted genes are non-essential for lytic growth. We also describe genes required for DNA replication, show that recombination is required for lytic growth, and that Giles encodes a novel repressor. RNAseq analysis reveals abundant expression of a small non-coding RNA in a lysogen and in late lytic growth, although it is non-essential for lytic growth and does not alter lysogeny. © 2013 Blackwell Publishing Ltd.

  14. Permanent draft genome sequence of Comamonas testosteroni KF-1

    PubMed Central

    Weiss, Michael; Kesberg, Anna I.; LaButti, Kurt M.; Pitluck, Sam; Bruce, David; Hauser, Loren; Copeland, Alex; Woyke, Tanja; Lowry, Stephen; Lucas, Susan; Land, Miriam; Goodwin, Lynne; Kjelleberg, Staffan; Cook, Alasdair M.; Buhmann, Matthias; Thomas, Torsten; Schleheck, David

    2013-01-01

    Comamonas testosteroni KF-1 is a model organism for the elucidation of the novel biochemical degradation pathways for xenobiotic 4-sulfophenylcarboxylates (SPC) formed during biodegradation of synthetic 4-sulfophenylalkane surfactants (linear alkylbenzenesulfonates, LAS) by bacterial communities. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 6,026,527 bp long chromosome (one sequencing gap) exhibits an average G+C content of 61.79% and is predicted to encode 5,492 protein-coding genes and 114 RNA genes. PMID:23991256

  15. Mosaic Structure of a Multiple-Drug-Resistant, Conjugative Plasmid from Campylobacter jejuni

    DTIC Science & Technology

    2005-01-30

    allele of each gene in the respective clones. There were three genes predicted to encode alleles of streptomycin-inactivating enzymes from Enterococcus ...aminoglycoside 6-adenyltransferase/E. faecium /NP_863159 3 cpp50 2599–2811 26.3 473 70 100/100 (70) Unknown of pTet plasmid/C. jejuni strain 81-176/YP_063493 4... faecium /NP_863159 24 sat4 17692–18222 37.7 180 176 94/94 (176) Streptothricin acetyltransferase/E. faecium /AAM77897 25 aphA-3 18315–19109 44.9 264

  16. 'Ca. Liberibacter asiaticus' proteins orthologous with pSymA-encoded proteins of Sinorhizobium meliloti: hypothetical roles in plant host interaction

    USDA-ARS's Scientific Manuscript database

    A nitrogen-fixing alfalfa-nodulating microsymbiont, Sinorhizobium meliloti, has a genome consisting of a 3.5 Mbp circular chromosome and two megaplasmids totaling 3.0 Mbp, one a 1.3 Mbp pSymA carrying nonessential ‘accessory’ genes including nif, nod and others involved in plant interaction. Predict...

  17. A draft whole genome sequence of “Candidatus Liberibacter asiaticus” strain TX2351 from Asian citrus psyllids in Texas, USA

    USDA-ARS's Scientific Manuscript database

    The draft genome sequence of “Candidatus Liberibacter asiaticus” strain TX2351 collected from ACP in South Texas has been determined. The TX2351 genome is 1,252,043 bp in size with a 36.5% G+C content, encoding 1,184 predicted open reading frames and 51 RNA genes....

  18. Complete Genome Sequence of Dehalobacterium formicoaceticum Strain DMC, a Strictly Anaerobic Dichloromethane-Degrading Bacterium

    DOE PAGES

    Chen, Gao; Murdoch, Robert W.; Mack, E. Erin; ...

    2017-09-14

    Dehalobacterium formicoaceticum utilizes dichloromethane as the sole energy source in defined anoxic bicarbonate-buffered mineral salt medium. The products are formate, acetate, inorganic chloride, and biomass. The bacterium’s genome was sequenced using PacBio, assembled, and annotated. The complete genome consists of one 3.77-Mb circular chromosome harboring 3,935 predicted protein-encoding genes.

  19. Complete Genome Sequence of Dehalobacterium formicoaceticum Strain DMC, a Strictly Anaerobic Dichloromethane-Degrading Bacterium

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Gao; Murdoch, Robert W.; Mack, E. Erin

    Dehalobacterium formicoaceticum utilizes dichloromethane as the sole energy source in defined anoxic bicarbonate-buffered mineral salt medium. The products are formate, acetate, inorganic chloride, and biomass. The bacterium’s genome was sequenced using PacBio, assembled, and annotated. The complete genome consists of one 3.77-Mb circular chromosome harboring 3,935 predicted protein-encoding genes.

  20. Characterization of receptor of activated C kinase 1 (RACK1) and functional analysis during larval metamorphosis of the oyster Crassostrea angulata.

    PubMed

    Yang, Bingye; Pu, Fei; Qin, Ji; You, Weiwei; Ke, Caihuan

    2014-03-10

    During a large-scale screen of the larval transcriptome library of the Portuguese oyster, Crassostrea angulata, the oyster gene RACK, which encodes a receptor of activated protein kinase C protein, was isolated and characterized. The cDNA is 1,148 bp long and has a predicted open reading frame encoding 317 aa. The predicted protein shows high sequence identity to many RACK proteins of different organisms including molluscs, fish, amphibians and mammals, suggesting that it is conserved during evolution. The structural analysis of the Ca-RACK1 genomic sequence implies that the Ca-RACK1 gene has seven exons and six introns, extending approximately 6.5 kb in length. It is expressed ubiquitously in many oyster tissues as detected by RT-PCR analysis. Ca-RACK1 mRNA expression was markedly increased at larval metamorphosis and was further increased along with Ca-RACK1 protein synthesis during epinephrine-induced metamorphosis. These results indicate that Ca-RACK1 plays an important role in tissue differentiation and/or in cell growth during larval metamorphosis in the oyster, C. angulata. Copyright © 2013 Elsevier B.V. All rights reserved.

  1. Cre-lox Univector acceptor vectors for functional screening in protoplasts: analysis of Arabidopsis donor cDNAs encoding ABSCISIC ACID INSENSITIVE1-Like protein phosphatases

    PubMed Central

    Jia, Fan; Gampala, Srinivas S.L.; Mittal, Amandeep; Luo, Qingjun; Rock, Christopher D.

    2009-01-01

    The 14,200 available full length Arabidopsis thaliana cDNAs in the Universal Plasmid System (UPS) donor vector pUNI51 should be applied broadly and efficiently to leverage a “functional map-space” of homologous plant genes. We have engineered Cre-lox UPS host acceptor vectors (pCR701-705) with N-terminal epitope tags in frame with the loxH site and downstream from the maize Ubiquitin promoter for use in transient protoplast expression assays and particle bombardment transformation of monocots. As an example of the utility of these vectors, we recombined them with several Arabidopsis cDNAs encoding Ser/Thr protein phosphatase type 2C (PP2Cs) known from genetic studies or predicted by hierarchical clustering meta-analysis to be involved in ABA and stress responses. Our functional results in Zea mays mesophyll protoplasts on ABA-inducible expression effects on the Late Embryogenesis Abundant promoter ProEm:GUS reporter were consistent with predictions and resulted in identification of novel activities of some PP2Cs. Deployment of these vectors can facilitate functional genomics and proteomics and identification of novel gene activities. PMID:19499346

  2. Structural characteristics of ScBx genes controlling the biosynthesis of hydroxamic acids in rye (Secale cereale L.).

    PubMed

    Bakera, Beata; Makowska, Bogna; Groszyk, Jolanta; Niziołek, Michał; Orczyk, Wacław; Bolibok-Brągoszewska, Hanna; Hromada-Judycka, Aneta; Rakoczy-Trojanowska, Monika

    2015-08-01

    Benzoxazinoids (BX) are major secondary metabolites of gramineous plants that play an important role in disease resistance and allelopathy. They also have many other unique properties including anti-bacterial and anti-fungal activity, and the ability to reduce alpha-amylase activity. The biosynthesis and modification of BX are controlled by the genes Bx1 to Bx10, GT and glu, and the majority of these Bx genes have been mapped in maize, wheat and rye. However, the genetic basis of BX biosynthesis remains largely uncharacterized apart from some data from maize and wheat. The aim of this study was to isolate, sequence and characterize five genes (ScBx1, ScBx2, ScBx3, ScBx4 and ScBx5) encoding enzymes involved in the synthesis of DIBOA, an important defense compound of rye. Using a modified 3D procedure of BAC library screening, seven BAC clones containing all of the ScBx genes were isolated and sequenced. Bioinformatic analyses of the resulting contigs were used to examine the structure and other features of these genes, including their promoters, introns and 3'UTRs. Comparative analysis showed that the ScBx genes are similar to those of other Poaceae species, especially to the TaBx genes. The polymorphisms present both in the coding sequences and non-coding regions of ScBx in relation to other Bx genes are predicted to have an impact on the expression, structure and properties of the encoded proteins.

  3. Comparative genomic analysis of the multispecies probiotic-marketed product VSL#3.

    PubMed

    Douillard, François P; Mora, Diego; Eijlander, Robyn T; Wels, Michiel; de Vos, Willem M

    2018-01-01

    Several probiotic-marketed formulations available for the consumers contain live lactic acid bacteria and/or bifidobacteria. The multispecies product commercialized as VSL#3 has been used for treating various gastro-intestinal disorders. However, like many other products, the bacterial strains present in VSL#3 have only been characterized to a limited extent and their efficacy as well as their predicted mode of action remain unclear, preventing further applications or comparative studies. In this work, the genomes of all eight bacterial strains present in VSL#3 were sequenced and characterized, to advance insights into the possible mode of action of this product and also to serve as a basis for future work and trials. Phylogenetic and genomic data analysis allowed us to identify the 7 species present in the VSL#3 product as specified by the manufacturer. The 8 strains present belong to the species Streptococcus thermophilus, Lactobacillus acidophilus, Lactobacillus paracasei, Lactobacillus plantarum, Lactobacillus helveticus, Bifidobacterium breve and B. animalis subsp. lactis (two distinct strains). Comparative genomics revealed that the draft genomes of the S. thermophilus and L. helveticus strains were predicted to encode most of the defence systems such as restriction modification and CRISPR-Cas systems. Genes associated with a variety of potential probiotic functions were also identified. Thus, in the three Bifidobacterium spp., gene clusters were predicted to encode tight adherence pili, known to promote bacteria-host interaction and intestinal barrier integrity, and to impact host cell development. Various repertoires of putative signalling proteins were predicted to be encoded by the genomes of the Lactobacillus spp., i.e. surface layer proteins, LPXTG-containing proteins, or sortase-dependent pili that may interact with the intestinal mucosa and dendritic cells. 
    Taken altogether, the individual genomic characterization of the strains present in the VSL#3 product confirmed the product specifications, determined its coding capacity as well as identified potential probiotic functions.

  4. Comparative genomic analysis of the multispecies probiotic-marketed product VSL#3

    PubMed Central

    Mora, Diego; Eijlander, Robyn T.; Wels, Michiel; de Vos, Willem M.

    2018-01-01

    Several probiotic-marketed formulations available for the consumers contain live lactic acid bacteria and/or bifidobacteria. The multispecies product commercialized as VSL#3 has been used for treating various gastro-intestinal disorders. However, like many other products, the bacterial strains present in VSL#3 have only been characterized to a limited extent and their efficacy as well as their predicted mode of action remain unclear, preventing further applications or comparative studies. In this work, the genomes of all eight bacterial strains present in VSL#3 were sequenced and characterized, to advance insights into the possible mode of action of this product and also to serve as a basis for future work and trials. Phylogenetic and genomic data analysis allowed us to identify the 7 species present in the VSL#3 product as specified by the manufacturer. The 8 strains present belong to the species Streptococcus thermophilus, Lactobacillus acidophilus, Lactobacillus paracasei, Lactobacillus plantarum, Lactobacillus helveticus, Bifidobacterium breve and B. animalis subsp. lactis (two distinct strains). Comparative genomics revealed that the draft genomes of the S. thermophilus and L. helveticus strains were predicted to encode most of the defence systems such as restriction modification and CRISPR-Cas systems. Genes associated with a variety of potential probiotic functions were also identified. Thus, in the three Bifidobacterium spp., gene clusters were predicted to encode tight adherence pili, known to promote bacteria-host interaction and intestinal barrier integrity, and to impact host cell development. Various repertoires of putative signalling proteins were predicted to be encoded by the genomes of the Lactobacillus spp., i.e. surface layer proteins, LPXTG-containing proteins, or sortase-dependent pili that may interact with the intestinal mucosa and dendritic cells. 
    Taken altogether, the individual genomic characterization of the strains present in the VSL#3 product confirmed the product specifications, determined its coding capacity as well as identified potential probiotic functions. PMID:29451876

  5. When Genome-Based Approach Meets the “Old but Good”: Revealing Genes Involved in the Antibacterial Activity of Pseudomonas sp. P482 against Soft Rot Pathogens

    PubMed Central

    Krzyżanowska, Dorota M.; Ossowicki, Adam; Rajewska, Magdalena; Maciąg, Tomasz; Jabłońska, Magdalena; Obuchowski, Michał; Heeb, Stephan; Jafra, Sylwia

    2016-01-01

    Dickeya solani and Pectobacterium carotovorum subsp. brasiliense are recently established species of bacterial plant pathogens causing black leg and soft rot of many vegetables and ornamental plants. Pseudomonas sp. strain P482 inhibits the growth of these pathogens, a desired trait considering the limited measures to combat these diseases. In this study, we determined the genetic background of the antibacterial activity of P482, and established the phylogenetic position of this strain. Pseudomonas sp. P482 was classified as Pseudomonas donghuensis. Genome mining revealed that the P482 genome does not contain genes determining the synthesis of known antimicrobials. However, the ClusterFinder algorithm, designed to detect atypical or novel classes of secondary metabolite gene clusters, predicted 18 such clusters in the genome. Screening of a Tn5 mutant library yielded an antimicrobial-negative transposon mutant. The transposon insertion was located in a gene encoding an HpcH/HpaI aldolase/citrate lyase family protein. This gene is located in a hypothetical cluster predicted by ClusterFinder, together with the downstream homologs of four nfs genes that confer production of a non-fluorescent siderophore by P. donghuensis HYST. Site-directed inactivation of the HpcH/HpaI aldolase gene, the adjacent short chain dehydrogenase gene, as well as a homolog of an essential nfs cluster gene, all abolished the antimicrobial activity of P482, suggesting their involvement in a common biosynthesis pathway. However, none of the mutants showed a decreased siderophore yield, nor was the antimicrobial activity of the wild type P482 compromised by high iron bioavailability. A genomic region comprising the nfs cluster and three upstream genes is involved in the antibacterial activity of P. donghuensis P482 against D. solani and P. carotovorum subsp. brasiliense. The genes studied are unique to the two known P. donghuensis strains.
    This study illustrates that mining of microbial genomes is a powerful approach for predicting the presence of novel secondary-metabolite encoding genes, especially when coupled with transposon mutagenesis. PMID:27303376

  6. A novel polyketide biosynthesis gene cluster is involved in fruiting body morphogenesis in the filamentous fungi Sordaria macrospora and Neurospora crassa.

    PubMed

    Nowrousian, Minou

    2009-04-01

    During fungal fruiting body development, hyphae aggregate to form multicellular structures that protect and disperse the sexual spores. Analysis of microarray data revealed a gene cluster strongly upregulated during fruiting body development in the ascomycete Sordaria macrospora. Real time PCR analysis showed that the genes from the orthologous cluster in Neurospora crassa are also upregulated during development. The cluster encodes putative polyketide biosynthesis enzymes, including a reducing polyketide synthase. Analysis of knockout strains of a predicted dehydrogenase gene from the cluster showed that mutants in N. crassa and S. macrospora are delayed in fruiting body formation. In addition to the upregulated cluster, the N. crassa genome comprises another cluster containing a polyketide synthase gene, and five additional reducing polyketide synthase (rpks) genes that are not part of clusters. To study the role of these genes in sexual development, expression of the predicted rpks genes in S. macrospora (five genes) and N. crassa (six genes) was analyzed; all but one are upregulated during sexual development. Analysis of knockout strains for the N. crassa rpks genes showed that one of them is essential for fruiting body formation. These data indicate that polyketides produced by RPKSs are involved in sexual development in filamentous ascomycetes.

  7. Function and specificity of synthetic Hox transcription factors in vivo

    PubMed Central

    Papadopoulos, Dimitrios K.; Vukojević, Vladana; Adachi, Yoshitsugu; Terenius, Lars; Rigler, Rudolf; Gehring, Walter J.

    2010-01-01

    Homeotic (Hox) genes encode transcription factors that confer segmental identity along the anteroposterior axis of the embryo. However, the molecular mechanisms underlying Hox-mediated transcription and the differential requirements for specificity in the regulation of the vast number of Hox-target genes remain ill-defined. Here we show that synthetic Sex combs reduced (Scr) genes that encode the Scr C terminus containing the homeodomain (HD) and YPWM motif (Scr-HD) are functional in vivo. Synthetic Scr-HD peptides can induce ectopic salivary glands in the embryo and homeotic transformations in the adult fly, act as transcriptional activators and repressors during development, and participate in protein-protein interactions. Their transformation capacity was found to be enhanced over their full-length counterpart and mutations known to transform the full-length protein into constitutively active or inactive variants behaved accordingly in the synthetic peptides. Our results show that synthetic Scr-HD genes are sufficient for homeotic function in Drosophila and suggest that the N terminus of Scr has a role in transcriptional potency, rather than specificity. We also demonstrate that synthetic peptides behave largely in a predictable way, by exhibiting Scr-specific phenotypes throughout development, which makes them an important tool for synthetic biology. PMID:20147626

  8. Characterization of the Rana grylio virus 3β-hydroxysteroid dehydrogenase and its novel role in suppressing virus-induced cytopathic effect

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sun Wei; Huang Youhua; Zhao Zhe

    2006-12-08

    The 3β-hydroxysteroid dehydrogenase (3β-HSD) isoenzymes play a key role in cellular steroid hormone synthesis. Here, a 3β-HSD gene homolog was cloned from Rana grylio virus (RGV), a member of family Iridoviridae. RGV 3β-HSD gene has 1068 bp, encoding a 355 aa predicted protein. Transcription analyses showed that RGV 3β-HSD gene was transcribed immediate-early during infection from an initiation site 19 nucleotides upstream of the translation start site. Confocal microscopy revealed that the 3β-HSD-EGFP fusion protein was exclusively colocalized with the mitochondria marker (pDsRed2-Mito) in EPC cells. Upon morphological observation and MTT assay, it was revealed that overexpression of RGV 3β-HSD in EPC cells could apparently suppress RGV-induced cytopathic effect (CPE). The present studies indicate that the RGV immediate-early 3β-HSD gene encodes a mitochondria-localized protein, which has a novel role in suppressing virus-induced CPE. All these suggest that RGV 3β-HSD might be a protein involved in host-virus interaction.
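    The gene and protein sizes quoted in this record agree with each other if the 1068 bp figure counts the stop codon, which the abstract does not state. A minimal Python sketch of that arithmetic follows; the function name and the `includes_stop` flag are illustrative assumptions, not part of the record.

```python
def protein_length_aa(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes an uninterrupted coding sequence (no introns) whose length is a
    multiple of three. The stop codon, if counted, encodes no residue.
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_bp // 3
    # Drop one codon when the stated length includes the stop codon (assumption).
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 1068 bp is the gene length given in the abstract.
    print(protein_length_aa(1068))
```

    Under that assumption, 1068 bp corresponds to 356 codons, one of which is the stop codon.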

  9. Biodiversity of genes encoding anti-microbial traits within plant associated microbes

    PubMed Central

    Mousa, Walaa K.; Raizada, Manish N.

    2015-01-01

    The plant is an attractive versatile home for diverse associated microbes. A subset of these microbes produces a diversity of anti-microbial natural products including polyketides, non-ribosomal peptides, terpenoids, heterocyclic nitrogenous compounds, volatile compounds, bacteriocins, and lytic enzymes. In recent years, detailed molecular analysis has led to a better understanding of the underlying genetic mechanisms. New genomic and bioinformatic tools have permitted comparisons of orthologous genes between species, leading to predictions of the associated evolutionary mechanisms responsible for diversification at the genetic and corresponding biochemical levels. The purpose of this review is to describe the biodiversity of biosynthetic genes of plant-associated bacteria and fungi that encode selected examples of antimicrobial natural products. For each compound, the target pathogen and biochemical mode of action are described, in order to draw attention to the complexity of these phenomena. We review recent information of the underlying molecular diversity and draw lessons through comparative genomic analysis of the orthologous coding sequences (CDS). We conclude by discussing emerging themes and gaps, discuss the metabolic pathways in the context of the phylogeny and ecology of their microbial hosts, and discuss potential evolutionary mechanisms that led to the diversification of biosynthetic gene clusters. PMID:25914708

  10. Trichoderma genes

    DOEpatents

    Foreman, Pamela [Los Altos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Van Solingen, Pieter [Naaldwijk, NL]; Ward, Michael [San Francisco, CA]

    2012-06-19

    Described herein are novel gene sequences isolated from Trichoderma reesei. Two genes encoding proteins comprising a cellulose binding domain, one encoding an arabinofuranosidase and one encoding an acetylxylan esterase, are described. The sequences, CIP1 and CIP2, contain a cellulose binding domain. These proteins are especially useful in the textile and detergent industry and in the pulp and paper industry.

  11. The rice blast resistance gene Ptr encodes an atypical protein required for broad spectrum disease resistance

    USDA-ARS's Scientific Manuscript database

    Plant resistance (R) genes typically encode proteins with nucleotide binding site-leucine rich repeat (NLR) domains. We identified a novel, broad-spectrum rice blast R gene, Ptr, encoding a non-NLR protein with four Armadillo repeats. Ptr was originally identified by fast neutron mutagenesis as a ...

  12. Protective Vaccination against Blood-Stage Malaria of Plasmodium chabaudi: Differential Gene Expression in the Liver of Balb/c Mice toward the End of Crisis Phase

    PubMed Central

    Al-Quraishy, Saleh A.; Dkhil, Mohamed A.; Abdel-Baki, Abdel-Azeem A.; Delic, Denis; Wunderlich, Frank

    2016-01-01

    Protective vaccination induces self-healing of otherwise fatal blood-stage malaria of Plasmodium chabaudi in female Balb/c mice. To trace processes critically involved in self-healing, the liver, an effector against blood-stage malaria, is analyzed for possible changes of its transcriptome in vaccination-protected in comparison to non-protected mice toward the end of the crisis phase. Gene expression microarray analyses reveal that vaccination does not affect constitutive expression of mRNA and lincRNA. However, malaria induces significant (p < 0.01) differences in hepatic gene and lincRNA expression in vaccination-protected vs. non-vaccinated mice toward the end of crisis phase. In vaccination-protected mice, infections induce up-regulations of 276 genes and 40 lincRNAs and down-regulations of 200 genes and 43 lincRNAs, respectively, by >3-fold as compared to the corresponding constitutive expressions. Massive up-regulations, partly by >100-fold, are found for genes as RhD, Add2, Ank1, Ermap, and Slc4a, which encode proteins of erythrocytic surface membranes, and as Gata1 and Gfi1b, which encode transcription factors involved in erythrocytic development. Also, Cldn13 previously predicted to be expressed on erythroblast surfaces is up-regulated by >200-fold, though claudins are known as main constituents of tight junctions acting as paracellular barriers between epithelial cells. Other genes are up-regulated by <100- and >10-fold, which can be subgrouped in genes encoding proteins known to be involved in mitosis, in cell cycle regulation, and in DNA repair. Our data suggest that protective vaccination enables the liver to respond to P. chabaudi infections with accelerated regeneration and extramedullary erythropoiesis during crisis, which contributes to survival of otherwise lethal blood-stage malaria. PMID:27471498

  13. Structure and expression of 12-oxophytodienoate reductase (subgroup I) genes in pea, and characterization of the oxidoreductase activities of their recombinant products.

    PubMed

    Matsui, H; Nakamura, G; Ishiga, Y; Toshima, H; Inagaki, Y; Toyoda, K; Shiraishi, T; Ichinose, Y

    2004-02-01

    Recently, we observed that expression of a pea gene (S64) encoding an oxophytodienoic acid reductase (OPR) was induced by a suppressor of pea defense responses, secreted by the pea pathogen Mycosphaerella pinodes. Because it is known that OPRs are usually encoded by families of homologous genes, we screened for genomic and cDNA clones encoding members of this putative OPR family in pea. We isolated five members of the OPR gene family from a pea genomic DNA library, and amplified six cDNA clones, including S64, by RT-PCR (reverse transcriptase-PCR). Sequencing analysis revealed that S64 corresponds to PsOPR2, and the amino acid sequences of the predicted products of the six OPR-like genes shared more than 80% identity with each other. Based on their sequence similarity, all these OPR-like genes code for OPRs of subgroup I, i.e., enzymes which are not required for jasmonic acid biosynthesis. However, the genes varied in their exon/intron organization and in their promoter sequences. To investigate the expression of each individual OPR-like gene, RT-PCR was performed using gene-specific primers. The results indicated that the OPR-like gene most strongly induced by the inoculation of pea plants with a compatible pathogen and by treatment with the suppressor from M. pinodes was PsOPR2. Furthermore, the ability of the six recombinant OPR-like proteins to reduce a model substrate, 2-cyclohexen-1-one (2-CyHE), was investigated. The results indicated that PsOPR1, 4 and 6 display robust activity, and PsOPR2 has a most remarkable ability to reduce 2-CyHE, whereas PsOPR3 has little and PsOPR5 does not reduce this compound. Thus, the six OPR-like proteins can be classified into four types. Interestingly, the gene structures, expression profiles, and enzymatic activities used to classify each member of the pea OPR-like gene family are clearly correlated, indicating that each member of this OPR-like family has a distinct function.

  14. Selenium Pretreatment Alleviated LPS-Induced Immunological Stress Via Upregulation of Several Selenoprotein Encoding Genes in Murine RAW264.7 Cells.

    PubMed

    Wang, Longqiong; Jing, Jinzhong; Yan, Hui; Tang, Jiayong; Jia, Gang; Liu, Guangmang; Chen, Xiaoling; Tian, Gang; Cai, Jingyi; Shang, Haiying; Zhao, Hua

    2018-04-18

    This study was conducted to profile selenoprotein encoding genes in mouse RAW264.7 cells upon lipopolysaccharide (LPS) challenge and integrate their roles into immunological regulation in response to selenium (Se) pretreatment. LPS was used to develop immunological stress in macrophages. Cells were pretreated with different levels of Se (0, 0.5, 1.0, 1.5, 2.0 μmol Se/L) for 2 h, followed by LPS (100 ng/mL) stimulation for another 3 h. The mRNA expression of 24 selenoprotein encoding genes and 9 inflammation-related genes were investigated. The results showed that LPS (100 ng/mL) effectively induced immunological stress in RAW264.7 cells with induced inflammation cytokines, IL-6 and TNF-α, mRNA expression, and cellular secretion. LPS increased (P < 0.05) mRNA profiles of 9 inflammation-related genes in cells, while short-time Se pretreatment modestly reversed (P < 0.05) the LPS-induced upregulation of 7 genes (COX-2, ICAM-1, IL-1β, IL-6, IL-10, iNOS, and MCP-1) and further increased (P < 0.05) expression of IFN-β and TNF-α in stressed cells. Meanwhile, LPS decreased (P < 0.05) mRNA levels of 18 selenoprotein encoding genes and upregulated mRNA levels of TXNRD1 and TXNRD3 in cells. Se pretreatment recovered (P < 0.05) expression of 3 selenoprotein encoding genes (GPX1, SELENOH, and SELENOW) in a dose-dependent manner and increased (P < 0.05) expression of another 5 selenoprotein encoding genes (SELENOK, SELENOM, SELENOS, SELENOT, and TXNRD2) only at a high level (2.0 μmol Se/L). Taken together, LPS-induced immunological stress in RAW264.7 cells accompanied with the global downregulation of selenoprotein encoding genes and Se pretreatment alleviated immunological stress via upregulation of a subset of selenoprotein encoding genes.

  15. Disruption of the psbA gene by the copy correction mechanism reveals that the expression of plastid-encoded genes is regulated by photosynthesis activity.

    PubMed

    Khan, Muhammad Sarwar; Hameed, Waqar; Nozoe, Mikio; Shiina, Takashi

    2007-05-01

    The functional analysis of genes encoded by the chloroplast genome of tobacco by reverse genetics is routine. Nevertheless, for a small number of genes their deletion generates heteroplasmic genotypes, complicating their analysis. There is thus the need for additional strategies to develop deletion mutants for these genes. We have developed a homologous copy correction-based strategy for deleting/mutating genes encoded on the chloroplast genome. This system was used to produce psbA knockouts. The resulting plants are homoplasmic and lack photosystem II (PSII) activity. Further, the deletion mutants exhibit a distinct phenotype; young leaves are green, whereas older leaves are bleached, irrespective of light conditions. This suggests that senescence is promoted by the absence of psbA. Analysis of the transcript levels indicates that NEP (nuclear-encoded plastid RNA polymerase)-dependent plastid genes are up regulated in the psbA deletion mutants, whereas the bleached leaves retain plastid-encoded plastid RNA polymerase activity. Hence, the expression of NEP-dependent plastid genes may be regulated by photosynthesis, either directly or indirectly.

  16. The Evolutionary Fate of the Genes Encoding the Purine Catabolic Enzymes in Hominoids, Birds, and Reptiles

    PubMed Central

    Keebaugh, Alaine C.; Thomas, James W.

    2010-01-01

    Gene loss has been proposed to play a major role in adaptive evolution, and recent studies are beginning to reveal its importance in human evolution. However, the potential consequence of a single gene-loss event upon the fates of functionally interrelated genes is poorly understood. Here, we use the purine metabolic pathway as a model system in which to explore this important question. The loss of urate oxidase (UOX) activity, a necessary step in this pathway, has occurred independently in the hominoid and bird/reptile lineages. Because the loss of UOX would have removed the functional constraint upon downstream genes in this pathway, these downstream genes are generally assumed to have subsequently deteriorated. In this study, we used a comparative genomics approach to empirically determine the fate of UOX itself and the downstream genes in five hominoids, two birds, and a reptile. Although we found that the loss of UOX likely triggered the genetic deterioration of the immediate downstream genes in the hominoids, surprisingly in the birds and reptiles, the UOX locus itself and some of the downstream genes were present in the genome and predicted to encode proteins. To account for the variable pattern of gene retention and loss after the inactivation of UOX, we hypothesize that although gene loss is a common fate for genes that have been rendered obsolete due to the upstream loss of an enzyme in a metabolic pathway, it is also possible that the same lack of constraint will foster the evolution of new functions or allow the optimization of preexisting alternative functions in the downstream genes, thereby resulting in gene retention. Thus, adaptive single-gene losses have the potential to influence the long-term evolutionary fate of functionally interrelated genes. PMID:20106906

  17. The evolutionary fate of the genes encoding the purine catabolic enzymes in hominoids, birds, and reptiles.

    PubMed

    Keebaugh, Alaine C; Thomas, James W

    2010-06-01

    Gene loss has been proposed to play a major role in adaptive evolution, and recent studies are beginning to reveal its importance in human evolution. However, the potential consequence of a single gene-loss event upon the fates of functionally interrelated genes is poorly understood. Here, we use the purine metabolic pathway as a model system in which to explore this important question. The loss of urate oxidase (UOX) activity, a necessary step in this pathway, has occurred independently in the hominoid and bird/reptile lineages. Because the loss of UOX would have removed the functional constraint upon downstream genes in this pathway, these downstream genes are generally assumed to have subsequently deteriorated. In this study, we used a comparative genomics approach to empirically determine the fate of UOX itself and the downstream genes in five hominoids, two birds, and a reptile. Although we found that the loss of UOX likely triggered the genetic deterioration of the immediate downstream genes in the hominoids, surprisingly in the birds and reptiles, the UOX locus itself and some of the downstream genes were present in the genome and predicted to encode proteins. To account for the variable pattern of gene retention and loss after the inactivation of UOX, we hypothesize that although gene loss is a common fate for genes that have been rendered obsolete due to the upstream loss of an enzyme in a metabolic pathway, it is also possible that the same lack of constraint will foster the evolution of new functions or allow the optimization of preexisting alternative functions in the downstream genes, thereby resulting in gene retention. Thus, adaptive single-gene losses have the potential to influence the long-term evolutionary fate of functionally interrelated genes.

  18. Vibrio Phage KVP40 Encodes a Functional NAD+ Salvage Pathway.

    PubMed

    Lee, Jae Yun; Li, Zhiqun; Miller, Eric S

    2017-05-01

    The genome of T4-type Vibrio bacteriophage KVP40 has five genes predicted to encode proteins of pyridine nucleotide metabolism, of which two, nadV and natV, would suffice for an NAD+ salvage pathway. NadV is an apparent nicotinamide phosphoribosyltransferase (NAmPRTase), and NatV is an apparent bifunctional nicotinamide mononucleotide adenylyltransferase (NMNATase) and nicotinamide-adenine dinucleotide pyrophosphatase (Nudix hydrolase). Genes encoding the predicted salvage pathway were cloned and expressed in Escherichia coli, the proteins were purified, and their enzymatic properties were examined. KVP40 NadV NAmPRTase is active in vitro, and a clone complements a Salmonella mutant defective in both the bacterial de novo and salvage pathways. Similar to other NAmPRTases, the KVP40 enzyme displayed ATPase activity indicative of energy coupling in the reaction mechanism. The NatV NMNATase activity was measured in a coupled reaction system demonstrating NAD+ biosynthesis from nicotinamide, phosphoribosyl pyrophosphate, and ATP. The NatV Nudix hydrolase domain was also shown to be active, with preferred substrates of ADP-ribose, NAD+, and NADH. Expression analysis using reverse transcription-quantitative PCR (qRT-PCR) and enzyme assays of infected Vibrio parahaemolyticus cells demonstrated nadV and natV transcription during the early and delayed-early periods of infection when other KVP40 genes of nucleotide precursor metabolism are expressed. The distribution and phylogeny of NadV and NatV proteins among several large double-stranded DNA (dsDNA) myophages, and also those from some very large siphophages, suggest broad relevance of pyridine nucleotide scavenging in virus-infected cells. NAD+ biosynthesis presents another important metabolic resource control point by large, rapidly replicating dsDNA bacteriophages.
    IMPORTANCE T4-type bacteriophages enhance DNA precursor synthesis through reductive reactions that use NADH/NADPH as the electron donor and NAD+ for ADP-ribosylation of proteins involved in transcribing and translating the phage genome. We show here that phage KVP40 encodes a functional pyridine nucleotide scavenging pathway that is expressed during the metabolic period of the infection cycle. The pathway is conserved in other large, dsDNA phages in which the two genes, nadV and natV, share an evolutionary history in their respective phage-host group. Copyright © 2017 American Society for Microbiology.

  19. Cloning and expression analysis of the ornithine decarboxylase gene (PbrODC) of the pathogenic fungus Paracoccidioides brasiliensis.

    PubMed

    Niño-Vega, Gustavo A; Sorais, Françoise; Calcagno, Ana-María; Ruiz-Herrera, José; Martínez-Espinoza, Alfredo D; San-Blas, Gioconda

    2004-02-01

    We describe the isolation and sequencing of PbrODC, the gene encoding ornithine decarboxylase (ODC) in Paracoccidioides brasiliensis. The gene contains a single open reading frame made of 1413 bp with a single intron (72 bp), and encodes a 447 amino acid polypeptide with a predicted molecular weight of 50.0 kDa, an isoelectric point of 4.9 and a high similarity to other fungal ornithine decarboxylases. Functionality of the gene was demonstrated by transformation into a Saccharomyces cerevisiae odc null mutant. A phylogenetic tree generated with several fungal ODCs provided additional evidence to favour a taxonomic position for P. brasiliensis as an ascomycetous fungus, belonging to the order Onygenales. Expression of the PbrODC gene was determined by Northern analyses during growth of the mycelial and yeast forms, and through the temperature-regulated dimorphic transition between these two extreme phases. Expression of PbrODC remained constant at all stages of the fungal growth, and did not correlate with a previously observed increase in the activity of ornithine decarboxylase at the onset of the budding process in both yeast growth and mycelium-to-yeast transition. Accordingly, post-transcriptional regulation for the product of PbrODC is suggested. Copyright 2004 John Wiley & Sons, Ltd.

  20. Efficient production of artificially designed gelatins with a Bacillus brevis system.

    PubMed

    Kajino, T; Takahashi, H; Hirai, M; Yamada, Y

    2000-01-01

    Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a sol-gel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.

  1. The Ep152R ORF of African swine fever virus strain Georgia encodes for an essential gene that interacts with host protein BAG6.

    PubMed

    Borca, Manuel V; O'Donnell, Vivian; Holinka, Lauren G; Rai, Devendra K; Sanford, Brenton; Alfano, Marialexia; Carlson, Jolene; Azzinaro, Paul A; Alonso, Covadonga; Gladue, Douglas P

    2016-09-02

    African swine fever virus (ASFV) is the etiological agent of a contagious and often lethal disease of domestic pigs that has significant economic consequences for the swine industry. The viral genome encodes for more than 150 genes, and only a select few of these genes have been studied in some detail. Here we report the characterization of open reading frame Ep152R that has a predicted complement control module/SCR domain. This domain is found in Vaccinia virus proteins that are involved in blocking the immune response during viral infection. A recombinant ASFV harboring a HA tagged version of the Ep152R protein was developed (ASFV-G-Ep152R-HA) and used to demonstrate that Ep152R is an early virus protein. Attempts to construct recombinant viruses having a deleted Ep152R gene were consistently unsuccessful indicating that Ep152R is an essential gene. Interestingly, analysis of host-protein interactions for Ep152R using a yeast two-hybrid screen, identified BAG6, a protein previously identified as being required for ASFV replication. Furthermore, fluorescent microscopy analysis confirms that Ep152R-BAG6 interaction actually occurs in cells infected with ASFV. Published by Elsevier B.V.

  2. WVD2 and WDL1 modulate helical organ growth and anisotropic cell expansion in Arabidopsis

    NASA Technical Reports Server (NTRS)

    Yuen, Christen Y L.; Pearlman, Rebecca S.; Silo-Suh, Laura; Hilson, Pierre; Carroll, Kathleen L.; Masson, Patrick H.

    2003-01-01

    Wild-type Arabidopsis roots develop a wavy pattern of growth on tilted agar surfaces. For many Arabidopsis ecotypes, roots also grow askew on such surfaces, typically slanting to the right of the gravity vector. We identified a mutant, wvd2-1, that displays suppressed root waving and leftward root slanting under these conditions. These phenotypes arise from transcriptional activation of the novel WAVE-DAMPENED2 (WVD2) gene by the cauliflower mosaic virus 35S promoter in mutant plants. Seedlings overexpressing WVD2 exhibit constitutive right-handed helical growth in both roots and etiolated hypocotyls, whereas the petioles of WVD2-overexpressing rosette leaves exhibit left-handed twisting. Moreover, the anisotropic expansion of cells is impaired, resulting in the formation of shorter and stockier organs. In roots, the phenotype is accompanied by a change in the arrangement of cortical microtubules within peripheral cap cells and cells at the basal end of the elongation zone. WVD2 transcripts are detectable by reverse transcriptase-polymerase chain reaction in multiple organs of wild-type plants. Its predicted gene product contains a conserved region named "KLEEK," which is found only in plant proteins. The Arabidopsis genome possesses seven other genes predicted to encode KLEEK-containing products. Overexpression of one of these genes, WVD2-LIKE 1, which encodes a protein with regions of similarity to WVD2 extending beyond the KLEEK domain, results in phenotypes that are highly similar to wvd2-1. Silencing of WVD2 and its paralogs results in enhanced root skewing in the wild-type direction. Our observations suggest that at least two members of this gene family may modulate both rotational polarity and anisotropic cell expansion during organ growth.

  3. Automated genomic context analysis and experimental validation platform for discovery of prokaryote transcriptional regulator functions

    DOE PAGES

    Martí-Arbona, Ricardo; Mu, Fangping; Nowak-Lovato, Kristy L.; ...

    2014-12-18

    The clustering of genes in a pathway and the co-location of functionally related genes are widely recognized in prokaryotes. We used these characteristics to predict the metabolic involvement for a Transcriptional Regulator (TR) of unknown function, and then identified and confirmed its biological activity. A software tool that identifies the genes encoded within a defined genomic neighborhood for the subject TR and its homologs was developed. The output lists the genes in the genetic neighborhoods, their annotated functions, and the reactants/products, and identifies the metabolic pathway in which the encoded proteins function. When a set of TRs of known function was analyzed, we observed that their homologs frequently had conserved genomic neighborhoods that co-located the metabolically related genes regulated by the subject TR. We postulate that TR effectors are metabolites in the identified pathways; indeed the known effectors were present. We analyzed Bxe_B3018 from Burkholderia xenovorans, a TR of unknown function, and predicted that this TR was related to glycine, threonine and serine degradation. We tested the binding of metabolites in these pathways and, for those that bound, their ability to modulate TR binding to its specific DNA operator sequence. Using rtPCR, we confirmed that methylglyoxal was an effector of Bxe_B3018. These studies provide the proof of concept and validation of a systematic approach to the discovery of the biological activity for proteins of unknown function, in this case a TR. Bxe_B3018 is a methylglyoxal responsive TR that controls the expression of an operon composed of a putative efflux system.

  4. Truncated Photosystem Chlorophyll Antenna Size in the Green Microalga Chlamydomonas reinhardtii upon Deletion of the TLA3-CpSRP43 Gene

    PubMed Central

    Kirst, Henning; Garcia-Cerdan, Jose Gines; Zurbriggen, Andreas; Ruehle, Thilo; Melis, Anastasios

    2012-01-01

    The truncated light-harvesting antenna size3 (tla3) DNA insertional transformant of Chlamydomonas reinhardtii is a chlorophyll-deficient mutant with a lighter green phenotype, a lower chlorophyll (Chl) per cell content, and higher Chl a/b ratio than corresponding wild-type strains. Functional analyses revealed a higher intensity for the saturation of photosynthesis and greater light-saturated photosynthetic activity in the tla3 mutant than in the wild type and a Chl antenna size of the photosystems that was only about 40% of that in the wild type. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis and western-blot analyses showed that the tla3 strain was deficient in the Chl a/b light-harvesting complex. Molecular and genetic analyses revealed a single plasmid insertion in chromosome 4 of the tla3 nuclear genome, causing deletion of predicted gene g5047 and plasmid insertion within the fourth intron of downstream-predicted gene g5046. Complementation studies defined that gene g5047 alone was necessary and sufficient to rescue the tla3 mutation. Gene g5047 encodes a C. reinhardtii homolog of the chloroplast-localized SRP43 signal recognition particle, whose occurrence and function in green microalgae has not hitherto been investigated. Biochemical analysis showed that the nucleus-encoded and chloroplast-localized CrCpSRP43 protein specifically operates in the assembly of the peripheral components of the Chl a/b light-harvesting antenna. This work demonstrates that cpsrp43 deletion in green microalgae can be employed to generate tla mutants with a substantially diminished Chl antenna size. The latter exhibit improved solar energy conversion efficiency and photosynthetic productivity under mass culture and bright sunlight conditions. PMID:23043081

  5. Bioinformatic Analysis of Strawberry GSTF12 Gene

    NASA Astrophysics Data System (ADS)

    Wang, Xiran; Jiang, Leiyu; Tang, Haoru

    2018-01-01

    GSTF12 has long been known as a key factor in proanthocyanidin accumulation in the plant testa. Bioinformatic analysis of the nucleotide and encoded protein sequences of GSTF12 aids the study of genes in the anthocyanin biosynthesis and accumulation pathway. We therefore selected the GSTF12 genes of 11 species, downloaded their nucleotide and protein sequences from NCBI, identified the strawberry GSTF12 gene by bioinformatic analysis, and constructed a phylogenetic tree. We also analysed the physical and chemical properties of the strawberry GSTF12 gene and the structure of its protein. The phylogenetic tree showed that strawberry and petunia were the closest relatives. Protein prediction indicated that the protein has one signal peptide and no obvious transmembrane regions.

  6. Genome-Wide Identification and Mapping of NBS-Encoding Resistance Genes in Solanum tuberosum Group Phureja

    PubMed Central

    Lozano, Roberto; Ponce, Olga; Ramirez, Manuel; Mostajo, Nelly; Orjeda, Gisella

    2012-01-01

    The majority of disease resistance (R) genes identified to date in plants encode a nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domain containing protein. Additional domains such as coiled-coil (CC) and TOLL/interleukin-1 receptor (TIR) domains can also be present. In the recently sequenced Solanum tuberosum group phureja genome we used HMM models and manual curation to annotate 435 NBS-encoding R gene homologs and 142 NBS-derived genes that lack the NBS domain. Highly similar homologs for most previously documented Solanaceae R genes were identified. A surprising ∼41% (179) of the 435 NBS-encoding genes are pseudogenes primarily caused by premature stop codons or frameshift mutations. Alignment of 81.80% of the 577 homologs to S. tuberosum group phureja pseudomolecules revealed non-random distribution of the R-genes; 362 of 470 genes were found in high density clusters on 11 chromosomes. PMID:22493716

  7. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities

    PubMed Central

    Fang, Xin; Sastry, Anand; Mih, Nathan; Kim, Donghyuk; Tan, Justin; Lloyd, Colton J.; Gao, Ye; Yang, Laurence; Palsson, Bernhard O.

    2017-01-01

    Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN—probably the best characterized TRN—several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism’s TRN from disparate data types. PMID:28874552

  8. Two Membrane-Associated Tyrosine Phosphatase Homologs Potentiate C. elegans AKT-1/PKB Signaling

    PubMed Central

    Hu, Patrick J; Xu, Jinling; Ruvkun, Gary

    2006-01-01

    Akt/protein kinase B (PKB) functions in conserved signaling cascades that regulate growth and metabolism. In humans, Akt/PKB is dysregulated in diabetes and cancer; in Caenorhabditis elegans, Akt/PKB functions in an insulin-like signaling pathway to regulate larval development. To identify molecules that modulate C. elegans Akt/PKB signaling, we performed a genetic screen for enhancers of the akt-1 mutant phenotype (eak). We report the analysis of three eak genes. eak-6 and eak-5/sdf-9 encode protein tyrosine phosphatase homologs; eak-4 encodes a novel protein with an N-myristoylation signal. All three genes are expressed primarily in the two endocrine XXX cells, and their predicted gene products localize to the plasma membrane. Genetic evidence indicates that these proteins function in parallel to AKT-1 to inhibit the FoxO transcription factor DAF-16. These results define two membrane-associated protein tyrosine phosphatase homologs that may potentiate C. elegans Akt/PKB signaling by cell autonomous and cell nonautonomous mechanisms. Similar molecules may modulate Akt/PKB signaling in human endocrine tissues. PMID:16839187

  9. The design of strain-specific polymerase chain reactions for discrimination of the racoon rabies virus strain from indigenous rabies viruses of Ontario.

    PubMed

    Nadin-Davis, S A; Huang, W; Wandeler, A I

    1996-03-01

    Since its recognition as a discrete epizootic in Florida in the early 1950s, the raccoon strain of rabies virus (RV) has spread over almost the entire eastern seaboard of the US and now threatens to enter the southernmost regions of Canada. To characterise this RV strain in more detail, nucleotide sequencing of the N and G genes, encoding the nucleoprotein and glycoprotein, respectively, of representative isolates has been undertaken. This sequence information generated a conserved restriction map of the N gene, thereby permitting unequivocal identification of this strain by molecular techniques. Comparisons of the predicted nucleoprotein and glycoprotein products with those of other RV strains identified a number of amino acid sequence variations conserved only in the raccoon strain. This information was used to design strain-specific primers targeted to the N gene sequences encoding these residues. The incorporation of these primers into a multiplex polymerase chain reaction (PCR) protocol permitted easy and rapid discrimination between the raccoon RV strain and indigenous Ontario RVs.

  10. A Screen for Modifiers of Hedgehog Signaling in Drosophila melanogaster Identifies swm and mts

    PubMed Central

    Casso, David J.; Liu, Songmei; Iwaki, D. David; Ogden, Stacey K.; Kornberg, Thomas B.

    2008-01-01

    Signaling by Hedgehog (Hh) proteins shapes most tissues and organs in both vertebrates and invertebrates, and its misregulation has been implicated in many human diseases. Although components of the signaling pathway have been identified, key aspects of the signaling mechanism and downstream targets remain to be elucidated. We performed an enhancer/suppressor screen in Drosophila to identify novel components of the pathway and identified 26 autosomal regions that modify a phenotypic readout of Hh signaling. Three of the regions include genes that contribute constituents to the pathway—patched, engrailed, and hh. One of the other regions includes the gene microtubule star (mts) that encodes a subunit of protein phosphatase 2A. We show that mts is necessary for full activation of Hh signaling. A second region includes the gene second mitotic wave missing (swm). swm is recessive lethal and is predicted to encode an evolutionarily conserved protein with RNA binding and Zn+ finger domains. Characterization of newly isolated alleles indicates that swm is a negative regulator of Hh signaling and is essential for cell polarity. PMID:18245841

  11. Specific molecular signatures predict decitabine response in chronic myelomonocytic leukemia

    PubMed Central

    Meldi, Kristen; Qin, Tingting; Buchi, Francesca; Droin, Nathalie; Sotzen, Jason; Micol, Jean-Baptiste; Selimoglu-Buet, Dorothée; Masala, Erico; Allione, Bernardino; Gioia, Daniela; Poloni, Antonella; Lunghi, Monia; Solary, Eric; Abdel-Wahab, Omar; Santini, Valeria; Figueroa, Maria E.

    2015-01-01

    Myelodysplastic syndromes and chronic myelomonocytic leukemia (CMML) are characterized by mutations in genes encoding epigenetic modifiers and aberrant DNA methylation. DNA methyltransferase inhibitors (DMTis) are used to treat these disorders, but response is highly variable, with few means to predict which patients will benefit. Here, we examined baseline differences in mutations, DNA methylation, and gene expression in 40 CMML patients who were responsive or resistant to decitabine (DAC) in order to develop a molecular means of predicting response at diagnosis. While somatic mutations did not differentiate responders from nonresponders, we identified 167 differentially methylated regions (DMRs) of DNA at baseline that distinguished responders from nonresponders using next-generation sequencing. These DMRs were primarily localized to nonpromoter regions and overlapped with distal regulatory enhancers. Using the methylation profiles, we developed an epigenetic classifier that accurately predicted DAC response at the time of diagnosis. Transcriptional analysis revealed differences in gene expression at diagnosis between responders and nonresponders. In responders, the upregulated genes included those that are associated with the cell cycle, potentially contributing to effective DAC incorporation. Treatment with CXCL4 and CXCL7, which were overexpressed in nonresponders, blocked DAC effects in isolated normal CD34+ and primary CMML cells, suggesting that their upregulation contributes to primary DAC resistance. PMID:25822018

  12. Microbial genomic island discovery, visualization and analysis.

    PubMed

    Bertelli, Claire; Tilley, Keith E; Brinkman, Fiona S L

    2018-06-03

    Horizontal gene transfer (also called lateral gene transfer) is a major mechanism for microbial genome evolution, enabling rapid adaptation and survival in specific niches. Genomic islands (GIs), commonly defined as clusters of bacterial or archaeal genes of probable horizontal origin, are of particular medical, environmental and/or industrial interest, as they disproportionately encode virulence factors and some antimicrobial resistance genes and may harbor entire metabolic pathways that confer a specific adaptation (solvent resistance, symbiosis properties, etc). As large-scale analyses of microbial genomes increases, such as for genomic epidemiology investigations of infectious disease outbreaks in public health, there is increased appreciation of the need to accurately predict and track GIs. Over the past decade, numerous computational tools have been developed to tackle the challenges inherent in accurate GI prediction. We review here the main types of GI prediction methods and discuss their advantages and limitations for a routine analysis of microbial genomes in this era of rapid whole-genome sequencing. An assessment is provided of 20 GI prediction software methods that use sequence-composition bias to identify the GIs, using a reference GI data set from 104 genomes obtained using an independent comparative genomics approach. Finally, we present guidelines to assist researchers in effectively identifying these key genomic regions.

  13. Cloning and sequencing of Staphylococcus aureus murC, a gene essential for cell wall biosynthesis.

    PubMed

    Lowe, A M; Deresiewicz, R L

    1999-01-01

    Staphylococcus aureus is a major human pathogen that is increasingly resistant to clinically useful antimicrobial agents. While screening for S. aureus genes expressed during mammalian infection, we isolated murC. This gene encodes UDP-N-acetylmuramoyl-L-alanine synthetase, an enzyme essential for cell wall biosynthesis in a number of bacteria. S. aureus MurC has a predicted mass 49,182 Da and complements the temperature-sensitive murC mutation of E. coli ST222. Sequence data on the DNA flanking staphylococcal murC suggests that the local gene organization there parallels that found in B. subtilis, but differs from that found in gram-negative bacterial pathogens. MurC proteins represent promising targets for broad spectrum antimicrobial drug development.

  14. In Silico Pattern-Based Analysis of the Human Cytomegalovirus Genome

    PubMed Central

    Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T.; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

    2003-01-01

    More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/). PMID:12634390

  15. Clarin-1, encoded by the Usher Syndrome III causative gene, forms a membranous microdomain: possible role of clarin-1 in organizing the actin cytoskeleton.

    PubMed

    Tian, Guilian; Zhou, Yun; Hajkova, Dagmar; Miyagi, Masaru; Dinculescu, Astra; Hauswirth, William W; Palczewski, Krzysztof; Geng, Ruishuang; Alagramam, Kumar N; Isosomppi, Juha; Sankila, Eeva-Marja; Flannery, John G; Imanishi, Yoshikazu

    2009-07-10

    Clarin-1 is the protein product encoded by the gene mutated in Usher syndrome III. Although the molecular function of clarin-1 is unknown, its primary structure predicts four transmembrane domains similar to a large family of membrane proteins that include tetraspanins. Here we investigated the role of clarin-1 by using heterologous expression and in vivo model systems. When expressed in HEK293 cells, clarin-1 localized to the plasma membrane and concentrated in low density compartments distinct from lipid rafts. Clarin-1 reorganized actin filament structures and induced lamellipodia. This actin-reorganizing function was absent in the modified protein encoded by the most prevalent North American Usher syndrome III mutation, the N48K form of clarin-1 deficient in N-linked glycosylation. Proteomics analyses revealed a number of clarin-1-interacting proteins involved in cell-cell adhesion, focal adhesions, cell migration, tight junctions, and regulation of the actin cytoskeleton. Consistent with the hypothesized role of clarin-1 in actin organization, F-actin-enriched stereocilia of auditory hair cells evidenced structural disorganization in Clrn1(-/-) mice. These observations suggest a possible role for clarin-1 in the regulation and homeostasis of actin filaments, and link clarin-1 to the interactive network of Usher syndrome gene products.

  16. In silico pattern-based analysis of the human cytomegalovirus genome.

    PubMed

    Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

    2003-04-01

    More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).

  17. Molecular cloning and functional analysis of ESGP, an embryonic stem cell and germ cell specific protein.

    PubMed

    Chen, Yan-Mei; Du, Zhong-Wei; Yao, Zhen

    2005-12-01

    Several putative Oct-4 downstream genes from mouse embryonic stem (ES) cells have been identified using the suppression-subtractive hybridization method. In this study, one of the novel genes encoding an ES cell and germ cell specific protein (ESGP) was cloned by rapid amplification of cDNA ends. ESGP contains 801 bp encoding an 84 amino acid small protein and has no significant homology to any known genes. There is a signal peptide at the N-terminus of the ESGP protein as predicted by SeqWeb (GCG) (SeqWeb version 2.0.2, http://gcg.biosino.org:8080/). The result of immunofluorescence assay suggested that ESGP might encode a secretory protein. The expression pattern of ESGP is consistent with the expression of Oct-4 during embryonic development. ESGP protein was detected in fertilized oocyte, from 3.5 day postcoital (dpc) blastocyst to 17.5 dpc embryo, and was only detected in testis and ovary tissues in the adult. In vitro, ESGP was only expressed in pluripotent cell lines, such as embryonic stem cells, embryonic carcinoma cells and embryonic germ cells, but not in their differentiated progenies. Despite its specific expression, forced expression of ESGP is not indispensable for the effect of Oct-4 on ES cell self-renewal, and does not affect the differentiation to three germ layers.

  18. Cag3 Is a Novel Essential Component of the Helicobacter pylori Cag Type IV Secretion System Outer Membrane Subcomplex

    PubMed Central

    Pinto-Santini, Delia M.; Salama, Nina R.

    2009-01-01

    Helicobacter pylori strains harboring the cag pathogenicity island (PAI) have been associated with more severe gastric disease in infected humans. The cag PAI encodes a type IV secretion (T4S) system required for CagA translocation into host cells as well as induction of proinflammatory cytokines, such as interleukin-8 (IL-8). cag PAI genes sharing sequence similarity with T4S components from other bacteria are essential for Cag T4S function. Other cag PAI-encoded genes are also essential for Cag T4S, but lack of sequence-based or structural similarity with genes in existing databases has precluded a functional assignment for the encoded proteins. We have studied the role of one such protein, Cag3 (HP0522), in Cag T4S and determined Cag3 subcellular localization and protein interactions. Cag3 is membrane associated and copurifies with predicted inner and outer membrane Cag T4S components that are essential for Cag T4S as well as putative accessory factors. Coimmunoprecipitation and cross-linking experiments revealed specific interactions with HpVirB7 and CagM, suggesting Cag3 is a new component of the Cag T4S outer membrane subcomplex. Finally, lack of Cag3 lowers HpVirB7 steady-state levels, further indicating Cag3 makes a subcomplex with this protein. PMID:19801411

  19. Cooperative Bacterial Growth Dynamics Predict the Evolution of Antibiotic Resistance

    NASA Astrophysics Data System (ADS)

    Artemova, Tatiana; Gerardin, Ylaine; Hsin-Jung Li, Sophia; Gore, Jeff

    2011-03-01

    Since the discovery of penicillin, antibiotics have been our primary weapon against bacterial infections. Unfortunately, bacteria can gain resistance to penicillin by acquiring the gene that encodes beta-lactamase, which inactivates the antibiotic. However, mutations in this gene are necessary to degrade the modern antibiotic cefotaxime. Understanding the conditions that favor the spread of these mutations is a challenge. Here we show that bacterial growth in beta-lactam antibiotics is cooperative and that the nature of this growth determines the conditions in which resistance evolves. Quantitative analysis of the growth dynamics predicts a peak in selection at very low antibiotic concentrations; competition between strains confirms this prediction. We also find significant selection at higher antibiotic concentrations, close to the minimum inhibitory concentrations of the strains. Our results argue that an understanding of the evolutionary forces that lead to antibiotic resistance requires a quantitative understanding of the evolution of cooperation in bacteria.

  20. PanDaTox: A tool for accelerated metabolic engineering

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Amitai, Gil; Sorek, Rotem

    2012-07-18

    Metabolic engineering is often facilitated by cloning of genes encoding enzymes from various heterologous organisms into E. coli. Such engineering efforts are frequently hampered by foreign genes that are toxic to the E. coli host. We have developed PanDaTox (www.weizmann.ac.il/pandatox), a web-based resource that provides experimental toxicity information for more than 1.5 million genes from hundreds of different microbial genomes. The toxicity predictions, which were extensively experimentally verified, are based on serial cloning of genes into E. coli as part of the Sanger whole genome shotgun sequencing process. PanDaTox can accelerate metabolic engineering projects by allowing researchers to exclude toxic genes from the engineering plan and verify the clonability of selected genes before the actual metabolic engineering experiments are conducted.

  1. Genome Sequence of the Fleming Strain of Micrococcus luteus, a Simple Free-Living Actinobacterium

    PubMed Central

    Young, Michael; Artsatbanov, Vladislav; Beller, Harry R.; Chandra, Govind; Chater, Keith F.; Dover, Lynn G.; Goh, Ee-Been; Kahan, Tamar; Kaprelyants, Arseny S.; Kyrpides, Nikos; Lapidus, Alla; Lowry, Stephen R.; Lykidis, Athanasios; Mahillon, Jacques; Markowitz, Victor; Mavromatis, Konstantinos; Mukamolova, Galina V.; Oren, Aharon; Rokem, J. Stefan; Smith, Margaret C. M.; Young, Danielle I.; Greenblatt, Charles L.

    2010-01-01

    Micrococcus luteus (NCTC2665, “Fleming strain”) has one of the smallest genomes of free-living actinobacteria sequenced to date, comprising a single circular chromosome of 2,501,097 bp (G+C content, 73%) predicted to encode 2,403 proteins. The genome shows extensive synteny with that of the closely related organism, Kocuria rhizophila, from which it was taxonomically separated relatively recently. Despite its small size, the genome harbors 73 insertion sequence (IS) elements, almost all of which are closely related to elements found in other actinobacteria. An IS element is inserted into the rrs gene of one of only two rrn operons found in M. luteus. The genome encodes only four sigma factors and 14 response regulators, a finding indicative of adaptation to a rather strict ecological niche (mammalian skin). The high sensitivity of M. luteus to β-lactam antibiotics may result from the presence of a reduced set of penicillin-binding proteins and the absence of a wblC gene, which plays an important role in the antibiotic resistance in other actinobacteria. Consistent with the restricted range of compounds it can use as a sole source of carbon for energy and growth, M. luteus has a minimal complement of genes concerned with carbohydrate transport and metabolism and its inability to utilize glucose as a sole carbon source may be due to the apparent absence of a gene encoding glucokinase. Uniquely among characterized bacteria, M. luteus appears to be able to metabolize glycogen only via trehalose and to make trehalose only via glycogen. It has very few genes associated with secondary metabolism. In contrast to most other actinobacteria, M. luteus encodes only one resuscitation-promoting factor (Rpf) required for emergence from dormancy, and its complement of other dormancy-related proteins is also much reduced. M. luteus is capable of long-chain alkene biosynthesis, which is of interest for advanced biofuel production; a three-gene cluster essential for this metabolism has been identified in the genome. PMID:19948807

  2. Exome Sequencing of 18 Chinese Families with Congenital Cataracts: A New Sight of the NHS Gene

    PubMed Central

    Sun, Wenmin; Xiao, Xueshan; Li, Shiqiang; Guo, Xiangming; Zhang, Qingjiong

    2014-01-01

    Purpose: The aim of this study was to investigate the mutation spectrum and frequency of 34 known genes in 18 Chinese families with congenital cataracts. Methods: Genomic DNA and clinical data were collected from 18 families with congenital cataracts. Variations in 34 cataract-associated genes were screened by whole exome sequencing and then validated by Sanger sequencing. Results: Eleven candidate variants in seven of the 34 genes were detected by exome sequencing and then confirmed by Sanger sequencing, comprising two variants predicted to be benign and nine pathogenic mutations. The nine mutations were present in 9 of the 18 (50%) families with congenital cataracts. In the four families with mutations in the X-linked NHS gene, no abnormalities other than cataract were recorded, and a pseudo-dominant inheritance pattern was suggested, as female carriers also had different forms of cataracts. Conclusion: This study expands the mutation spectrum and frequency of genes responsible for congenital cataract. Mutation in NHS is a common cause of nonsyndromic congenital cataract with pseudo-autosomal dominant inheritance. Combined with our previous studies, a genetic basis could be identified in 67.6% of families with congenital cataracts in our case series, in which mutations in genes encoding crystallins, genes encoding connexins, and NHS are responsible for 29.4%, 14.7%, and 11.8% of families, respectively. Our results suggest that mutations in NHS are a common cause of congenital cataract, both syndromic and nonsyndromic. PMID:24968223

  3. Genome sequencing of four Aureobasidium pullulans varieties: biotechnological potential, stress tolerance, and description of new species.

    PubMed

    Gostinčar, Cene; Ohm, Robin A; Kogej, Tina; Sonjak, Silva; Turk, Martina; Zajc, Janja; Zalar, Polona; Grube, Martin; Sun, Hui; Han, James; Sharma, Aditi; Chiniquy, Jennifer; Ngan, Chew Yee; Lipzen, Anna; Barry, Kerrie; Grigoriev, Igor V; Gunde-Cimerman, Nina

    2014-07-01

    Aureobasidium pullulans is a black-yeast-like fungus used for production of the polysaccharide pullulan and the antimycotic aureobasidin A, and as a biocontrol agent in agriculture. It can cause opportunistic human infections, and it inhabits various extreme environments. To promote the understanding of these traits, we performed de-novo genome sequencing of the four varieties of A. pullulans. The 25.43-29.62 Mb genomes of these four varieties of A. pullulans encode between 10266 and 11866 predicted proteins. Their genomes encode most of the enzyme families involved in degradation of plant material and many sugar transporters, and they have genes possibly associated with degradation of plastic and aromatic compounds. Proteins believed to be involved in the synthesis of pullulan and siderophores, but not of aureobasidin A, are predicted. Putative stress-tolerance genes include several aquaporins and aquaglyceroporins, large numbers of alkali-metal cation transporters, genes for the synthesis of compatible solutes and melanin, all of the components of the high-osmolarity glycerol pathway, and bacteriorhodopsin-like proteins. All of these genomes contain a homothallic mating-type locus. The differences between these four varieties of A. pullulans are large enough to justify their redefinition as separate species: A. pullulans, A. melanogenum, A. subglaciale and A. namibiae. The redundancy observed in several gene families can be linked to the nutritional versatility of these species and their particular stress tolerance. The availability of the genome sequences of the four Aureobasidium species should improve their biotechnological exploitation and promote our understanding of their stress-tolerance mechanisms, diverse lifestyles, and pathogenic potential.

  4. Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.

    PubMed

    Mahesh, Hirehally Basavarajegowda; Subba, Pratigya; Advani, Jayshree; Shirke, Meghana Deepak; Loganathan, Ramya Malarini; Chandana, Shankara Lingu; Shilpa, Siddappa; Chatterjee, Oishi; Pinto, Sneha Maria; Prasad, Thottethodi Subrahmanya Keshava; Gowda, Malali

    2018-04-01

    Indian sandalwood (Santalum album) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees. © 2018 American Society of Plant Biologists. All Rights Reserved.

  5. Twenty novel mutations in BCKDHA, BCKDHB and DBT genes in a cohort of 52 Saudi Arabian patients with maple syrup urine disease.

    PubMed

    Imtiaz, Faiqa; Al-Mostafa, Abeer; Allam, Rabab; Ramzan, Khushnooda; Al-Tassan, Nada; Tahir, Asma I; Al-Numair, Nouf S; Al-Hamed, Mohamed H; Al-Hassnan, Zuhair; Al-Owain, Mohammad; Al-Zaidan, Hamad; Al-Amoudi, Mohammad; Qari, Alya; Balobaid, Ameera; Al-Sayed, Moeenaldeen

    2017-06-01

    Maple syrup urine disease (MSUD), an autosomal recessive inborn error of metabolism due to defects in the branched-chain α-ketoacid dehydrogenase (BCKD) complex, is commonly observed among other inherited metabolic disorders in the Kingdom of Saudi Arabia. This report presents the results of mutation analysis of three of the four genes encoding the BCKD complex in 52 biochemically diagnosed MSUD patients originating from Saudi Arabia. The 25 mutations (20 novel) detected spanned the entire coding regions of the BCKDHA, BCKDHB and DBT genes. There were no mutations found in the DLD gene in this cohort of patients. Prediction effects, conservation and modelling of novel mutations demonstrated that all were predicted to be disease-causing. All mutations presented in a homozygous form and we did not detect the presence of a "founder" mutation in any of the three genes. In addition, prenatal molecular genetic testing was successfully carried out on chorionic villus samples or amniocenteses in 10 expectant mothers with affected children with MSUD, molecularly characterized by this study.

  6. The candidate histocompatibility locus of a Basal chordate encodes two highly polymorphic proteins.

    PubMed

    Nydam, Marie L; Netuschil, Nikolai; Sanders, Erin; Langenbacher, Adam; Lewis, Daniel D; Taketa, Daryl A; Marimuthu, Arumugapradeep; Gracey, Andrew Y; De Tomaso, Anthony W

    2013-01-01

    The basal chordate Botryllus schlosseri undergoes a natural transplantation reaction governed by a single, highly polymorphic locus called the fuhc. Our initial characterization of this locus suggested it encoded a single gene alternatively spliced into two transcripts: a 555 amino acid-secreted form containing the first half of the gene, and a full-length, 1008 amino acid transmembrane form, with polymorphisms throughout the ectodomain determining outcome. We have now found that the locus encodes two highly polymorphic genes which are separated by a 227 bp intergenic region: first, the secreted form as previously described, and a second gene encoding a 531 amino acid membrane-bound gene containing three extracellular immunoglobulin domains. While northern blotting revealed only these two mRNAs, both PCR and mRNA-seq detect a single capped and polyadenylated transcript that encodes processed forms of both genes linked by the intergenic region, as well as other transcripts in which exons of the two genes are spliced together. These results might suggest that the two genes are expressed as an operon, during which both genes are co-transcribed and then trans-spliced into two separate messages. This type of transcriptional regulation has been described in tunicates previously; however, the membrane-bound gene does not encode a typical Splice Leader (SL) sequence at the 5' terminus that usually accompanies trans-splicing. Thus, the presence of stable transcripts encoding both genes may suggest a novel mechanism of regulation, or conversely may be rare but stable transcripts in which the two mRNAs are linked due to a small amount of read-through by RNA polymerase. Both genes are highly polymorphic and co-expressed on tissues involved in histocompatibility. In addition, polymorphisms on both genes correlate with outcome, although we have found a case in which it appears that the secreted form may be the major allorecognition determinant.

  7. Staphylococcal SCCmec elements encode an active MCM-like helicase and thus may be replicative

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mir-Sanchis, Ignacio; Roman, Christina A.; Misiura, Agnieszka

    2016-08-29

    Methicillin-resistant Staphylococcus aureus (MRSA) is a public-health threat worldwide. Although the mobile genomic island responsible for this phenotype, staphylococcal cassette chromosome (SCC), has been thought to be nonreplicative, we predicted DNA-replication-related functions for some of the conserved proteins encoded by SCC. We show that one of these, Cch, is homologous to the self-loading initiator helicases of an unrelated family of genomic islands, that it is an active 3'-to-5' helicase and that the adjacent ORF encodes a single-stranded DNA–binding protein. Our 2.9-Å crystal structure of intact Cch shows that it forms a hexameric ring. Cch, like the archaeal and eukaryotic MCM-family replicative helicases, belongs to the pre–sensor II insert clade of AAA+ ATPases. Additionally, we found that SCC elements are part of a broader family of mobile elements, all of which encode a replication initiator upstream of their recombinases. Replication after excision would enhance the efficiency of horizontal gene transfer.

  8. Development of a gene synthesis platform for the efficient large scale production of small genes encoding animal toxins.

    PubMed

    Sequeira, Ana Filipa; Brás, Joana L A; Guerreiro, Catarina I P D; Vincentelli, Renaud; Fontes, Carlos M G A

    2016-12-01

    Gene synthesis is becoming an important tool in many fields of recombinant DNA technology, including recombinant protein production. De novo gene synthesis is quickly replacing the classical cloning and mutagenesis procedures and allows generating nucleic acids for which no template is available. In addition, when coupled with efficient gene design algorithms that optimize codon usage, it leads to high levels of recombinant protein expression. Here, we describe the development of an optimized gene synthesis platform that was applied to the large scale production of small genes encoding venom peptides. This improved gene synthesis method uses a PCR-based protocol to assemble synthetic DNA from pools of overlapping oligonucleotides and was developed to synthesise multiple genes simultaneously. This technology incorporates an accurate, automated and cost-effective ligation-independent cloning step to directly integrate the synthetic genes into an effective Escherichia coli expression vector. The robustness of this technology to generate large libraries of dozens to thousands of synthetic nucleic acids was demonstrated through the parallel and simultaneous synthesis of 96 genes encoding animal toxins. An automated platform was developed for the large-scale synthesis of small genes encoding eukaryotic toxins. Large scale recombinant expression of synthetic genes encoding eukaryotic toxins will allow exploring the extraordinary potency and pharmacological diversity of animal venoms, an increasingly valuable but unexplored source of lead molecules for drug discovery.

  9. Bioinformatic Analyses of Unique (Orphan) Core Genes of the Genus Acidithiobacillus: Functional Inferences and Use As Molecular Probes for Genomic and Metagenomic/Transcriptomic Interrogation

    PubMed Central

    González, Carolina; Lazcano, Marcelo; Valdés, Jorge; Holmes, David S.

    2016-01-01

    Using phylogenomic and gene compositional analyses, five highly conserved gene families have been detected in the core genome of the phylogenetically coherent genus Acidithiobacillus of the class Acidithiobacillia. These core gene families are absent in the closest extant genus Thermithiobacillus tepidarius that subtends the Acidithiobacillus genus and roots the deepest in this class. The predicted proteins encoded by these core gene families are not detected by a BLAST search in the NCBI non-redundant database of more than 90 million proteins using a relaxed cut-off of 1.0e−5. None of the five families has a clear functional prediction. However, bioinformatic scrutiny, using pI prediction, motif/domain searches, cellular location predictions, genomic context analyses, and chromosome topology studies together with previously published transcriptomic and proteomic data, suggests that some may have functions associated with membrane remodeling during cell division perhaps in response to pH stress. Despite the high level of amino acid sequence conservation within each family, there is sufficient nucleotide variation of the respective genes to permit the use of the DNA sequences to distinguish different species of Acidithiobacillus, making them useful additions to the armamentarium of tools for phylogenetic analysis. Since the protein families are unique to the Acidithiobacillus genus, they can also be leveraged as probes to detect the genus in environmental metagenomes and metatranscriptomes, including industrial biomining operations, and acid mine drainage (AMD). PMID:28082953
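
    The orphan-family criterion in the abstract above (no BLAST hit outside the genus at an E-value cut-off of 1.0e−5) can be illustrated with a minimal sketch. This is not the authors' pipeline: the family names, the hit tuples, and the function orphan_families are hypothetical, and the sketch assumes the BLAST hits have already been parsed into (query, subject genus, E-value) records.

```python
# Minimal sketch of an E-value-based "orphan" filter.
# All family names and hits below are invented for illustration only.
E_VALUE_CUTOFF = 1.0e-5  # relaxed cut-off quoted in the abstract


def orphan_families(families, hits, own_genus, cutoff=E_VALUE_CUTOFF):
    """Return families that have no hit at or below `cutoff` outside `own_genus`."""
    with_outside_hit = {
        query
        for query, genus, e_value in hits
        if genus != own_genus and e_value <= cutoff
    }
    return sorted(f for f in families if f not in with_outside_hit)


families = ["fam1", "fam2", "fam3"]
hits = [
    ("fam1", "Acidithiobacillus", 1e-80),  # hit inside the genus: ignored
    ("fam2", "Escherichia", 1e-30),        # significant hit outside the genus
    ("fam3", "Escherichia", 0.5),          # weaker than the cut-off: does not count
]
orphans = orphan_families(families, hits, "Acidithiobacillus")
print(orphans)
```

    With a cut-off this relaxed, even weak similarity outside the genus is enough to exclude a family, so the families that remain are conservative candidates for genus-specific genes.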

  10. Bioinformatic Analyses of Unique (Orphan) Core Genes of the Genus Acidithiobacillus: Functional Inferences and Use As Molecular Probes for Genomic and Metagenomic/Transcriptomic Interrogation.

    PubMed

    González, Carolina; Lazcano, Marcelo; Valdés, Jorge; Holmes, David S

    2016-01-01

    Using phylogenomic and gene compositional analyses, five highly conserved gene families have been detected in the core genome of the phylogenetically coherent genus Acidithiobacillus of the class Acidithiobacillia. These core gene families are absent in the closest extant genus Thermithiobacillus tepidarius that subtends the Acidithiobacillus genus and roots the deepest in this class. The predicted proteins encoded by these core gene families are not detected by a BLAST search in the NCBI non-redundant database of more than 90 million proteins using a relaxed cut-off of 1.0e−5. None of the five families has a clear functional prediction. However, bioinformatic scrutiny, using pI prediction, motif/domain searches, cellular location predictions, genomic context analyses, and chromosome topology studies together with previously published transcriptomic and proteomic data, suggests that some may have functions associated with membrane remodeling during cell division perhaps in response to pH stress. Despite the high level of amino acid sequence conservation within each family, there is sufficient nucleotide variation of the respective genes to permit the use of the DNA sequences to distinguish different species of Acidithiobacillus, making them useful additions to the armamentarium of tools for phylogenetic analysis. Since the protein families are unique to the Acidithiobacillus genus, they can also be leveraged as probes to detect the genus in environmental metagenomes and metatranscriptomes, including industrial biomining operations, and acid mine drainage (AMD).

  11. Motif-independent prediction of a secondary metabolism gene cluster using comparative genomics: application to sequenced genomes of Aspergillus and ten other filamentous fungal species.

    PubMed

    Takeda, Itaru; Umemura, Myco; Koike, Hideaki; Asai, Kiyoshi; Machida, Masayuki

    2014-08-01

    Despite their biological importance, a significant number of genes for secondary metabolite biosynthesis (SMB) remain undetected due largely to the fact that they are highly diverse and are not expressed under a variety of cultivation conditions. Several software tools including SMURF and antiSMASH have been developed to predict fungal SMB gene clusters by finding core genes encoding polyketide synthase, nonribosomal peptide synthetase and dimethylallyltryptophan synthase as well as several others typically present in the cluster. In this work, we have devised a novel comparative genomics method to identify SMB gene clusters that is independent of motif information of the known SMB genes. The method detects SMB gene clusters by searching for a similar order of genes and their presence in nonsyntenic blocks. With this method, we were able to identify many known SMB gene clusters with the core genes in the genomic sequences of 10 filamentous fungi. Furthermore, we have also detected SMB gene clusters without core genes, including the kojic acid biosynthesis gene cluster of Aspergillus oryzae. By varying the detection parameters of the method, a significant difference in the sequence characteristics was detected between the genes residing inside the clusters and those outside the clusters. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  12. Comprehensive search for accessory proteins encoded with archaeal and bacterial type III CRISPR-cas gene cassettes reveals 39 new cas gene families.

    PubMed

    Shah, Shiraz A; Alkhnbashi, Omer S; Behler, Juliane; Han, Wenyuan; She, Qunxin; Hess, Wolfgang R; Garrett, Roger A; Backofen, Rolf

    2018-06-19

    A study was undertaken to identify conserved proteins that are encoded adjacent to cas gene cassettes of Type III CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats - CRISPR associated) interference modules. Type III modules have been shown to target and degrade dsDNA, ssDNA and ssRNA and are frequently intertwined with cofunctional accessory genes, including genes encoding CRISPR-associated Rossmann Fold (CARF) domains. Using a comparative genomics approach, and defining a Type III association score accounting for coevolution and specificity of flanking genes, we identified and classified 39 new Type III associated gene families. Most archaeal and bacterial Type III modules were seen to be flanked by several accessory genes, around half of which did not encode CARF domains and remain of unknown function. Northern blotting and interference assays in Synechocystis confirmed that one particular non-CARF accessory protein family was involved in crRNA maturation. Non-CARF accessory genes were generally diverse, encoding nuclease, helicase, protease, ATPase, transporter and transmembrane domains with some encoding no known domains. We infer that additional families of non-CARF accessory proteins remain to be found. The method employed is scalable for potential application to metagenomic data once automated pipelines for annotation of CRISPR-Cas systems have been developed. All accessory genes found in this study are presented online in a readily accessible and searchable format for researchers to audit their model organism of choice: http://accessory.crispr.dk.

  13. Amino-terminal domain of the v-fms oncogene product includes a functional signal peptide that directs synthesis of a transforming glycoprotein in the absence of feline leukemia virus gag sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wheeler, E.F.; Roussel, M.F.; Hampe, A.

    1986-08-01

    The nucleotide sequence of a 5' segment of the human genomic c-fms proto-oncogene suggested that recombination between feline leukemia virus and feline c-fms sequences might have occurred in a region encoding the 5' untranslated portion of c-fms mRNA. The polyprotein precursor gP180^gag-fms encoded by the McDonough strain of feline sarcoma virus was therefore predicted to contain 34 v-fms-coded amino acids derived from sequences of the c-fms gene that are not ordinarily translated from the proto-oncogene mRNA. The gP180^gag-fms polyprotein was cotranslationally cleaved near the gag-fms junction to remove its gag gene-coded portion. Determination of the amino-terminal sequence of the resulting v-fms-coded glycoprotein, gp120^v-fms, showed that the site of proteolysis corresponded to a predicted signal peptidase cleavage site within the c-fms gene product. Together, these analyses suggested that the linked gag sequences may not be necessary for expression of a biologically active v-fms gene product. The gag-fms sequences of feline sarcoma virus strain McDonough and the v-fms sequences alone were inserted into a murine retroviral vector containing a neomycin resistance gene. The authors conclude that a cryptic hydrophobic signal peptide sequence in v-fms was unmasked by gag deletion, thereby allowing the correct orientation and transport of the v-fms gene product within membranous organelles. It seems likely that the proteolytic cleavage of gP180^gag-fms is mediated by signal peptidase and that the amino termini of gp140^v-fms and the c-fms gene product are identical.

  14. Carbon-dependent control of electron transfer and central carbon pathway genes for methane biosynthesis in the Archaean, Methanosarcina acetivorans strain C2A

    PubMed Central

    2010-01-01

    Background: The archaeon Methanosarcina acetivorans strain C2A forms methane, a potent greenhouse gas, from a variety of one-carbon substrates and acetate. Whereas the biochemical pathways leading to methane formation are well understood, little is known about the expression of many of the genes that encode proteins needed for carbon flow, electron transfer and/or energy conservation. Quantitative transcript analysis was performed on twenty gene clusters encompassing over one hundred genes in M. acetivorans that encode enzymes/proteins with known or potential roles in substrate conversion to methane. Results: The expression of many seemingly "redundant" genes/gene clusters establishes substrate-dependent control of approximately seventy genes for methane production by the pathways for methanol and acetate utilization. These include genes for soluble-type and membrane-type heterodisulfide reductases (hdr), hydrogenases including genes for a vht-type F420 non-reducing hydrogenase, molybdenum-type (fmd) as well as tungsten-type (fwd) formylmethanofuran dehydrogenases, genes for rnf and mrp-type electron transfer complexes, for acetate uptake, plus multiple genes for aha- and atp-type ATP synthesis complexes. Analysis of promoters for seven gene clusters reveals UTR leaders of 51-137 nucleotides in length, raising the possibility of both transcriptional and translational levels of control. Conclusions: The above findings establish the differential and coordinated expression of two major gene families in M. acetivorans in response to carbon/energy supply. Furthermore, the quantitative mRNA measurements demonstrate the dynamic range for modulating transcript abundance. Since many of these gene clusters in M. acetivorans are also present in other Methanosarcina species, including M. mazei and M. barkeri, these findings provide a basis for predicting related control in these environmentally significant methanogens. PMID:20178638

  15. The Saccharomyces cerevisiae LOS1 gene involved in pre-tRNA splicing encodes a nuclear protein that behaves as a component of the nuclear matrix.

    PubMed

    Shen, W C; Selvakumar, D; Stanford, D R; Hopper, A K

    1993-09-15

    Mutations of the Saccharomyces cerevisiae LOS1 gene cause the accumulation of end matured intron-containing pre-tRNAs at elevated temperatures. In an effort to decipher the role of the LOS1 protein in pre-tRNA splicing, we have analyzed the LOS1 gene and its protein product. The LOS1 gene is located on the left arm of chromosome XI and the order of genes in this area of the chromosome is .... URA1 ... SAC1 TRP3 UBA1 STE6 LOS1 .... FAS1..... The LOS1 open reading frame encodes a putative protein of 1100 amino acids that shows no significant homology to other genes. The LOS1 open reading frame was tagged with the influenza virus hemagglutinin epitope recognized by the 12CA5 antibody. The 12CA5 antibody recognizes an epitope-tagged protein of the size predicted by the LOS1 open reading frame. Using this antibody for indirect immunofluorescence and cell fractionation studies we show that the LOS1 protein is located in nuclei. Los1p cannot be extracted from nuclei by treatment with nucleases, salts, or Triton X-100. This insolubility suggests that Los1p is a component of the nucleoskeleton. We propose that LOS1 mutations may affect pre-tRNA processing via alteration of the nuclear matrix.

  16. The histone chaperone ASF1 is essential for sexual development in the filamentous fungus Sordaria macrospora.

    PubMed

    Gesing, Stefan; Schindler, Daniel; Fränzel, Benjamin; Wolters, Dirk; Nowrousian, Minou

    2012-05-01

    Ascomycetes develop four major types of fruiting bodies that share a common ancestor, and a set of common core genes most likely controls this process. One way to identify such genes is to search for conserved expression patterns. We analysed microarray data of Fusarium graminearum and Sordaria macrospora, identifying 78 genes with similar expression patterns during fruiting body development. One of these genes was asf1 (anti-silencing function 1), encoding a predicted histone chaperone. asf1 expression is also upregulated during development in the distantly related ascomycete Pyronema confluens. To test whether asf1 plays a role in fungal development, we generated an S. macrospora asf1 deletion mutant. The mutant is sterile and can be complemented to fertility by transformation with the wild-type asf1 and its P. confluens homologue. An ASF1-EGFP fusion protein localizes to the nucleus. By tandem-affinity purification/mass spectrometry as well as yeast two-hybrid analysis, we identified histones H3 and H4 as ASF1 interaction partners. Several developmental genes are dependent on asf1 for correct transcriptional expression. Deletion of the histone chaperone genes rtt106 and cac2 did not cause any developmental phenotypes. These data indicate that asf1 of S. macrospora encodes a conserved histone chaperone that is required for fruiting body development. © 2012 Blackwell Publishing Ltd.

  17. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gilchrist, Michael J.; Sobral, Daniel; Khoueiry, Pierre

    Genome-wide resources, such as collections of cDNA clones encoding for complete proteins (full-ORF clones), are crucial tools for studying the evolution of gene function and genetic interactions. Non-model organisms, in particular marine organisms, provide a rich source of functional diversity. Marine organism genomes are, however, frequently highly polymorphic and encode proteins that diverge significantly from those of well-annotated model genomes. The construction of full-ORF clone collections from non-model organisms is hindered by the difficulty of predicting accurately the N-terminal ends of proteins, and distinguishing recent paralogs from highly polymorphic alleles. We also report a computational strategy that overcomes these difficulties, and allows for accurate gene level clustering of transcript data followed by the automated identification of full-ORFs with correct 5'- and 3'-ends. It is robust to polymorphism, includes paralog calling and does not require evolutionary proximity to well annotated model organisms. Here, we developed this pipeline for the ascidian Ciona intestinalis, a highly polymorphic member of the divergent sister group of the vertebrates, emerging as a powerful model organism to study chordate gene function, Gene Regulatory Networks and molecular mechanisms underlying human pathologies. Furthermore, using this pipeline we have generated the first full-ORF collection for a highly polymorphic marine invertebrate. It contains 19,163 full-ORF cDNA clones covering 60% of Ciona coding genes, and full-ORF orthologs for approximately half of curated human disease-associated genes.
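
    For readers unfamiliar with the term, the notion of a "full ORF" in the abstract above can be illustrated with a naive scan for the longest ATG-to-stop open reading frame in a cDNA sequence. This is only a sketch under simplifying assumptions (forward strand only, standard stop codons, first ATG in each frame taken as the start); it is not the pipeline the abstract describes, which additionally handles polymorphism, paralog calling and N-terminal prediction. The function name and the example sequence are hypothetical.

```python
# Naive longest-ORF scan on the forward strand (illustrative only).
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def longest_orf(seq):
    """Return the longest ATG-to-stop ORF (stop codon included), or '' if none."""
    seq = seq.upper()
    best = ""
    for frame in range(3):
        start = None  # index of the first ATG seen since the last stop in this frame
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orf = seq[start:i + 3]
                if len(orf) > len(best):
                    best = orf
                start = None
    return best


# Made-up sequence containing two ORFs in different frames.
example = "CCATGAAATTTGGGTAACCATGCCCTAG"
print(longest_orf(example))
```

    A scan like this picks the first in-frame ATG, which is exactly where the difficulty noted in the abstract arises: in divergent, polymorphic genomes the true N-terminal end cannot be assumed from sequence alone.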

  18. A Member of the Sugar Transporter Family, Stl1p Is the Glycerol/H+ Symporter in Saccharomyces cerevisiae

    PubMed Central

    Ferreira, Célia; van Voorst, Frank; Martins, António; Neves, Luisa; Oliveira, Rui; Kielland-Brandt, Morten C.; Lucas, Cândida; Brandt, Anders

    2005-01-01

    Glycerol and other polyols are used as osmoprotectants by many organisms. Several yeasts and other fungi can take up glycerol by proton symport. To identify genes involved in active glycerol uptake in Saccharomyces cerevisiae we screened a deletion mutant collection comprising 321 genes encoding proteins with 6 or more predicted transmembrane domains for impaired growth on glycerol medium. Deletion of STL1, which encodes a member of the sugar transporter family, eliminates active glycerol transport. Stl1p is present in the plasma membrane in S. cerevisiae during conditions where glycerol symport is functional. Both the Stl1 protein and the active glycerol transport are subject to glucose-induced inactivation, following identical patterns. Furthermore, the Stl1 protein and the glycerol symporter activity are strongly but transiently induced when cells are subjected to osmotic shock. STL1 was heterologously expressed in Schizosaccharomyces pombe, a yeast that does not contain its own active glycerol transport system. In S. pombe, STL1 conferred the ability to take up glycerol against a concentration gradient in a proton motive force-dependent manner. We conclude that the glycerol proton symporter in S. cerevisiae is encoded by STL1. PMID:15703210

  19. NOVEL ANTIBIOTIC RESISTANCE DETERMINANTS FROM AGRICULTURAL SOIL EXPOSED TO ANTIBIOTICS WIDELY USED IN HUMAN MEDICINE AND ANIMAL FARMING.

    PubMed

    Lau, Calvin Ho-Fung; van Engelen, Kalene; Gordon, Stephen; Renaud, Justin; Topp, Edward

    2017-06-16

    Antibiotic resistance has emerged globally as one of the biggest threats to human and animal health. Although the excessive use of antibiotics is recognized for accelerating the selection for resistance, there is a growing body of evidence suggesting that natural environments are "hotspots" for the development of both ancient and contemporary resistance mechanisms. Given that pharmaceuticals can be entrained onto agricultural land through anthropogenic activities, this could be a potential driver for the emergence and dissemination of resistance in soil bacteria. Using functional metagenomics, we interrogated the "resistome" of bacterial communities found in a collection of Canadian agricultural soil, some of which had been receiving antibiotics widely used in human medicine (macrolides) or food animal production (sulfamethazine, chlortetracycline and tylosin) for up to 16 years. Of the 34 new antibiotic resistance genes (ARGs) recovered, the majority were predicted to encode for (multi)drug efflux systems, while a few share little to no homology with established resistance determinants. We characterized several novel gene products, including putative enzymes that can confer high-level resistance against aminoglycosides, sulfonamides, and a broad range of beta-lactams, with respect to their resistance mechanisms and clinical significance. By coupling high-resolution proteomics analysis with functional metagenomics, we discovered an unusual peptide, PPP AZI 4, encoded within an alternative open-reading frame not predicted by bioinformatics tools. Expression of the proline-rich PPP AZI 4 can promote resistance against different macrolides but not other ribosomal-targeting antibiotics, implicating a new macrolide-specific resistance mechanism that could be fundamentally linked to the evolutionary design of this peptide. IMPORTANCE: Antibiotic resistance is a clinical phenomenon with an evolutionary link to the microbial pangenome. Genes and protogenes encoding for specialized and potential resistance mechanisms are abundant in natural environments, but understanding of their identity and genomic context remains limited. Our discovery of several previously-unknown antibiotic resistance genes from uncultured soil microorganisms indicates that soil is a significant reservoir of resistance determinants, which, once acquired and "re-purposed" by pathogenic bacteria, can have serious impacts on therapeutic outcomes. This study provides valuable insights into the diversity and identity of resistance within the soil microbiome. The finding of a novel peptide-mediated resistance mechanism involving an unpredicted gene product also highlights the usefulness of integrating proteomics analysis into metagenomics-driven gene discovery. © Crown copyright 2017.

  20. SEMI-ROLLED LEAF1 Encodes a Putative Glycosylphosphatidylinositol-Anchored Protein and Modulates Rice Leaf Rolling by Regulating the Formation of Bulliform Cells

    PubMed Central

    Xiang, Jing-Jing; Zhang, Guang-Heng; Qian, Qian; Xue, Hong-Wei

    2012-01-01

    Leaf rolling is an important agronomic trait in rice (Oryza sativa) breeding and moderate leaf rolling maintains the erectness of leaves and minimizes shadowing between leaves, leading to improved photosynthetic efficiency and grain yields. Although a few rolled-leaf mutants have been identified and some genes controlling leaf rolling have been isolated, the molecular mechanisms of leaf rolling still need to be elucidated. Here we report the isolation and characterization of SEMI-ROLLED LEAF1 (SRL1), a gene involved in the regulation of leaf rolling. Mutants srl1-1 (point mutation) and srl1-2 (transferred DNA insertion) exhibit adaxially rolled leaves due to the increased numbers of bulliform cells at the adaxial cell layers, which could be rescued by complementary expression of SRL1. SRL1 is expressed in various tissues and is expressed at low levels in bulliform cells. SRL1 protein is located at the plasma membrane and predicted to be a putative glycosylphosphatidylinositol-anchored protein. Moreover, analysis of the gene expression profile of cells that will become epidermal cells in wild type but probably bulliform cells in srl1-1 by laser-captured microdissection revealed that the expression of genes encoding vacuolar H+-ATPase (subunits A, B, C, and D) and H+-pyrophosphatase, which are increased during the formation of bulliform cells, were up-regulated in srl1-1. These results provide the transcript profile of rice leaf cells that will become bulliform cells and demonstrate that SRL1 regulates leaf rolling through inhibiting the formation of bulliform cells by negatively regulating the expression of genes encoding vacuolar H+-ATPase subunits and H+-pyrophosphatase, which will help to understand the mechanism regulating leaf rolling. PMID:22715111

  1. Inter- and intra-specific pan-genomes of Borrelia burgdorferi sensu lato: genome stability and adaptive radiation

    PubMed Central

    2013-01-01

    Background: Lyme disease is caused by spirochete bacteria from the Borrelia burgdorferi sensu lato (B. burgdorferi s.l.) species complex. To reconstruct the evolution of B. burgdorferi s.l. and identify the genomic basis of its human virulence, we compared the genomes of 23 B. burgdorferi s.l. isolates from Europe and the United States, including B. burgdorferi sensu stricto (B. burgdorferi s.s., 14 isolates), B. afzelii (2), B. garinii (2), B. “bavariensis” (1), B. spielmanii (1), B. valaisiana (1), B. bissettii (1), and B. “finlandensis” (1). Results: Robust B. burgdorferi s.s. and B. burgdorferi s.l. phylogenies were obtained using genome-wide single-nucleotide polymorphisms, despite recombination. Phylogeny-based pan-genome analysis showed that the rate of gene acquisition was higher between species than within species, suggesting adaptive speciation. Strong positive natural selection drives the sequence evolution of lipoproteins, including chromosomally-encoded genes 0102 and 0404, cp26-encoded ospC and b08, and lp54-encoded dbpA, a07, a22, a33, a53, a65. Computer simulations predicted rapid adaptive radiation of genomic groups as population size increases. Conclusions: Intra- and inter-specific pan-genome sizes of B. burgdorferi s.l. expand linearly with phylogenetic diversity. Yet gene-acquisition rates in B. burgdorferi s.l. are among the lowest in bacterial pathogens, resulting in high genome stability and few lineage-specific genes. Genome adaptation of B. burgdorferi s.l. is driven predominantly by copy-number and sequence variations of lipoprotein genes. New genomic groups are likely to emerge if the current trend of B. burgdorferi s.l. population expansion continues. PMID:24112474

  2. Diverse and Abundant Secondary Metabolism Biosynthetic Gene Clusters in the Genomes of Marine Sponge Derived Streptomyces spp. Isolates.

    PubMed

    Jackson, Stephen A; Crossman, Lisa; Almeida, Eduardo L; Margassery, Lekha Menon; Kennedy, Jonathan; Dobson, Alan D W

    2018-02-20

    The genus Streptomyces produces secondary metabolic compounds that are rich in biological activity. Many of these compounds are genetically encoded by large secondary metabolism biosynthetic gene clusters (smBGCs) such as polyketide synthases (PKS) and non-ribosomal peptide synthetases (NRPS) which are modular and can be highly repetitive. Due to the repeats, these gene clusters can be difficult to resolve using short read next generation datasets and are often quite poorly predicted using standard approaches. We have sequenced the genomes of 13 Streptomyces spp. strains isolated from shallow water and deep-sea sponges that display antimicrobial activities against a number of clinically relevant bacterial and yeast species. Draft genomes have been assembled and smBGCs have been identified using the antiSMASH (antibiotics and Secondary Metabolite Analysis Shell) web platform. We have compared the smBGCs amongst strains in the search for novel sequences conferring the potential to produce novel bioactive secondary metabolites. The strains in this study recruit to four distinct clades within the genus Streptomyces. The marine strains host abundant smBGCs which encode polyketides, NRPS, siderophores, bacteriocins and lantipeptides. The deep-sea strains appear to be enriched with gene clusters encoding NRPS. Marine adaptations are evident in the sponge-derived strains which are enriched for genes involved in the biosynthesis and transport of compatible solutes and for heat-shock proteins. Streptomyces spp. from marine environments are a promising source of novel bioactive secondary metabolites as the abundance and diversity of smBGCs show high degrees of novelty. Sponge derived Streptomyces spp. isolates appear to display genomic adaptations to marine living when compared to terrestrial strains.

  3. Multidrug resistance in fungi: regulation of transporter-encoding gene expression

    PubMed Central

    Paul, Sanjoy; Moye-Rowley, W. Scott

    2014-01-01

    A critical risk to the continued success of antifungal chemotherapy is the acquisition of resistance, a risk exacerbated by the few classes of effective antifungal drugs. Predictably, as the use of these drugs increases in the clinic, more resistant organisms can be isolated from patients. A particularly problematic form of drug resistance that routinely emerges in the major fungal pathogens is known as multidrug resistance. Multidrug resistance refers to the simultaneous acquisition of tolerance to a range of drugs via a limited or even single genetic change. This review will focus on recent progress in understanding pathways of multidrug resistance in fungi including those of most medical relevance. Analyses of multidrug resistance in Saccharomyces cerevisiae have provided the most detailed outline of multidrug resistance in a eukaryotic microorganism. Multidrug resistant isolates of S. cerevisiae typically result from changes in the activity of a pair of related transcription factors that in turn elicit overproduction of several target genes. Chief among these is the ATP-binding cassette (ABC)-encoding gene PDR5. Interestingly, in the medically important Candida species, very similar pathways are involved in acquisition of multidrug resistance. In both C. albicans and C. glabrata, changes in the activity of transcriptional activator proteins elicit overproduction of a protein closely related to S. cerevisiae Pdr5 called Cdr1. The major filamentous fungal pathogen, Aspergillus fumigatus, was previously thought to acquire resistance to azole compounds (the principal antifungal drug class) via alterations in the azole drug target-encoding gene cyp51A. More recent data indicate that pathways in addition to changes in the cyp51A gene are important determinants in A. fumigatus azole resistance. We will discuss findings that suggest azole resistance in A. fumigatus and Candida species may share more mechanistic similarities than previously thought. PMID:24795641

  4. Modularity of Plant Metabolic Gene Clusters: A Trio of Linked Genes That Are Collectively Required for Acylation of Triterpenes in Oat

    PubMed Central

    Mugford, Sam T.; Louveau, Thomas; Melton, Rachel; Qi, Xiaoquan; Bakht, Saleha; Hill, Lionel; Tsurushima, Tetsu; Honkanen, Suvi; Rosser, Susan J.; Lomonossoff, George P.; Osbourn, Anne

    2013-01-01

    Operon-like gene clusters are an emerging phenomenon in the field of plant natural products. The genes encoding some of the best-characterized plant secondary metabolite biosynthetic pathways are scattered across plant genomes. However, an increasing number of gene clusters encoding the synthesis of diverse natural products have recently been reported in plant genomes. These clusters have arisen through the neo-functionalization and relocation of existing genes within the genome, and not by horizontal gene transfer from microbes. The reasons for clustering are not yet clear, although this form of gene organization is likely to facilitate co-inheritance and co-regulation. Oats (Avena spp) synthesize antimicrobial triterpenoids (avenacins) that provide protection against disease. The synthesis of these compounds is encoded by a gene cluster. Here we show that a module of three adjacent genes within the wider biosynthetic gene cluster is required for avenacin acylation. Through the characterization of these genes and their encoded proteins we present a model of the subcellular organization of triterpenoid biosynthesis. PMID:23532069

  5. Detection with synthetic oligonucleotide probes of nucleotide sequence variations in the genes encoding enterotoxins of Escherichia coli.

    PubMed Central

    Nishibuchi, M; Murakami, A; Arita, M; Jikuya, H; Takano, J; Honda, T; Miwatani, T

    1989-01-01

    We examined variations in the genes encoding heat-stable enterotoxin (ST) and heat-labile enterotoxin (LT) in 88 strains of Escherichia coli isolated from individuals with traveler's diarrhea to find suitable sequences for use as oligonucleotide probes. Four oligonucleotide probes of the gene encoding ST of human origin (STIb or STh), one oligonucleotide probe of the gene encoding ST of porcine origin (STIa or STp), and three oligonucleotide probes of the gene encoding LT of human origin (LTIh) were used in DNA colony hybridization tests. In 15 of 22 strains possessing the STh gene and 28 of 42 strains producing LT, the sequences of all regions tested were identical to the published sequences. One region in the STh gene examined with an 18-mer probe was relatively well conserved and was shown to be closely associated with the enterotoxicity of the E. coli strains in suckling mice. This oligonucleotide, however, hybridized with strains of Vibrio cholerae O1, V. parahaemolyticus, and Yersinia enterocolitica that gave negative results in the suckling mouse assay. PMID:2685027

  6. Draft Genome Sequence of the d-Xylose-Fermenting Yeast Spathaspora arborariae UFMG-HM19.1AT

    PubMed Central

    Lobo, Francisco P.; Gonçalves, Davi L.; Alves, Sergio L.; Gerber, Alexandra L.; de Vasconcelos, Ana Tereza R.; Basso, Luiz C.; Franco, Glória R.; Soares, Marco A.; Cadete, Raquel M.; Rosa, Carlos A.

    2014-01-01

    The draft genome sequence of the yeast Spathaspora arborariae UFMG-HM19.1AT (CBS 11463 = NRRL Y-48658) is presented here. The sequenced genome size is 12.7 Mb, consisting of 41 scaffolds containing a total of 5,625 predicted open reading frames, including many genes encoding enzymes and transporters involved in d-xylose fermentation. PMID:24435867

  7. Effects of coumarate 3-hydroxylase down-regulation on lignin structure

    Treesearch

    John Ralph; Takuya Akiyama; Hoon Kim; Fachuang Lu; Paul F. Schatz; Jane M. Marita; Sally A. Ralph; M.S. Srinivasa Reddy; Fang Chen; Richard A. Dixon

    2006-01-01

    Down-regulation of the gene encoding 4-coumarate 3-hydroxylase (C3H) in alfalfa massively but predictably increased the proportion of p-hydroxyphenyl (P) units relative to the normally dominant guaiacyl (G) and syringyl (S) units. Stem levels of up to ~65% P (from wild-type levels of ~1%) resulting from down-regulation of C3H were measured by traditional degradative...

  8. A comparative gene analysis with rice identified orthologous group II HKT genes and their association with Na(+) concentration in bread wheat.

    PubMed

    Ariyarathna, H A Chandima K; Oldach, Klaus H; Francki, Michael G

    2016-01-19

    Although the HKT transporter genes are among the key determinants of crop salt tolerance mechanisms, the diversity and functional role of group II HKT genes are not clearly understood in bread wheat. The advanced knowledge on rice HKT and whole genome sequence was, therefore, used in comparative gene analysis to identify orthologous wheat group II HKT genes and their role in trait variation under different saline environments. The four group II HKTs in rice identified two orthologous gene families from bread wheat, including the known TaHKT2;1 gene family and a new distinctly different gene family designated as TaHKT2;2. A single copy of TaHKT2;2 was found on each homeologous chromosome arm 7AL, 7BL and 7DL and each gene was expressed in leaf blade, sheath and root tissues under non-stressed and 200 mM salt-stressed conditions. The proteins encoded by genes of the TaHKT2;2 family revealed more than 93% amino acid sequence identity but ≤52% amino acid identity compared to the proteins encoded by TaHKT2;1 family. Specifically, variations in known critical domains predicted functional differences between the two protein families. Similar to orthologous rice genes on chromosome 6L, TaHKT2;1 and TaHKT2;2 genes were located approximately 3 kb apart on wheat chromosomes 7AL, 7BL and 7DL, forming a static syntenic block in the two species. The chromosomal region on 7AL containing TaHKT2;1 7AL-1 co-located with QTL for shoot Na(+) concentration and yield in some saline environments. The differences in copy number, gene sequences and encoded proteins between TaHKT2;2 homeologous genes and other group II HKT gene families within and across species likely reflect functional diversity for ion selectivity and transport in plants. Evidence indicated that neither TaHKT2;2 nor TaHKT2;1 was associated with primary root Na(+) uptake but TaHKT2;1 may be associated with trait variation for Na(+) exclusion and yield in some but not all saline environments.
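
    The identity figures quoted in this abstract (more than 93% within the TaHKT2;2 family, ≤52% against TaHKT2;1) are pairwise percent identities. A minimal Python sketch of one common way to compute such a value is shown here. The function name, the toy sequences, and the convention of skipping gapped columns are illustrative assumptions; the abstract does not state which definition or tool the authors used.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: columns where either sequence has a gap ('-') are skipped,
    so the denominator is the number of gap-free aligned columns. Other
    definitions (e.g. dividing by the full alignment length) also exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


# Toy alignment (hypothetical residues, not real HKT sequences):
# five gap-free columns, four of them identical.
toy_identity = percent_identity("MKT-LV", "MKSALV")
```

    The sequences must already be aligned; producing the alignment itself is outside the scope of this sketch.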

  9. Cyclic stretch-induced the cytoskeleton rearrangement and gene expression of cytoskeletal regulators in human periodontal ligament cells.

    PubMed

    Wu, Yaqin; Zhuang, Jiabao; Zhao, Dan; Zhang, Fuqiang; Ma, Jiayin; Xu, Chun

    2017-10-01

    This study aimed to explore the mechanism of the stretch-induced cell realignment and cytoskeletal rearrangement by identifying several mechanoresponsive genes related to cytoskeletal regulators in human PDL cells. After the cells were stretched by 1, 10 and 20% strains for 0.5, 1, 2, 4, 6, 12 or 24 h, the changes of the morphology and content of microfilaments were recorded and calculated. Meanwhile, the expression of 84 key genes encoding cytoskeletal regulators after 6 and 24 h stretches with 20% strain was detected by using real-time PCR array. Western blot was applied to identify the protein expression level of several cytoskeletal regulators encoded by these differentially expressed genes. The confocal fluorescent staining results confirmed stretch-induced realignment of cells and rearrangement of microfilaments. Among the 84 genes screened, one gene was up-regulated while two genes were down-regulated after 6 h stretch. Meanwhile, three genes were up-regulated while two genes were down-regulated after 24 h stretch. These genes displaying differential expression included genes regulating polymerization/depolymerization of microfilaments (CDC42EP2, FNBP1L, NCK2, PIKFYVE, WASL), polymerization/depolymerization of microtubules (STMN1), interaction between microfilaments and microtubules (MACF1), as well as a phosphatase (PPP1R12B). Among the proteins encoded by these genes, the protein expression level of Cdc42 effector protein-2 (encoded by CDC42EP2) and Stathmin-1 (encoded by STMN1) was down-regulated, while the protein expression level of N-WASP (encoded by WASL) was up-regulated. The present study confirmed the cyclic stretch-induced cellular realignment and rearrangement of microfilaments in the human PDL cells and indicated several force-sensitive genes with regard to cytoskeletal regulators.

  10. A High-Resolution Gene Map of the Chloroplast Genome of the Red Alga Porphyra purpurea.

    PubMed Central

    Reith, M; Munholland, J

    1993-01-01

    Extensive DNA sequencing of the chloroplast genome of the red alga Porphyra purpurea has resulted in the detection of more than 125 genes. Fifty-eight (approximately 46%) of these genes are not found on the chloroplast genomes of land plants. These include genes encoding 17 photosynthetic proteins, three tRNAs, and nine ribosomal proteins. In addition, nine genes encoding proteins related to biosynthetic functions, six genes encoding proteins involved in gene expression, and at least five genes encoding miscellaneous proteins are among those not known to be located on land plant chloroplast genomes. The increased coding capacity of the P. purpurea chloroplast genome, along with other characteristics such as the absence of introns and the conservation of ancestral operons, demonstrates the primitive nature of the P. purpurea chloroplast genome. In addition, evidence for a monophyletic origin of chloroplasts is suggested by the identification of two groups of genes that are clustered in chloroplast genomes but not in cyanobacteria. PMID:12271072

  11. Systematic mutagenesis of genes encoding predicted autotransported proteins of Burkholderia pseudomallei identifies factors mediating virulence in mice, net intracellular replication and a novel protein conferring serum resistance.

    PubMed

    Lazar Adler, Natalie R; Stevens, Mark P; Dean, Rachel E; Saint, Richard J; Pankhania, Depesh; Prior, Joann L; Atkins, Timothy P; Kessler, Bianca; Nithichanon, Arnone; Lertmemongkolchai, Ganjana; Galyov, Edouard E

    2015-01-01

    Burkholderia pseudomallei is the causative agent of the severe tropical disease melioidosis, which commonly presents as sepsis. The B. pseudomallei K96243 genome encodes eleven predicted autotransporters, a diverse family of secreted and outer membrane proteins often associated with virulence. In a systematic study of these autotransporters, we constructed insertion mutants in each gene predicted to encode an autotransporter and assessed them for three pathogenesis-associated phenotypes: virulence in the BALB/c intra-peritoneal mouse melioidosis model, net intracellular replication in J774.2 murine macrophage-like cells and survival in 45% (v/v) normal human serum. From the complete repertoire of eleven autotransporter mutants, we identified eight mutants which exhibited an increase in median lethal dose of 1 to 2-log10 compared to the isogenic parent strain (bcaA, boaA, boaB, bpaA, bpaC, bpaE, bpaF and bimA). Four mutants, all demonstrating attenuation for virulence, exhibited reduced net intracellular replication in J774.2 macrophage-like cells (bimA, boaB, bpaC and bpaE). A single mutant (bpaC) was identified that exhibited significantly reduced serum survival compared to wild-type. The bpaC mutant, which demonstrated attenuation for virulence and net intracellular replication, was sensitive to complement-mediated killing via the classical and/or lectin pathway. Serum resistance was rescued by in trans complementation. Subsequently, we expressed recombinant proteins of the passenger domain of four predicted autotransporters representing each of the phenotypic groups identified: those attenuated for virulence (BcaA), those attenuated for virulence and net intracellular replication (BpaE), the BpaC mutant with defects in virulence, net intracellular replication and serum resistance and those displaying wild-type phenotypes (BatA). Only BcaA and BpaE elicited a strong IFN-γ response in a restimulation assay using whole blood from seropositive donors and were recognised by seropositive human sera from the endemic area. To conclude, several predicted autotransporters contribute to B. pseudomallei virulence and BpaC may do so by conferring resistance against complement-mediated killing.

  12. Interaction of apicoplast-encoded elongation factor (EF) EF-Tu with nuclear-encoded EF-Ts mediates translation in the Plasmodium falciparum plastid.

    PubMed

    Biswas, Subir; Lim, Erin E; Gupta, Ankit; Saqib, Uzma; Mir, Snober S; Siddiqi, Mohammad Imran; Ralph, Stuart A; Habib, Saman

    2011-03-01

    Protein translation in the plastid (apicoplast) of Plasmodium spp. is of immense interest as a target for potential anti-malarial drugs. However, the molecular data on apicoplast translation needed for optimisation and development of novel inhibitors is lacking. We report characterisation of two key translation elongation factors in Plasmodium falciparum, apicoplast-encoded elongation factor PfEF-Tu and nuclear-encoded PfEF-Ts. Recombinant PfEF-Tu hydrolysed GTP and interacted with its presumed nuclear-encoded partner PfEF-Ts. The EF-Tu inhibitor kirromycin affected PfEF-Tu activity in vitro, indicating that apicoplast EF-Tu is indeed the target of this drug. The predicted PfEF-Ts leader sequence targeted GFP to the apicoplast, confirming that PfEF-Ts functions in this organelle. Recombinant PfEF-Ts mediated nucleotide exchange on PfEF-Tu and homology modeling of the PfEF-Tu:PfEF-Ts complex revealed PfEF-Ts-induced structural alterations that would expedite GDP release from PfEF-Tu. Our results establish functional interaction between two apicoplast translation factors encoded by genes residing in different cellular compartments and highlight the significance of their sequence/structural differences from bacterial elongation factors in relation to inhibitor activity. These data provide an experimental system to study the effects of novel inhibitors targeting PfEF-Tu and PfEF-Tu.PfEF-Ts interaction. Our finding that apicoplast EF-Tu possesses chaperone-related disulphide reductase activity also provides a rationale for retention of the tufA gene on the plastid genome. Copyright © 2010 Australian Society for Parasitology Inc. All rights reserved.

  13. Cloning, characterization, expression analysis and inhibition studies of a novel gene encoding Bowman-Birk type protease inhibitor from rice bean

    USDA-ARS's Scientific Manuscript database

    This paper presents the first study describing the isolation, cloning and characterization of a full length gene encoding Bowman-Birk protease inhibitor (RbTI) from rice bean (Vigna umbellata). A full-length protease inhibitor gene with complete open reading frame of 327 bp encoding 109 amino acids w...

  14. Functional Analysis of the Brassica napus L. Phytoene Synthase (PSY) Gene Family

    PubMed Central

    López-Emparán, Ada; Quezada-Martinez, Daniela; Zúñiga-Bustos, Matías; Cifuentes, Víctor; Iñiguez-Luy, Federico; Federico, María Laura

    2014-01-01

    Phytoene synthase (PSY) has been shown to catalyze the first committed and rate-limiting step of carotenogenesis in several crop species, including Brassica napus L. Due to its pivotal role, PSY has been a prime target for breeding and metabolic engineering the carotenoid content of seeds, tubers, fruits and flowers. In Arabidopsis thaliana, PSY is encoded by a single copy gene but small PSY gene families have been described in monocot and dicotyledonous species. We have recently shown that PSY genes have been retained in a triplicated state in the A- and C-Brassica genomes, with each paralogue mapping to syntenic locations in each of the three “Arabidopsis-like” subgenomes. Most importantly, we have shown that in B. napus all six members are expressed, exhibiting overlapping redundancy and signs of subfunctionalization among photosynthetic and non photosynthetic tissues. The question of whether this large PSY family actually encodes six functional enzymes remained to be answered. Therefore, the objectives of this study were to: (i) isolate, characterize and compare the complete protein coding sequences (CDS) of the six B. napus PSY genes; (ii) model their predicted tridimensional enzyme structures; (iii) test their phytoene synthase activity in a heterologous complementation system and (iv) evaluate their individual expression patterns during seed development. This study further confirmed that the six B. napus PSY genes encode proteins with high sequence identity, which have evolved under functional constraint. Structural modeling demonstrated that they share similar tridimensional protein structures with a putative PSY active site. Significantly, all six B. napus PSY enzymes were found to be functional. Taking into account the specific patterns of expression exhibited by these PSY genes during seed development and recent knowledge of PSY suborganellar localization, the selection of transgene candidates for metabolic engineering the carotenoid content of oilseeds is discussed. PMID:25506829

  15. Comparative Sequence Analysis of the Plasmid-Encoded Regulator of Enteropathogenic Escherichia coli Strains

    PubMed Central

    Okeke, Iruka N.; Borneman, Jade A.; Shin, Sooan; Mellies, Jay L.; Quinn, Laura E.; Kaper, James B.

    2001-01-01

    Enteropathogenic Escherichia coli (EPEC) strains that carry the EPEC adherence factor (EAF) plasmid were screened for the presence of different EAF sequences, including those of the plasmid-encoded regulator (per). Considerable variation in gene content of EAF plasmids from different strains was seen. However, bfpA, the gene encoding the structural subunit for the bundle-forming pilus, bundlin, and per genes were found in 96.8% of strains. Sequence analysis of the per operon and its promoter region from 15 representative strains revealed that it is highly conserved. Most of the variation occurs in the 5′ two-thirds of the perA gene. In contrast, the C-terminal portion of the predicted PerA protein that contains the DNA-binding helix-turn-helix motif is 100% conserved in all strains that possess a full-length gene. In a minority of strains including the O119:H2 and canine isolates and in a subset of O128:H2 and O142:H6 strains, frameshift mutations in perA leading to premature truncation and consequent inactivation of the gene were identified. Cloned perA, -B, and -C genes from these strains, unlike those from strains with a functional operon, failed to activate the LEE1 operon and bfpA transcriptional fusions or to complement a per mutant in reference strain E2348/69. Furthermore, O119, O128, and canine strains that carry inactive per operons were deficient in virulence protein expression. The context in which the perABC operon occurs on the EAF plasmid varies. The sequence upstream of the per promoter region in EPEC reference strains E2348/69 and B171-8 was present in strains belonging to most serogroups. In a subset of O119:H2, O128:H2, and O142:H6 strains and in the canine isolate, this sequence was replaced by an IS1294-homologous sequence. PMID:11500429

  16. Cytochrome b5 gene and protein of Candida tropicalis and methods relating thereto

    DOEpatents

    Craft, David L.; Madduri, Krishna M.; Loper, John C.

    2003-01-01

    A novel gene has been isolated which encodes cytochrome b5 (CYTb5) protein of the ω-hydroxylase complex of C. tropicalis 20336. Vectors including this gene, and transformed host cells are provided. Methods of increasing the production of a CYTb5 protein are also provided which involve transforming a host cell with a gene encoding this protein and culturing the cells. Methods of increasing the production of a dicarboxylic acid are also provided which involve increasing in the host cell the number of genes encoding this protein.

  17. Genome sequence analysis of predicted polyprenol reductase gene from mangrove plant Kandelia obovata

    NASA Astrophysics Data System (ADS)

    Basyuni, M.; Sagami, H.; Baba, S.; Oku, H.

    2018-03-01

    It has been previously reported that dolichols, but not polyprenols, predominate in mangrove leaves and roots. The occurrence of larger amounts of dolichol in leaves of mangrove plants therefore implies that polyprenol reductase, which is responsible for the conversion of polyprenol to dolichol, may be active in mangrove leaves. Here we report an early assessment of a probable polyprenol reductase gene from the genome sequence of the mangrove plant Kandelia obovata. The functional assignment of the gene was based on a homology search of the sequences against the non-redundant (nr) peptide database of NCBI using Blastx. The degree of sequence identity between the DNA sequence and known polyprenol reductases was confirmed using the Blastx probability E-value, total score, and identity. The genome sequence data yielded three partial sequences, termed c23157 (700 bp), c23901 (960 bp), and c24171 (531 bp). The c23157 gene showed the highest similarity (61%) to predicted polyprenol reductase 2-like from Gossypium raimondii, with an E-value of 2e-100. The second gene, c23901, exhibited high similarity (78%) to the steroid 5-alpha-reductase Det2 from Jatropha curcas, with an E-value of 2e-140. Furthermore, the c24171 gene showed the highest similarity (79%) to the polyprenol reductase 2 isoform X1 from J. curcas, with an E-value of 7e-21. The present study suggests that the c23157, c23901, and c24171 genes may encode predicted polyprenol reductases. The c23157, c23901, and c24171 sequences therefore represent new predicted polyprenol reductases from K. obovata.
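
    The hit-selection step described in this abstract (judging Blastx matches by E-value and similarity) can be sketched in a few lines of Python. The three records below use only the lengths, similarities, and E-values reported in the abstract; the class name, the function name, and the two cutoff values are hypothetical choices made for illustration, not thresholds stated by the authors.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlastHit:
    query: str             # partial sequence name from the abstract
    length_bp: int         # query length in base pairs
    description: str       # best-hit description
    similarity_pct: float  # reported similarity (%)
    e_value: float         # reported Blastx E-value


# Values copied from the abstract above.
HITS = [
    BlastHit("c23157", 700, "predicted polyprenol reductase 2-like (Gossypium raimondii)", 61.0, 2e-100),
    BlastHit("c23901", 960, "steroid 5-alpha-reductase Det2 (Jatropha curcas)", 78.0, 2e-140),
    BlastHit("c24171", 531, "polyprenol reductase 2 isoform X1 (Jatropha curcas)", 79.0, 7e-21),
]


def filter_hits(hits, max_e_value=1e-10, min_similarity=50.0):
    """Keep hits that pass both cutoffs, ordered by ascending E-value.

    The default cutoffs are illustrative assumptions only.
    """
    kept = [
        h for h in hits
        if h.e_value <= max_e_value and h.similarity_pct >= min_similarity
    ]
    return sorted(kept, key=lambda h: h.e_value)


for hit in filter_hits(HITS):
    print(f"{hit.query}\t{hit.length_bp} bp\t{hit.similarity_pct:.0f}%\t{hit.e_value:.0e}\t{hit.description}")
```

    With these assumed cutoffs all three sequences are retained. Tightening `max_e_value` would drop the shortest sequence, c24171, first, since its E-value (7e-21) is the largest of the three.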

  18. Positional signaling mediated by a receptor-like kinase in Arabidopsis.

    PubMed

    Kwak, Su-Hwan; Shen, Ronglai; Schiefelbein, John

    2005-02-18

    The position-dependent specification of root epidermal cells in Arabidopsis provides an elegant paradigm for cell patterning during development. Here, we describe a new gene, SCRAMBLED (SCM), required for cells to appropriately interpret their location within the developing root epidermis. SCM encodes a receptor-like kinase protein with a predicted extracellular domain of six leucine-rich repeats and an intracellular serine-threonine kinase domain. SCM regulates the expression of the GLABRA2, CAPRICE, WEREWOLF, and ENHANCER OF GLABRA3 transcription factor genes that define the cell fates. Further, the SCM gene is expressed throughout the developing root. Therefore, SCM likely enables developing epidermal cells to detect positional cues and establish an appropriate cell-type pattern.

  19. Metagenomic insights into the carbohydrate-active enzymes carried by the microorganisms adhering to solid digesta in the rumen of cows.

    PubMed

    Wang, Lingling; Hatem, Ayat; Catalyurek, Umit V; Morrison, Mark; Yu, Zhongtang

    2013-01-01

    The ruminal microbial community is a unique source of enzymes that underpin the conversion of cellulosic biomass. In this study, the microbial consortia adherent on solid digesta in the rumen of Jersey cattle were subjected to an activity-based metagenomic study to explore the genetic diversity of carbohydrolytic enzymes in Jersey cows, with a particular focus on cellulases and xylanases. Pyrosequencing and bioinformatic analyses of 120 carbohydrate-active fosmids identified genes encoding 575 putative Carbohydrate-Active Enzymes (CAZymes) and proteins putatively related to transcriptional regulation, transporters, and signal transduction coupled with polysaccharide degradation and metabolism. Most of these genes shared little similarity to sequences archived in databases. Genes that were predicted to encode glycoside hydrolases (GH) involved in xylan and cellulose hydrolysis (e.g., GH3, 5, 9, 10, 39 and 43) were well represented. A new subfamily (S-8) of GH5 was identified from contigs assigned to Firmicutes. These subfamilies of GH5 proteins also showed significant phylum-dependent distribution. A number of polysaccharide utilization loci (PULs) were found, and two of them contained genes encoding Sus-like proteins and cellulases that have not been reported in previous metagenomic studies of samples from the rumens of cows or other herbivores. Comparison with the large metagenomic datasets previously reported of other ruminant species (or cattle breeds) and wallabies showed that the rumen microbiome of Jersey cows might contain differing CAZymes. Future studies are needed to further explore how host genetics and diets affect the diversity and distribution of CAZymes and utilization of plant cell wall materials.

  20. Partial genome assembly for a candidate division OP11 single cell from an anoxic spring (Zodletone Spring, Oklahoma).

    PubMed

    Youssef, Noha H; Blainey, Paul C; Quake, Stephen R; Elshahed, Mostafa S

    2011-11-01

    Members of candidate division OP11 are widely distributed in terrestrial and marine ecosystems, yet little information regarding their metabolic capabilities and ecological role within such habitats is currently available. Here, we report on the microfluidic isolation, multiple-displacement-amplification, pyrosequencing, and genomic analysis of a single cell (ZG1) belonging to candidate division OP11. Genome analysis of the ∼270-kb partial genome assembly obtained showed that it had no particular similarity to a specific phylum. Four hundred twenty-three open reading frames were identified, 46% of which had no function prediction. In-depth analysis revealed a heterotrophic lifestyle, with genes encoding endoglucanase, amylopullulanase, and laccase enzymes, suggesting a capacity for utilization of cellulose, starch, and, potentially, lignin, respectively. Genes encoding several glycolysis enzymes as well as formate utilization were identified, but no evidence for an electron transport chain was found. The presence of genes encoding various components of lipopolysaccharide biosynthesis indicates a Gram-negative bacterial cell wall. The partial genome also provides evidence for antibiotic resistance (β-lactamase, aminoglycoside phosphotransferase), as well as antibiotic production (bacteriocin) and extracellular bactericidal peptidases. Multiple mechanisms for stress response were identified, as were elements of type I and type IV secretion systems. Finally, housekeeping genes identified within the partial genome were used to demonstrate the OP11 affiliation of multiple hitherto unclassified genomic fragments from multiple database-deposited metagenomic data sets. These results provide the first glimpse into the lifestyle of a member of a ubiquitous, yet poorly understood bacterial candidate division.

  1. Identification of Streptococcus mitis321A vaccine antigens based on reverse vaccinology

    PubMed Central

    Zhang, Qiao; Lin, Kexiong; Wang, Changzheng; Xu, Zhi; Yang, Li; Ma, Qianli

    2018-01-01

    Streptococcus mitis (S. mitis) may transform into highly pathogenic bacteria. The aim of the present study was to identify potential antigen targets for designing an effective vaccine against the pathogenic S. mitis321A. The genome of S. mitis321A was sequenced using an Illumina Hiseq2000 instrument. Subsequently, Glimmer 3.02 and Tandem Repeat Finder (TRF) 4.04 were used to predict genes and tandem repeats, respectively, with DNA sequence function analysis using the Basic Local Alignment Search Tool (BLAST) in the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Cluster of Orthologous Groups of proteins (COG) databases. Putative gene antigen candidates were screened with BLAST ahead of phylogenetic tree analysis. The DNA sequence assembly size was 2,110,680 bp with 40.12% GC, 6 scaffolds and 9 contigs. Consequently, 1,944 genes were predicted, and 119 TRF, 56 microsatellite DNA, 10 minisatellite DNA and 154 transposons were acquired. The predicted genes were associated with various pathways and functions concerning membrane transport and energy metabolism. Multiple putative genes encoding surface proteins, secreted proteins and virulence factors, as well as essential genes were determined. The majority of essential genes belonged to a phylogenetic lineage, while 321AGL000129 and 321AGL000299 were on the same branch. The current study provided useful information regarding the biological function of the S. mitis321A genome and recommended putative antigen candidates for developing a potent vaccine against S. mitis. PMID:29620181
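
    The GC figure quoted for this assembly (40.12%) is the share of G and C bases in the assembled sequence. A minimal Python sketch of that calculation follows. The function names and the decision to ignore ambiguous bases such as N are assumptions for illustration; the abstract does not say how the authors computed the value.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T).

    Assumption: ambiguous symbols such as 'N' are excluded from both the
    numerator and the denominator.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return 100.0 * gc / total


def assembly_gc(contigs) -> float:
    """Length-weighted GC percentage over all contigs of an assembly."""
    return gc_content("".join(contigs))


# Toy contigs (hypothetical, not from the S. mitis321A assembly).
toy_gc = assembly_gc(["GGCC", "AATT"])
```

    Concatenating the contigs before counting weights each contig by its length, which is the usual meaning of a whole-assembly GC percentage.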

  2. Combining inferred regulatory and reconstructed metabolic networks enhances phenotype prediction in yeast.

    PubMed

    Wang, Zhuo; Danziger, Samuel A; Heavner, Benjamin D; Ma, Shuyi; Smith, Jennifer J; Li, Song; Herricks, Thurston; Simeonidis, Evangelos; Baliga, Nitin S; Aitchison, John D; Price, Nathan D

    2017-05-01

    Gene regulatory and metabolic network models have been used successfully in many organisms, but inherent differences between them make networks difficult to integrate. Probabilistic Regulation Of Metabolism (PROM) provides a partial solution, but it does not incorporate network inference and underperforms in eukaryotes. We present an Integrated Deduced REgulation And Metabolism (IDREAM) method that combines statistically inferred Environment and Gene Regulatory Influence Network (EGRIN) models with the PROM framework to create enhanced metabolic-regulatory network models. We used IDREAM to predict phenotypes and genetic interactions between transcription factors and genes encoding metabolic activities in the eukaryote, Saccharomyces cerevisiae. IDREAM models contain many fewer interactions than PROM and yet produce significantly more accurate growth predictions. IDREAM consistently outperformed PROM using any of three popular yeast metabolic models and across three experimental growth conditions. Importantly, IDREAM's enhanced accuracy makes it possible to identify subtle synthetic growth defects. With experimental validation, these novel genetic interactions involving the pyruvate dehydrogenase complex suggested a new role for fatty acid-responsive factor Oaf1 in regulating acetyl-CoA production in glucose grown cells.

  3. Identification and phenotypic characterization of a second collagen adhesin, Scm, and genome-based identification and analysis of 13 other predicted MSCRAMMs, including four distinct pilus loci, in Enterococcus faecium

    PubMed Central

    Sillanpää, Jouko; Nallapareddy, Sreedhar R.; Prakash, Vittal P.; Qin, Xiang; Hook, Magnus; Weinstock, George M.; Murray, Barbara E.

    2009-01-01

    SUMMARY Attention has recently been drawn to Enterococcus faecium because of an increasing number of nosocomial infections caused by this species and its resistance to multiple antibacterial agents. However, relatively little is known about pathogenic determinants of this organism. We have previously identified a cell wall anchored collagen adhesin, Acm, produced by some isolates of E. faecium, and a secreted antigen, SagA, exhibiting broad spectrum binding to extracellular matrix proteins. Here, we analyzed the draft genome of strain TX0016 for potential MSCRAMMs (microbial surface component recognizing adhesive matrix molecules). Genome-based bioinformatics identified 22 predicted cell wall anchored E. faecium surface proteins (Fms) of which 15 (including Acm) have typical characteristics of MSCRAMMs including predicted folding into a modular architecture with multiple immunoglobulin-like domains. Functional characterization of one (Fms10, redesignated Scm for second collagen adhesin of E. faecium) revealed that recombinant Scm65 (A- and B-domains) and Scm36 (A-domain) bound efficiently to collagen type V in a concentration dependent manner, bound considerably less to collagen type I and fibrinogen, and differed from Acm in their binding specificities to collagen types IV and V. Results from far-UV circular dichroism of recombinant Scm36 and of Acm37 indicated that these proteins are rich in β-sheets, supporting our folding predictions. Whole-cell ELISA and FACS analyses unambiguously demonstrated surface expression of Scm in most E. faecium isolates. Strikingly, 11 of the 15 predicted MSCRAMMs clustered in four loci, each with a class C sortase gene; 9 of these showed similarity to Enterococcus faecalis Ebp pilus subunits and also contained motifs essential for pilus assembly.
Antibodies against one of the predicted major pilus proteins, Fms9 (redesignated as EbpCfm), detected a “ladder” pattern of high-molecular weight protein bands in a Western blot analysis of cell surface extracts from E. faecium, suggesting that EbpCfm is polymerized into a pilus structure. Further analysis of the transcripts of the corresponding gene cluster indicated that fms1 (ebpAfm), fms5 (ebpBfm) and ebpCfm are co-transcribed, consistent with pilus-encoding gene clusters of other gram-positive bacteria. All 15 genes occurred frequently in 30 clinically-derived diverse E. faecium isolates tested. The common occurrence of MSCRAMM and pilus-encoding genes and the presence of a second collagen-binding protein may have important implications for our understanding of this emerging pathogen. PMID:18832325

  4. Use of the multipurpose transposon TnKPK2 for the mutational analysis of chromosomal regions upstream and downstream of the sipF gene in Bradyrhizobium japonicum.

    PubMed

    Müller, P

    2004-04-01

    The DNA regions upstream and downstream of the Bradyrhizobium japonicum gene sipF were cloned by in vivo techniques and subsequently sequenced. In order to study the function of the predicted genes, a new transposon for in vitro mutagenesis, TnKPK2, was constructed. This mutagenesis system has a number of advantages over other transposons. TnKPK2 itself has no transposase gene, making transposition events stable. Extremely short inverted repeats minimize the length of the transposable element and facilitate the determination of the nucleotide sequence of the flanking regions. Since the transposable element carries a promoterless 'phoA reporter gene, the appearance of functional PhoA fusion proteins indicates that TnKPK2 has inserted in a gene encoding a periplasmic or secreted protein. Although such events are extremely rare, because the transposon has to insert in-frame, in the correct orientation, and at an appropriate location in the target molecule, a direct screening procedure on agar indicator plates permits the identification of candidate clones from large numbers of colonies. In this study, TnKPK2 was used for the construction of various symbiotic mutants of B. japonicum. One of the mutant strains, A2-10, which is defective in a gene encoding a protein that comigrates with bacterioferritin (bcpB), was found to induce the formation of small and ineffective nodules.

  5. MicroRNAs Suppress NB Domain Genes in Tomato That Confer Resistance to Fusarium oxysporum

    PubMed Central

    Ouyang, Shouqiang; Park, Gyungsoon; Atamian, Hagop S.; Han, Cliff S.; Stajich, Jason E.; Kaloshian, Isgouhi; Borkovich, Katherine A.

    2014-01-01

    MicroRNAs (miRNAs) suppress the transcriptional and post-transcriptional expression of genes in plants. Several miRNA families target genes encoding nucleotide-binding site–leucine-rich repeat (NB-LRR) plant innate immune receptors. The fungus Fusarium oxysporum f. sp. lycopersici causes vascular wilt disease in tomato. We explored a role for miRNAs in tomato defense against F. oxysporum using comparative miRNA profiling of susceptible (Moneymaker) and resistant (Motelle) tomato cultivars. slmiR482f and slmiR5300 were repressed during infection of Motelle with F. oxysporum. Two predicted mRNA targets each of slmiR482f and slmiR5300 exhibited increased expression in Motelle and the ability of these four targets to be regulated by the miRNAs was confirmed by co-expression in Nicotiana benthamiana. Silencing of the targets in the resistant Motelle cultivar revealed a role in fungal resistance for all four genes. All four targets encode proteins with full or partial nucleotide-binding (NB) domains. One slmiR5300 target corresponds to tm-2, a susceptible allele of the Tomato Mosaic Virus resistance gene, supporting functions in immunity to a fungal pathogen. The observation that none of the targets correspond to I-2, the only known resistance (R) gene for F. oxysporum in tomato, supports roles for additional R genes in the immune response. Taken together, our findings suggest that Moneymaker is highly susceptible because its potential resistance is insufficiently expressed due to the action of miRNAs. PMID:25330340

  6. Identification and functional analysis of the NLP-encoding genes from the phytopathogenic oomycete Phytophthora capsici.

    PubMed

    Chen, Xiao-Ren; Huang, Shen-Xin; Zhang, Ye; Sheng, Gui-Lin; Li, Yan-Peng; Zhu, Feng

    2018-03-23

    Phytophthora capsici is a hemibiotrophic, phytopathogenic oomycete that infects a wide range of crops, resulting in significant economic losses worldwide. By means of a diverse arsenal of secreted effector proteins, hemibiotrophic pathogens may manipulate plant cell death to establish a successful infection and colonization. In this study, we described the analysis of the gene family encoding necrosis- and ethylene-inducing peptide 1 (Nep1)-like proteins (NLPs) in P. capsici, and identified 39 real NLP genes and 26 NLP pseudogenes. Out of the 65 predicted NLP genes, 48 occur in groups with two or more genes, whereas the remainder appears to be singletons distributed randomly among the genome. Phylogenetic analysis of the 39 real NLPs delineated three groups. Key residues/motif important for the effector activities are degenerated in most NLPs, including the nlp24 peptide consisting of the conserved region I (11-aa immunogenic part) and conserved region II (the heptapeptide GHRHDWE motif) that is important for phytotoxic activity. Transcriptional profiling of eight selected NLP genes indicated that they were differentially expressed during the developmental and plant infection phases of P. capsici. Functional analysis of ten cloned NLPs demonstrated that Pc11951, Pc107869, Pc109174 and Pc118548 were capable of inducing cell death in the Solanaceae, including Nicotiana benthamiana and hot pepper. This study provides an overview of the P. capsici NLP gene family, laying a foundation for further elucidating the pathogenicity mechanism of this devastating pathogen.

  7. Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals.

    PubMed

    Popova, Olga V; Mikhailov, Kirill V; Nikitin, Mikhail A; Logacheva, Maria D; Penin, Aleksey A; Muntyan, Maria S; Kedrova, Olga S; Petrov, Nikolai B; Panchin, Yuri V; Aleoshin, Vladimir V

    2016-01-01

    Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha-an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia.
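
    The start-codon consensus "DTD" quoted in the abstract above is written in IUPAC single-letter ambiguity code, where D stands for A, G or T. As a minimal sketch (the helper names expand and matches are hypothetical and not part of the cited study), the consensus can be expanded into the concrete codons it permits. The expansion is a superset of the four or five start codons the authors report; it does not identify which ones are actually used.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand(consensus):
    """Return every concrete sequence allowed by an IUPAC consensus string."""
    choices = [IUPAC[symbol] for symbol in consensus.upper()]
    return ["".join(combo) for combo in product(*choices)]


def matches(codon, consensus):
    """True if the codon is compatible with the consensus, position by position."""
    if len(codon) != len(consensus):
        return False
    return all(base in IUPAC[symbol]
               for base, symbol in zip(codon.upper(), consensus.upper()))


if __name__ == "__main__":
    codons = expand("DTD")
    print(len(codons), codons)      # 3 x 1 x 3 = 9 candidate codons
    print(matches("ATG", "DTD"))    # canonical start codon fits the consensus
    print(matches("CTG", "DTD"))    # C is not allowed at the first position
```

    The consensus therefore admits ATG, ATA, ATT, GTG, TTG and four further triplets, all with T in the middle position.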

  8. Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals

    PubMed Central

    Popova, Olga V.; Mikhailov, Kirill V.; Nikitin, Mikhail A.; Logacheva, Maria D.; Penin, Aleksey A.; Muntyan, Maria S.; Kedrova, Olga S.; Petrov, Nikolai B.; Panchin, Yuri V.

    2016-01-01

    Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha—an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia. 
PMID:27755612

  9. A plasmid-encoded UmuD homologue regulates expression of Pseudomonas aeruginosa SOS genes.

    PubMed

    Díaz-Magaña, Amada; Alva-Murillo, Nayeli; Chávez-Moctezuma, Martha P; López-Meza, Joel E; Ramírez-Díaz, Martha I; Cervantes, Carlos

    2015-07-01

    The Pseudomonas aeruginosa plasmid pUM505 contains the umuDC operon that encodes proteins similar to error-prone repair DNA polymerase V. The umuC gene appears to be truncated and its product is probably not functional. The umuD gene, renamed umuDpR, possesses an SOS box overlapped with a Sigma factor 70 type promoter; accordingly, transcriptional fusions revealed that the umuDpR gene promoter is activated by mitomycin C. The predicted sequence of the UmuDpR protein displays 23% identity with the Ps. aeruginosa SOS-response LexA repressor. The umuDpR gene caused increased MMC sensitivity when transferred to the Ps. aeruginosa PAO1 strain. As expected, PAO1-derived knockout lexA- mutant PW6037 showed resistance to MMC; however, when the umuDpR gene was transferred to PW6037, MMC resistance level was reduced. These data suggested that UmuDpR represses the expression of SOS genes, as LexA does. To test whether UmuDpR exerts regulatory functions, expression of PAO1 SOS genes was evaluated by reverse transcription quantitative PCR assays in the lexA- mutant with or without the pUC_umuD recombinant plasmid. Expression of lexA, imuA and recA genes increased 3.4-5.3 times in the lexA- mutant, relative to transcription of the corresponding genes in the lexA+ strain, but decreased significantly in the lexA-/umuDpR transformant. These results confirmed that the UmuDpR protein is a repressor of Ps. aeruginosa SOS genes controlled by LexA. Electrophoretic mobility shift assays, however, did not show binding of UmuDpR to 5' regions of SOS genes, suggesting an indirect mechanism of regulation.

  10. A multicopper oxidase is essential for manganese oxidation and laccase-like activity in Pedomicrobium sp. ACM 3067.

    PubMed

    Ridge, Justin P; Lin, Marianne; Larsen, Eloise I; Fegan, Mark; McEwan, Alastair G; Sly, Lindsay I

    2007-04-01

    Pedomicrobium sp. ACM 3067 is a budding-hyphal bacterium belonging to the alpha-Proteobacteria which is able to oxidize soluble Mn2+ to insoluble manganese oxide. A cosmid, from a whole-genome library, containing the putative genes responsible for manganese oxidation was identified and a primer-walking approach yielded 4350 bp of novel sequence. Analysis of this sequence showed the presence of a predicted three-gene operon, moxCBA. The moxA gene product showed homology to multicopper oxidases (MCOs) and contained the characteristic four copper-binding motifs (A, B, C and D) common to MCOs. An insertion mutation of moxA showed that this gene was essential for both manganese oxidation and laccase-like activity. The moxB gene product showed homology to a family of outer membrane proteins which are essential for Type I secretion in Gram-negative bacteria. moxBA has not been observed in other manganese-oxidizing bacteria but homologues were identified in the genomes of several bacteria including Sinorhizobium meliloti 1021 and Agrobacterium tumefaciens C58. These results suggest that moxBA and its homologues constitute a family of genes encoding an MCO and a predicted component of the Type I secretion system.

  11. Differential patterns of acquired virulence genes distinguish Salmonella strains

    PubMed Central

    Conner, Christopher P.; Heithoff, Douglas M.; Julio, Steven M.; Sinsheimer, Robert L.; Mahan, Michael J.

    1998-01-01

    Analysis of several Salmonella typhimurium in vivo-induced genes located in regions of atypical base composition has uncovered acquired genetic elements that cumulatively engender pathogenicity. Many of these regions are associated with mobile elements, encode predicted adhesin and invasin-like functions, and are required for full virulence. Some of these regions distinguish broad host range from host-adapted Salmonella serovars and may contribute to inherent differences in host specificity, tissue tropism, and disease manifestation. Maintenance of this archipelago of acquired sequence by selection in specific hosts reveals a fossil record of the evolution of pathogenic species. PMID:9539791

  12. Characterization of ROS1 cDNA from a human glioblastoma cell line

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Birchmeier, C.; O'Neill, K.; Riggs, M.

    1990-06-01

    The authors have isolated and characterized a human ROS1 cDNA from the glioblastoma cell line SW-1088. The cDNA, 8.3 kilobases long, has the potential to encode a transmembrane tyrosine-specific protein kinase with a predicted molecular mass of 259 kDa. The putative extracellular domain of ROS1 is homologous to the extracellular domain of the sevenless gene product from Drosophila. No comparable similarities in the extracellular domains were found between ROS1 and other receptor-type tyrosine kinases. Together, ROS1 and sevenless gene products define a distinct subclass of transmembrane tyrosine kinases.

  13. Characterization and phylogenetic analysis of lectin gene cDNA isolated from sea cucumber (Apostichopus japonicus) body wall

    NASA Astrophysics Data System (ADS)

    Xue, Zhuang; Li, Hui; Liu, Yang; Zhou, Wei; Sun, Jing; Wang, Xiuli

    2017-12-01

    As a 'living fossil' of species origin and a 'rich treasure' for food and nutrition development, the sea cucumber has received a lot of attention from researchers. The cDNA library construction and EST sequencing of blood had been conducted previously in our lab. The bioinformatic analysis provided a gene fragment that is highly homologous to genes of the lectin family, named AjL (Apostichopus japonicus lectin). To characterize and determine the phylogeny of AjL genes in early evolution, we isolated a full-length cDNA of a lectin gene from the body wall of A. japonicus. The open reading frame of this gene contained 489 bp and encoded a 163-amino-acid secretory protein homologous to lectins of mammals and aquatic organisms. The deduced protein included a lectin-like domain. SDS-PAGE analysis showed that AjL migrated as a specific band (about 36.09 kDa under reducing conditions) and agglutinated rabbit red blood cells. AjL was similar to chain A of CEL-IV in spatial structure. We predicted that AjL may play the same role as CEL-IV. Our results suggested that more than one lectin gene functioned in sea cucumber and most other species, which was fused by uncertain sequences during evolution and encoded different proteins with diverse functions. Our findings provide insights into the function and characteristics of lectin genes in invertebrates. The results will also be helpful for the identification and structural, functional, and evolutionary analyses of lectin genes.

  14. Global gene expression under nitrogen starvation in Xylella fastidiosa: contribution of the σ54 regulon

    PubMed Central

    2010-01-01

    Background Xylella fastidiosa, a Gram-negative fastidious bacterium, grows in the xylem of several plants causing diseases such as citrus variegated chlorosis. As the xylem sap contains low concentrations of amino acids and other compounds, X. fastidiosa needs to cope with nitrogen limitation in its natural habitat. Results In this work, we performed a whole-genome microarray analysis of the X. fastidiosa nitrogen starvation response. A time course experiment (2, 8 and 12 hours) of cultures grown in defined medium under nitrogen starvation revealed many differentially expressed genes, such as those related to transport, nitrogen assimilation, amino acid biosynthesis, transcriptional regulation, and many genes encoding hypothetical proteins. In addition, a decrease in the expression levels of many genes involved in carbon metabolism and energy generation pathways was also observed. Comparison of gene expression profiles between the wild type strain and the rpoN null mutant allowed the identification of genes directly or indirectly induced by nitrogen starvation in a σ54-dependent manner. A more complete picture of the σ54 regulon was achieved by combining the transcriptome data with an in silico search for potential σ54-dependent promoters, using a position weight matrix approach. One of these σ54-predicted binding sites, located upstream of the glnA gene (encoding glutamine synthetase), was validated by primer extension assays, confirming that this gene has a σ54-dependent promoter. Conclusions Together, these results show that nitrogen starvation causes intense changes in the X. fastidiosa transcriptome and some of these differentially expressed genes belong to the σ54 regulon. PMID:20799976
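
    The in silico promoter search described above relies on a position weight matrix (PWM). The following is a minimal sketch of that general technique, not the authors' matrix or pipeline: the aligned sites in TOY_SITES are invented for illustration, a uniform background frequency of 0.25 per base is assumed, and the function names are hypothetical. A log-odds matrix is built from aligned sites with a pseudocount, and every window of a sequence is scored against it.

```python
import math

# Invented, illustrative aligned binding sites (NOT taken from the cited study).
TOY_SITES = ["TGGCAC", "TGGCAT", "TGGTAC", "CGGCAC"]


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds PWM from equal-length, upper-case A/C/G/T sites."""
    width = len(sites[0])
    pwm = []
    for i in range(width):
        counts = {base: pseudocount for base in "ACGT"}
        for site in sites:
            counts[site[i]] += 1
        total = sum(counts.values())
        pwm.append({base: math.log2((counts[base] / total) / background)
                    for base in "ACGT"})
    return pwm


def score(pwm, window):
    """Sum the per-position log-odds for one window of the same width as the PWM."""
    return sum(column[base] for column, base in zip(pwm, window))


def scan(pwm, seq):
    """Score every window of an upper-case A/C/G/T sequence; return (position, score) pairs."""
    width = len(pwm)
    return [(i, score(pwm, seq[i:i + width]))
            for i in range(len(seq) - width + 1)]


def best_hit(pwm, seq):
    """Return the (position, score) pair with the highest score."""
    return max(scan(pwm, seq), key=lambda hit: hit[1])


if __name__ == "__main__":
    pwm = build_pwm(TOY_SITES)
    position, value = best_hit(pwm, "AATTTGGCACGATT")
    print(position, round(value, 3))
```

    In practice a score threshold would be chosen, for example from the score distribution of known sites, and hits upstream of annotated genes would then be compared with expression data, as the abstract describes for glnA.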

  15. Comparative Genomics of the Ubiquitous, Hydrocarbon-degrading Genus Marinobacter

    NASA Astrophysics Data System (ADS)

    Singer, E.; Webb, E.; Edwards, K. J.

    2012-12-01

    The genus Marinobacter is amongst the most ubiquitous in the global oceans and strains have been isolated from a wide variety of marine environments, including offshore oil-well heads, coastal thermal springs, Antarctic sea water, saline soils and associations with diatoms and dinoflagellates. Many strains have been recognized to be important hydrocarbon degraders in various marine habitats presenting sometimes extreme pH or salinity conditions. Analysis of the genome of M. aquaeolei revealed enormous adaptation versatility with an assortment of strategies for carbon and energy acquisition, sensation, and defense. In an effort to elucidate the ecological and biogeochemical significance of the Marinobacters, seven Marinobacter strains from diverse environments were included in a comparative genomics study. Genomes were screened for metabolic and adaptation potential to elucidate the strategies responsible for the omnipresence of the Marinobacter genus and their remedial action potential in hydrocarbon-polluted waters. The core genome predominantly encodes for key genes involved in hydrocarbon degradation, biofilm-relevant processes, including utilization of external DNA, halotolerance, as well as defense mechanisms against heavy metals, antibiotics, and toxins. All Marinobacter strains were observed to degrade a wide spectrum of hydrocarbon species, including aliphatic, polycyclic aromatic as well as acyclic isoprenoid compounds. Various genes predicted to facilitate hydrocarbon degradation, e.g. alkane 1-monooxygenase, appear to have originated from lateral gene transfer as they are located on gene clusters of 10-20% lower GC-content compared to genome averages and are flanked by transposases. Top ortholog hits are found in other hydrocarbon degrading organisms, e.g. Alcanivorax borkumensis. 
Strategies for hydrocarbon uptake encoded by various Marinobacter strains include cell surface hydrophobicity adaptation via capsular polysaccharide biosynthesis and attachment using fimbriae and pili. Formation of biofilm with biosurfactant characteristics has been observed in Marinobacter cultures and environmental strains in relation to hydrocarbon degradation. Genomic potential exists for the synthesis of biofilm-related carbon and energy storage compounds, e.g. alginate and isoprenoid wax esters, and quorum sensing encoded by the regulatory luxR gene and N-acyl-L-homoserine lactone (AHL) signals. Halotolerance is predicted to be achieved through biosynthesis and/or import of compatible solutes, including glycine betaine, choline, ectoine, sucrose, periplasmic glucans as well as membrane channel activity regulating intracellular sodium, potassium and chloride concentration balance. Gene abundances concur with those observed in sequenced halophilic Halomonas genomes. Defense mechanisms are plentiful and include arsenate, organic solvent, copper, and mercuric resistance, compounds, which frequently occur in oil refinery wastewater. The Marinobacter genomes reflect dynamic environments and diverse interactions with viruses and other bacteria with similar metabolic strategies, as reflected by the large number of integrases and transposases. This study has provided comprehensive genomic insights into the metabolic versatility and predicted environmental impact potential of one of the most ubiquitous bacterial genera.
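
    One line of evidence cited above for lateral gene transfer is that certain gene clusters have a GC content 10-20% lower than the genome average. A minimal sketch of that comparison follows; the sequences, the genome average of 0.60 and the function names are hypothetical, and the difference is interpreted here as an absolute difference in GC fraction, which the abstract does not specify.

```python
def gc_content(seq):
    """Fraction of G and C among the unambiguous bases (A, C, G, T) of a sequence."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / acgt


def flag_low_gc(regions, genome_gc, min_drop=0.10):
    """Return names of regions whose GC fraction is at least min_drop below genome_gc.

    regions maps a region name to its nucleotide sequence. The threshold is an
    absolute difference in GC fraction (an assumption made for this sketch).
    """
    return [name for name, seq in regions.items()
            if genome_gc - gc_content(seq) >= min_drop]


if __name__ == "__main__":
    # Invented toy sequences; real gene clusters would span thousands of bases.
    regions = {
        "cluster_a": "ATGCATATAT",  # GC fraction 0.2
        "cluster_b": "GCGCATATGC",  # GC fraction 0.6
    }
    print(flag_low_gc(regions, genome_gc=0.60))
```

    A low GC fraction alone is only suggestive; the abstract combines it with flanking transposases and ortholog hits in other hydrocarbon-degrading organisms.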

  16. Genome complexity in the coelacanth is reflected in its adaptive immune system

    USGS Publications Warehouse

    Saha, Nil Ratan; Ota, Tatsuya; Litman, Gary W.; Hansen, John; Parra, Zuly; Hsu, Ellen; Buonocore, Francesco; Canapa, Adriana; Cheng, Jan-Fang; Amemiya, Chris T.

    2014-01-01

    We have analyzed the available genome and transcriptome resources from the coelacanth in order to characterize genes involved in adaptive immunity. Two highly distinctive IgW-encoding loci have been identified that exhibit a unique genomic organization, including a multiplicity of tandemly repeated constant region exons. The overall organization of the IgW loci precludes typical heavy chain class switching. A locus encoding IgM could not be identified either computationally or by using several different experimental strategies. Four distinct sets of genes encoding Ig light chains were identified. This includes a variant sigma-type Ig light chain previously identified only in cartilaginous fishes and which is now provisionally denoted sigma-2. Genes encoding α/β and γ/δ T-cell receptors, and CD3, CD4, and CD8 co-receptors also were characterized. Ig heavy chain variable region genes and TCR components are interspersed within the TCR α/δ locus; this organization previously was reported only in tetrapods and raises questions regarding evolution and functional cooption of genes encoding variable regions. The composition, organization and syntenic conservation of the major histocompatibility complex locus have been characterized. We also identified large numbers of genes encoding cytokines and their receptors, and other genes associated with adaptive immunity. In terms of sequence identity and organization, the adaptive immune genes of the coelacanth more closely resemble orthologous genes in tetrapods than those in teleost fishes, consistent with current phylogenomic interpretations. Overall, the work described herein highlights the complexity inherent in the coelacanth genome and provides a rich catalog of immune genes for future investigations.

  17. Improving the annotation of the Heterorhabditis bacteriophora genome.

    PubMed

    McLean, Florence; Berger, Duncan; Laetsch, Dominik R; Schwartz, Hillel T; Blaxter, Mark

    2018-04-01

    Genome assembly and annotation remain exacting tasks. As the tools available for these tasks improve, it is useful to return to data produced with earlier techniques to assess their credibility and correctness. The entomopathogenic nematode Heterorhabditis bacteriophora is widely used to control insect pests in horticulture. The genome sequence for this species was reported to encode an unusually high proportion of unique proteins and a paucity of secreted proteins compared to other related nematodes. We revisited the H. bacteriophora genome assembly and gene predictions to determine whether these unusual characteristics were biological or methodological in origin. We mapped an independent resequencing dataset to the genome and used the blobtools pipeline to identify potential contaminants. While present (0.2% of the genome span, 0.4% of predicted proteins), assembly contamination was not significant. Re-prediction of the gene set using BRAKER1 and published transcriptome data generated a predicted proteome that was very different from the published one. The new gene set had a much reduced complement of unique proteins, better completeness values that were in line with other related species' genomes, and an increased number of proteins predicted to be secreted. It is thus likely that methodological issues drove the apparent uniqueness of the initial H. bacteriophora genome annotation and that similar contamination and misannotation issues affect other published genome assemblies.

  18. The Bacillus subtilis ywjI (glpX) gene encodes a class II fructose-1,6-bisphosphatase, functionally equivalent to the class III Fbp enzyme.

    PubMed

    Jules, Matthieu; Le Chat, Ludovic; Aymerich, Stéphane; Le Coq, Dominique

    2009-05-01

    We present here experimental evidence that the Bacillus subtilis ywjI gene encodes a class II fructose-1,6-bisphosphatase, functionally equivalent to the fbp-encoded class III enzyme, and constitutes with the upstream gene, murAB, an operon transcribed at the same level under glycolytic or gluconeogenic conditions.

  19. The Bacillus subtilis ywjI (glpX) Gene Encodes a Class II Fructose-1,6-Bisphosphatase, Functionally Equivalent to the Class III Fbp Enzyme

    PubMed Central

    Jules, Matthieu; Le Chat, Ludovic; Aymerich, Stéphane; Le Coq, Dominique

    2009-01-01

    We present here experimental evidence that the Bacillus subtilis ywjI gene encodes a class II fructose-1,6-bisphosphatase, functionally equivalent to the fbp-encoded class III enzyme, and constitutes with the upstream gene, murAB, an operon transcribed at the same level under glycolytic or gluconeogenic conditions. PMID:19270101

  20. Low-molecular-weight glutenin subunits from the 1U genome of Aegilops umbellulata confer superior dough rheological properties and improve breadmaking quality of bread wheat.

    PubMed

    Wang, Jian; Wang, Chang; Zhen, Shoumin; Li, Xiaohui; Yan, Yueming

    2018-04-01

    Wheat-related genomes may carry new glutenin genes with the potential for quality improvement of breadmaking. In this study, we estimated the gluten quality properties of the wheat line CNU609 derived from crossing between Chinese Spring (CS, Triticum aestivum L., 2n = 6x = 42, AABBDD) and the wheat Aegilops umbellulata (2n = 2x = 14, UU) 1U(1B) substitution line, and investigated the function of 1U-encoded low-molecular-weight glutenin subunits (LMW-GS). The main quality parameters of CNU609 were significantly improved due to introgression of the 1U genome, including dough development time, stability time, farinograph quality number, gluten index, loaf size and inner structure. Glutenin analysis showed that CNU609 and CS had the same high-molecular-weight glutenin subunit (HMW-GS) composition, but CNU609 carried eight specific 1U genome-encoded LMW-GS. The introgression of the 1U-encoded LMW-GS led to more and larger protein body formation in the CNU609 endosperm. Two new LMW-m type genes from the 1U genome, designated Glu-U3a and Glu-U3b, were cloned and characterized. Secondary structure prediction implied that both Glu-U3a and Glu-U3b encode subunits with high α-helix and β-strand content that could benefit the formation of superior gluten structure. Our results indicate that the 1U genome has superior LMW-GS that can be used as new gene resources for wheat gluten quality improvement. © 2017 Society of Chemical Industry.

  1. Plant polycistronic precursors containing non-homologous microRNAs target transcripts encoding functionally related proteins

    PubMed Central

    2009-01-01

    Background MicroRNAs (miRNAs) are endogenous single-stranded small RNAs that regulate the expression of specific mRNAs involved in diverse biological processes. In plants, miRNAs are generally encoded as a single species in independent transcriptional units, referred to as MIRNA genes, in contrast to animal miRNAs, which are frequently clustered. Results We performed a comparative genomic analysis in three model plants (rice, poplar and Arabidopsis) and characterized miRNA clusters containing two to eight miRNA species. These clusters usually encode miRNAs of the same family and certain share a common evolutionary origin across monocot and dicot lineages. In addition, we identified miRNA clusters harboring miRNAs with unrelated sequences that are usually not evolutionarily conserved. Strikingly, non-homologous miRNAs from the same cluster were predicted to target transcripts encoding related proteins. At least four Arabidopsis non-homologous clusters were expressed as single transcriptional units. Overexpression of one of these polycistronic precursors, producing Ath-miR859 and Ath-miR774, led to the DCL1-dependent accumulation of both miRNAs and down-regulation of their different mRNA targets encoding F-box proteins. Conclusions In addition to polycistronic precursors carrying related miRNAs, plants also contain precursors allowing coordinated expression of non-homologous miRNAs to co-regulate functionally related target transcripts. This mechanism paves the way for using polycistronic MIRNA precursors as a new molecular tool for plant biologists to simultaneously control the expression of different genes. PMID:19951405

  2. Reevaluation of the Coding Potential and Proteomic Analysis of the BAC Derived Rhesus Cytomegalovirus Strain 68-1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Malouli, Daniel; Nakayasu, Ernesto S.; Viswanathan, Kasinath

    2012-09-01

    Cytomegaloviruses are highly host restricted resulting in co-speciation with their hosts. As a natural pathogen of rhesus macaques (RM), Rhesus Cytomegalovirus (RhCMV) has therefore emerged as a highly relevant experimental model for pathogenesis and vaccine development due to its close evolutionary relationship to human CMV (HCMV). To date, most in vivo experiments performed with RhCMV employed strain 68-1 cloned as bacterial artificial chromosome (BAC). However, the complete genome sequence of the 68-1 BAC has not been determined. Furthermore, the gene content of the RhCMV genome is unknown and previous open reading frame (ORF) predictions relied solely on uninterrupted ORFs with an arbitrary cutoff of 300 bp. To obtain a more precise picture of the actual proteins encoded by the most commonly used molecular clone of RhCMV we re-evaluated the RhCMV 68-1 BAC-genome by whole genome shotgun sequencing and determined the protein content of the resulting RhCMV virions by proteomics. By additionally comparing the RhCMV genome to that of several closely related Old World Monkey (OWM) CMVs we were able to filter out many unlikely ORFs and obtain a simplified map of the RhCMV genome. This comparative genomics analysis eliminated many genes previously characterized as RhCMV-specific while consolidating a high conservation of ORFs among OWM-CMVs and between RhCMV and HCMV. Moreover, virion proteomics independently validated the revised ORF predictions since only proteins encoded by predicted ORFs could be detected. Taken together these data suggest a much higher conservation of genome and virion structure between CMVs of humans, apes and OWMs than previously assumed. Remarkably, BAC-derived RhCMV is able to establish and maintain persistent infection despite the lack of multiple genes homologous to HCMV genes involved in tissue tropism.

  3. Genomic organization of the human mi-er1 gene and characterization of alternatively spliced isoforms: regulated use of a facultative intron determines subcellular localization.

    PubMed

    Paterno, Gary D; Ding, Zhihu; Lew, Yuan-Y; Nash, Gord W; Mercer, F Corinne; Gillespie, Laura L

    2002-07-24

    mi-er1 (previously called er1) is a fibroblast growth factor-inducible early response gene activated during mesoderm induction in Xenopus embryos and encoding a nuclear protein that functions as a transcriptional activator. The human orthologue of mi-er1 was shown to be upregulated in breast carcinoma cell lines and breast tumours when compared to normal breast cells. In this report, we investigate the structure of the human mi-er1 (hmi-er1) gene and characterize the alternatively spliced transcripts and protein isoforms. hmi-er1 is a single copy gene located at 1p31.2 and spanning 63 kb. It contains 17 exons and includes one skipped exon, a facultative intron and three polyadenylation signals to produce 12 transcripts encoding six distinct proteins. hmi-er1 transcripts were expressed at very low levels in most human adult tissues and the mRNA isoform pattern varied with the tissue. The 12 transcripts encode proteins containing a common internal sequence with variable N- and C-termini. Three distinct N- and two distinct C-termini were identified, giving rise to six protein isoforms. The two C-termini differ significantly in size and sequence and arise from alternate use of a facultative intron to produce hMI-ER1alpha and hMI-ER1beta. In all tissues except testis, transcripts encoding the beta isoform were predominant. hMI-ER1alpha lacks the predicted nuclear localization signal and transfection assays revealed that, unlike hMI-ER1beta, it is not a nuclear protein, but remains in the cytoplasm. Our results demonstrate that alternate use of a facultative intron regulates the subcellular localization of hMI-ER1 proteins and this may have important implications for hMI-ER1 function.

  4. A missense mutation encoding Cys73Phe in neurophysin II is associated with autosomal dominant neurohypophyseal diabetes insipidus.

    PubMed

    Santiprabhob, Jeerunda; Browning, James; Repaske, David

    2002-01-01

    Autosomal dominant neurohypophyseal diabetes insipidus (ADNDI) is an inherited disease caused by progressive deficiency of the hormone arginine vasopressin (AVP) that typically becomes clinically apparent in the first decade of life. The genetic locus of ADNDI is the arginine vasopressin-neurophysin II (AVP-NPII) gene and mutations that cause ADNDI have been found in the nucleotides encoding the signal peptide, vasopressin, and neurophysin II peptides. In this study we have analyzed the AVP-NPII gene in a 20-year-old female who was diagnosed with ADNDI at 2 years of age. A heterozygous missense mutation (1684G>T) was found in exon 2 that predicts replacement of cysteine with phenylalanine at position 73 of neurophysin II. The mutation was confirmed by subcloning exon 2 PCR products to sequence each allele independently. Two out of four clones were found to have the missense mutation and two have the normal sequence, confirming the presence of the mutation and heterozygosity. Neurophysin II is an intracellular carrier protein for AVP during axonal transport from the hypothalamus to the posterior pituitary and contains 14 cysteine residues forming 7 disulfide bonds. This mutation is predicted to disrupt the disulfide bridge between Cys73 and Cys61 within the neurophysin II moiety. This finding of a novel mutation substituting cysteine with phenylalanine in one AVP-NPII gene allele supports the hypothesis that inability to form normal disulfide bonds in neurophysin II leads to ADNDI.

  5. The Bordetella bhu Locus Is Required for Heme Iron Utilization

    PubMed Central

    Vanderpool, Carin K.; Armstrong, Sandra K.

    2001-01-01

    Bordetella pertussis and Bordetella bronchiseptica are capable of obtaining iron from hemin and hemoglobin. Genes encoding a putative bacterial heme iron acquisition system (bhu, for Bordetella heme utilization) were identified in a B. pertussis genomic sequence database, and the corresponding DNA was isolated from a virulent strain of B. pertussis. A B. pertussis bhuR mutant, predicted to lack the heme outer membrane receptor, was generated by allelic exchange. In contrast to the wild-type strain, bhuR mutant PM5 was incapable of acquiring iron from hemin and hemoglobin; genetic complementation of PM5 with the cloned bhuRSTUV genes restored heme utilization to wild-type levels. In parallel studies, B. bronchiseptica bhu sequences were also identified and a B. bronchiseptica bhuR mutant was constructed and confirmed to be defective in heme iron acquisition. The wild-type B. bronchiseptica parent strain grown under low-iron conditions produced the presumptive BhuR protein, which was absent in the bhuR mutant. Furthermore, production of BhuR by iron-starved B. bronchiseptica was markedly enhanced by culture in hemin-supplemented medium, suggesting that these organisms sense and respond to heme in the environment. Analysis of the genetic region upstream of the bhu cluster identified open reading frames predicted to encode homologs of the Escherichia coli ferric citrate uptake regulators FecI and FecR. These putative Bordetella regulators may mediate heme-responsive positive transcriptional control of the bhu genes. PMID:11418569

  6. The Influence of Genetics on Cystic Fibrosis Phenotypes

    PubMed Central

    Knowles, Michael R.; Drumm, Mitchell

    2012-01-01

    Technological advances in genetics have made feasible and affordable large studies to identify genetic variants that cause or modify a trait. Genetic studies have been carried out to assess variants in candidate genes, as well as polymorphisms throughout the genome, for their associations with heritable clinical outcomes of cystic fibrosis (CF), such as lung disease, meconium ileus, and CF-related diabetes. The candidate gene approach has identified some predicted relationships, while genome-wide surveys have identified several genes that would not have been obvious disease-modifying candidates, such as a methionine sulfoxide transferase gene that influences intestinal obstruction, or a region on chromosome 11 proximate to genes encoding a transcription factor and an apoptosis controller that associates with lung function. These unforeseen associations thus provide novel insight into disease pathophysiology, as well as suggesting new therapeutic strategies for CF. PMID:23209180

  7. Molecular cloning and tissue distribution of peroxisome proliferator-activated receptor-alpha (PPARα) and gamma (PPARγ) in the pigeon (Columba livia domestica).

    PubMed

    Xie, P; Yuan, C; Wang, C; Zou, X-T; Po, Z; Tong, H-B; Zou, J-M

    2014-01-01

    1. Peroxisome proliferator-activated receptors (PPAR) are involved in lipid metabolism through transcriptional regulation of target gene expression. The objective of the current study was to clone and characterise the PPARα and PPARγ genes in pigeon. 2. The full-length of 1941-bp PPARα and 1653-bp PPARγ were cloned from pigeons. The two genes were predicted to encode 468 and 475 amino acids, respectively. Both proteins contained two C4-type zinc fingers, a nuclear hormone receptor DNA-binding region signature and a HOLI domain (ligand binding domain of hormone receptors), and had high identities with other corresponding avian genes. 3. Using quantitative real-time PCR, pigeon PPARα gene expression was shown to be high in kidney, liver, gizzard and duodenum whereas PPARγ was predominantly expressed in adipose tissue.

  8. The complete mitochondrial genome of the medicinal fungus Ganoderma applanatum (Polyporales, Basidiomycota).

    PubMed

    Wang, Xin-Cun; Shao, Junjie; Liu, Chang

    2016-07-01

    We have determined the complete nucleotide sequence of the mitochondrial genome of the medicinal fungus Ganoderma applanatum (Pers.) Pat. using the next-generation sequencing technology. The circular molecule is 119,803 bp long with a GC content of 26.66%. Gene prediction revealed genes encoding 15 conserved proteins, 25 tRNAs, the large and small ribosomal RNAs, all genes are located on the same strand except trnW-CCA. Compared with previously sequenced genomes of G. lucidum, G. meredithiae and G. sinense, the order of the protein and rRNA genes is highly conserved; however, the types of tRNA genes are slightly different. The mitochondrial genome of G. applanatum will contribute to the understanding of the phylogeny and evolution of Ganoderma and Ganodermataceae, the group containing many species with high medicinal values.
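    The GC content quoted in this record (26.66% for a 119,803-bp molecule) is the share of G and C bases in the sequence. A minimal Python sketch of that calculation follows; the function name `gc_content` and the toy sequence are illustrative assumptions and are not taken from the cited study.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy sequence (hypothetical): 2 of its 10 bases are G or C.
toy = "ATGCATTTAA"
print(gc_content(toy))
```

    Applied to a full mitochondrial genome read from a FASTA file, the same function would give the genome-wide figure, provided ambiguous bases such as N are handled as the analyst sees fit.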

  9. Molecular cloning and characterization of an SRCAP chromatin remodeling homologue in Toxoplasma gondii.

    PubMed

    Sullivan, William J; Monroy, M Alexandra; Bohne, Wolfgang; Nallani, Karuna C; Chrivia, John; Yaciuk, Peter; Smith, Charles K; Queener, Sherry F

    2003-05-01

    We have identified and mapped a gene in Toxoplasma gondii that encodes a homologue of SRCAP (Snf2-related CBP activator protein), a member of the SNF/SWI family of chromatin remodeling factors. The genomic locus (TgSRCAP) is present as a single copy and contains 16 introns. The predicted cDNA contains an open reading frame of 8,775 bp and encodes a protein of 2,924 amino acids. We have identified additional SRCAP-like sequences in Apicomplexa for comparison by screening genomic databases. An analysis of SRCAP homologues between species reveals signature features that may be indicative of SRCAP members. Expression of mRNA encoding TgSRCAP is upregulated when tachyzoite (invasive form) parasites are induced to differentiate into bradyzoites (encysted form) in vitro. Recombinant TgSRCAP protein is functionally equivalent to the human homologue, being capable of increasing transcription mediated by CREB.
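    The two sizes reported in this record are consistent with each other if the 8,775-bp open reading frame is counted with its stop codon: 8,775 / 3 = 2,925 codons, of which one is the stop, leaving 2,924 amino acids. The sketch below shows that arithmetic; the helper name `protein_length` and the assumption that the stop codon is included in the ORF length are ours, not the authors'.

```python
def protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of orf_bp base pairs.

    Assumes the ORF is in frame; if includes_stop is True, the final
    codon is a stop codon and does not contribute an amino acid.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


print(protein_length(8775))  # 2924
```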

  10. Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

    PubMed Central

    Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

    1993-01-01

    Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. PMID:8497043

  11. Network Hubs Buffer Environmental Variation in Saccharomyces cerevisiae

    PubMed Central

    Levy, Sasha F; Siegal, Mark L

    2008-01-01

    Regulatory and developmental systems produce phenotypes that are robust to environmental and genetic variation. A gene product that normally contributes to this robustness is termed a phenotypic capacitor. When a phenotypic capacitor fails, for example when challenged by a harsh environment or mutation, the system becomes less robust and thus produces greater phenotypic variation. A functional phenotypic capacitor provides a mechanism by which hidden polymorphism can accumulate, whereas its failure provides a mechanism by which evolutionary change might be promoted. The primary example to date of a phenotypic capacitor is Hsp90, a molecular chaperone that targets a large set of signal transduction proteins. In both Drosophila and Arabidopsis, compromised Hsp90 function results in pleiotropic phenotypic effects dependent on the underlying genotype. For some traits, Hsp90 also appears to buffer stochastic variation, yet the relationship between environmental and genetic buffering remains an important unresolved question. We previously used simulations of knockout mutations in transcriptional networks to predict that many gene products would act as phenotypic capacitors. To test this prediction, we use high-throughput morphological phenotyping of individual yeast cells from single-gene deletion strains to identify gene products that buffer environmental variation in Saccharomyces cerevisiae. We find more than 300 gene products that, when absent, increase morphological variation. Overrepresented among these capacitors are gene products that control chromosome organization and DNA integrity, RNA elongation, protein modification, cell cycle, and response to stimuli such as stress. Capacitors have a high number of synthetic-lethal interactions but knockouts of these genes do not tend to cause severe decreases in growth rate. Each capacitor can be classified based on whether or not it is encoded by a gene with a paralog in the genome. Capacitors with a duplicate are highly connected in the protein–protein interaction network and show considerable divergence in expression from their paralogs. In contrast, capacitors encoded by singleton genes are part of highly interconnected protein clusters whose other members also tend to affect phenotypic variability or fitness. These results suggest that buffering and release of variation is a widespread phenomenon that is caused by incomplete functional redundancy at multiple levels in the genetic architecture. PMID:18986213

  12. Bacillus subtilis 168 Contains Two Differentially Regulated Genes Encoding l-Asparaginase

    PubMed Central

    Fisher, Susan H.; Wray, Lewis V.

    2002-01-01

    Expression of the two Bacillus subtilis genes encoding l-asparaginase is controlled by independent regulatory factors. The ansZ gene (formerly yccC) was shown by mutational analysis to encode a functional l-asparaginase, the expression of which is activated during nitrogen-limited growth by the TnrA transcription factor. Gel mobility shift and DNase I footprinting experiments indicate that TnrA regulates ansZ expression by binding to a DNA site located upstream of the ansZ promoter. The expression of the ansA gene, which encodes the second l-asparaginase, was found to be induced by asparagine. The ansA repressor, AnsR, was shown to negatively regulate its own expression. PMID:11914346

  13. Bacillus subtilis 168 contains two differentially regulated genes encoding L-asparaginase.

    PubMed

    Fisher, Susan H; Wray, Lewis V

    2002-04-01

    Expression of the two Bacillus subtilis genes encoding L-asparaginase is controlled by independent regulatory factors. The ansZ gene (formerly yccC) was shown by mutational analysis to encode a functional L-asparaginase, the expression of which is activated during nitrogen-limited growth by the TnrA transcription factor. Gel mobility shift and DNase I footprinting experiments indicate that TnrA regulates ansZ expression by binding to a DNA site located upstream of the ansZ promoter. The expression of the ansA gene, which encodes the second L-asparaginase, was found to be induced by asparagine. The ansA repressor, AnsR, was shown to negatively regulate its own expression.

  14. Recombinant DNA encoding a desulfurization biocatalyst

    DOEpatents

    Rambosek, John; Piddington, Chris S.; Kovacevich, Brian R.; Young, Kevin D.; Denome, Sylvia A.

    1994-01-01

    This invention relates to a recombinant DNA molecule containing a gene or genes which encode a biocatalyst capable of desulfurizing a fossil fuel which contains organic sulfur molecules. For example, the present invention encompasses a recombinant DNA molecule containing a gene or genes of a strain of Rhodococcus rhodochrous.

  15. Somatic mutations in the transcriptional corepressor gene BCORL1 in adult acute myelogenous leukemia.

    PubMed

    Li, Meng; Collins, Roxane; Jiao, Yuchen; Ouillette, Peter; Bixby, Dale; Erba, Harry; Vogelstein, Bert; Kinzler, Kenneth W; Papadopoulos, Nickolas; Malek, Sami N

    2011-11-24

    To further our understanding of the genetic basis of acute myelogenous leukemia (AML), we determined the coding exon sequences of ∼ 18 000 protein-encoding genes in 8 patients with secondary AML. Here we report the discovery of novel somatic mutations in the transcriptional corepressor gene BCORL1 that is located on the X-chromosome. Analysis of BCORL1 in an unselected cohort of 173 AML patients identified a total of 10 mutated cases (6%) with BCORL1 mutations, whereas analysis of 19 AML cell lines uncovered 4 (21%) BCORL1 mutated cell lines. The majority (87%) of the mutations in BCORL1 were predicted to inactivate the gene product as a result of nonsense mutations, splice site mutation, or out-of-frame insertions or deletions. These results indicate that BCORL1 by genetic criteria is a novel candidate tumor suppressor gene, joining the growing list of genes recurrently mutated in AML.

  16. Cloning and characterization of a Candida albicans gene homologous to fructose-1,6-bisphosphatase genes.

    PubMed

    De la Rosa, J M; Ruíz, T; Rodríguez, L

    2000-12-01

    By sequencing of the DNA adjacent to the Candida albicans SEC61 gene, an open reading frame encoding a polypeptide of 331 amino acids was found. The predicted protein showed a strong homology with the fructose-1,6-bisphosphatase [FbPase] from other organisms, and conserved regions included the catalytic motif found in all known FbPases. Although the cloned gene did not complement the growth failure of a Saccharomyces cerevisiae fbp1 mutant in media with gluconeogenic carbon sources, it was transcribed in the transformants in a fashion that indicates a partial repression by glucose. A similar control on the transcription of this gene and on FbPase activity was found in wild-type C. albicans, where the cloned gene (CaFBP1) was shown to be localized in a single chromosomal locus in the genome.

  17. Nitric Oxide Metabolism in Neisseria meningitidis

    PubMed Central

    Anjum, Muna F.; Stevanin, Tânia M.; Read, Robert C.; Moir, James W. B.

    2002-01-01

    Neisseria meningitidis, the causative agent of meningococcal disease in humans, is likely to be exposed to nitrosative stress during natural colonization and disease. The genome of N. meningitidis includes the genes aniA and norB, predicted to encode nitrite reductase and nitric oxide (NO) reductase, respectively. These gene products should allow the bacterium to denitrify nitrite to nitrous oxide. We show that N. meningitidis can support growth microaerobically by the denitrification of nitrite via NO and that norB is required for anaerobic growth with nitrite. NorB and, to a lesser extent, the cycP gene product cytochrome c′ are able to counteract toxicity due to exogenously added NO. Expression of these genes by N. meningitidis during colonization and disease may confer protection against exogenous or endogenous nitrosative stress. PMID:12003939

  18. MicroRNA Dysregulation, Gene Networks, and Risk for Schizophrenia in 22q11.2 Deletion Syndrome

    PubMed Central

    Merico, Daniele; Costain, Gregory; Butcher, Nancy J.; Warnica, William; Ogura, Lucas; Alfred, Simon E.; Brzustowicz, Linda M.; Bassett, Anne S.

    2014-01-01

    The role of microRNAs (miRNAs) in the etiology of schizophrenia is increasingly recognized. Microdeletions at chromosome 22q11.2 are recurrent structural variants that impart a high risk for schizophrenia and are found in up to 1% of all patients with schizophrenia. The 22q11.2 deletion region overlaps gene DGCR8, encoding a subunit of the miRNA microprocessor complex. We identified miRNAs overlapped by the 22q11.2 microdeletion and for the first time investigated their predicted target genes, and those implicated by DGCR8, to identify targets that may be involved in the risk for schizophrenia. The 22q11.2 region encompasses seven validated or putative miRNA genes. Employing two standard prediction tools, we generated sets of predicted target genes. Functional enrichment profiles of the 22q11.2 region miRNA target genes suggested a role in neuronal processes and broader developmental pathways. We then constructed a protein interaction network of schizophrenia candidate genes and interaction partners relevant to brain function, independent of the 22q11.2 region miRNA mechanisms. We found that the predicted gene targets of the 22q11.2 deletion miRNAs, and targets of the genome-wide miRNAs predicted to be dysregulated by DGCR8 hemizygosity, were significantly represented in this schizophrenia network. The findings provide new insights into the pathway from 22q11.2 deletion to expression of schizophrenia, and suggest that hemizygosity of the 22q11.2 region may have downstream effects implicating genes elsewhere in the genome that are relevant to the general schizophrenia population. These data also provide further support for the notion that robust genetic findings in schizophrenia may converge on a reasonable number of final pathways. PMID:25484875

  19. Global analyses of Ceratocystis cacaofunesta mitochondria: from genome to proteome.

    PubMed

    Ambrosio, Alinne Batista; do Nascimento, Leandro Costa; Oliveira, Bruno V; Teixeira, Paulo José P L; Tiburcio, Ricardo A; Toledo Thomazella, Daniela P; Leme, Adriana F P; Carazzolle, Marcelo F; Vidal, Ramon O; Mieczkowski, Piotr; Meinhardt, Lyndel W; Pereira, Gonçalo A G; Cabrera, Odalys G

    2013-02-11

    The ascomycete fungus Ceratocystis cacaofunesta is the causal agent of wilt disease in cacao, which results in significant economic losses in the affected producing areas. Despite the economic importance of the Ceratocystis complex of species, no genomic data are available for any of its members. Given that mitochondria play important roles in fungal virulence and the susceptibility/resistance of fungi to fungicides, we performed the first functional analysis of this organelle in Ceratocystis using integrated "omics" approaches. The C. cacaofunesta mitochondrial genome (mtDNA) consists of a single, 103,147-bp circular molecule, making this the second largest mtDNA among the Sordariomycetes. Bioinformatics analysis revealed the presence of 15 conserved genes and 37 intronic open reading frames in C. cacaofunesta mtDNA. Here, we predicted the mitochondrial proteome (mtProt) of C. cacaofunesta, which is comprised of 1,124 polypeptides - 52 proteins that are mitochondrially encoded and 1,072 that are nuclearly encoded. Transcriptome analysis revealed 33 probable novel genes. Comparisons among the Gene Ontology results of the predicted mtProt of C. cacaofunesta, Neurospora crassa and Saccharomyces cerevisiae revealed no significant differences. Moreover, C. cacaofunesta mitochondria were isolated, and the mtProt was subjected to mass spectrometric analysis. The experimental proteome validated 27% of the predicted mtProt. Our results confirmed the existence of 110 hypothetical proteins and 7 novel proteins of which 83 and 1, respectively, had putative mitochondrial localization. The present study provides the first partial genomic analysis of a species of the Ceratocystis genus and the first predicted mitochondrial protein inventory of a phytopathogenic fungus. In addition to the known mitochondrial role in pathogenicity, our results demonstrated that the global function analysis of this organelle is similar in pathogenic and non-pathogenic fungi, suggesting that its relevance in the lifestyle of these organisms should be based on a small number of specific proteins and/or with respect to differential gene regulation. In this regard, particular interest should be directed towards mitochondrial proteins with unknown function and the novel protein that might be specific to this species. Further functional characterization of these proteins could enhance our understanding of the role of mitochondria in phytopathogenicity.

  20. Global analyses of Ceratocystis cacaofunesta mitochondria: from genome to proteome

    PubMed Central

    2013-01-01

    Background The ascomycete fungus Ceratocystis cacaofunesta is the causal agent of wilt disease in cacao, which results in significant economic losses in the affected producing areas. Despite the economic importance of the Ceratocystis complex of species, no genomic data are available for any of its members. Given that mitochondria play important roles in fungal virulence and the susceptibility/resistance of fungi to fungicides, we performed the first functional analysis of this organelle in Ceratocystis using integrated “omics” approaches. Results The C. cacaofunesta mitochondrial genome (mtDNA) consists of a single, 103,147-bp circular molecule, making this the second largest mtDNA among the Sordariomycetes. Bioinformatics analysis revealed the presence of 15 conserved genes and 37 intronic open reading frames in C. cacaofunesta mtDNA. Here, we predicted the mitochondrial proteome (mtProt) of C. cacaofunesta, which is comprised of 1,124 polypeptides - 52 proteins that are mitochondrially encoded and 1,072 that are nuclearly encoded. Transcriptome analysis revealed 33 probable novel genes. Comparisons among the Gene Ontology results of the predicted mtProt of C. cacaofunesta, Neurospora crassa and Saccharomyces cerevisiae revealed no significant differences. Moreover, C. cacaofunesta mitochondria were isolated, and the mtProt was subjected to mass spectrometric analysis. The experimental proteome validated 27% of the predicted mtProt. Our results confirmed the existence of 110 hypothetical proteins and 7 novel proteins of which 83 and 1, respectively, had putative mitochondrial localization. Conclusions The present study provides the first partial genomic analysis of a species of the Ceratocystis genus and the first predicted mitochondrial protein inventory of a phytopathogenic fungus. In addition to the known mitochondrial role in pathogenicity, our results demonstrated that the global function analysis of this organelle is similar in pathogenic and non-pathogenic fungi, suggesting that its relevance in the lifestyle of these organisms should be based on a small number of specific proteins and/or with respect to differential gene regulation. In this regard, particular interest should be directed towards mitochondrial proteins with unknown function and the novel protein that might be specific to this species. Further functional characterization of these proteins could enhance our understanding of the role of mitochondria in phytopathogenicity. PMID:23394930

  1. Photocontrol of the expression of genes encoding chlorophyll a/b binding proteins and small subunit of ribulose-1,5-bisphosphate carboxylase in etiolated seedlings of Lycopersicon esculentum (L.) and Nicotiana tabacum (L.)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wehmeyer, B.; Cashmore, A.R.; Schaefer, E.

    Phytochrome and the blue ultraviolet-A photoreceptor control light-induced expression of genes encoding the chlorophyll a/b binding protein of photosystem II and photosystem I and the genes for the small subunit of the ribulose-1,5-bisphosphate carboxylase in etiolated seedlings of Lycopersicon esculentum (tomato) and Nicotiana tabacum (tobacco). A high irradiance response also controls the induction of these genes. Genes encoding photosystem II- and I-associated chlorophyll a/b binding proteins both exhibit a transient rapid increase in expression in response to light pulse or to continuous irradiation. In contrast, genes encoding the small subunit exhibit a continuous increase in expression in response to light. These distinct expression characteristics are shown to reflect differences at the level of transcription.

  2. Similarity-based gene detection: using COGs to find evolutionarily-conserved ORFs.

    PubMed

    Powell, Bradford C; Hutchison, Clyde A

    2006-01-19

    Experimental verification of gene products has not kept pace with the rapid growth of microbial sequence information. However, existing annotations of gene locations contain sufficient information to screen for probable errors. Furthermore, comparisons among genomes become more informative as more genomes are examined. We studied all open reading frames (ORFs) of at least 30 codons from the genomes of 27 sequenced bacterial strains. We grouped the potential peptide sequences encoded from the ORFs by forming Clusters of Orthologous Groups (COGs). We used this grouping in order to find homologous relationships that would not be distinguishable from noise when using simple BLAST searches. Although COG analysis was initially developed to group annotated genes, we applied it to the task of grouping anonymous DNA sequences that may encode proteins. "Mixed COGs" of ORFs (clusters in which some sequences correspond to annotated genes and some do not) are attractive targets when seeking errors of gene prediction. Examination of mixed COGs reveals some situations in which genes appear to have been missed in current annotations and a smaller number of regions that appear to have been annotated as gene loci erroneously. This technique can also be used to detect potential pseudogenes or sequencing errors. Our method uses an adjustable parameter for degree of conservation among the studied genomes (stringency). We detail results for one level of stringency at which we found 83 potential genes which had not previously been identified, 60 potential pseudogenes, and 7 sequences with existing gene annotations that are probably incorrect. Systematic study of sequence conservation offers a way to improve existing annotations by identifying potentially homologous regions where the annotation of the presence or absence of a gene is inconsistent among genomes.
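
    The first step described above, collecting every ORF of at least 30 codons, can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the abstract does not say how an ORF was delimited, so the sketch assumes an ORF runs from the first in-frame ATG to the next in-frame stop codon, counts the start codon but not the stop codon toward the length, and scans all six reading frames. The names find_orfs and reverse_complement are hypothetical.

```python
from typing import List, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 30) -> List[Tuple[str, int, int, int]]:
    """Scan six reading frames for ORFs of at least min_codons codons.

    Assumption (not from the source): an ORF starts at the first in-frame
    ATG after the previous stop and ends at the next in-frame stop codon.
    Returns (strand, start, end, n_codons); coordinates are 0-based,
    end-exclusive, and relative to the strand that was scanned.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # open a candidate ORF
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3  # stop codon not counted
                    if n_codons >= min_codons:
                        orfs.append((strand, start, i + 3, n_codons))
                    start = None  # close the candidate and keep scanning
    return orfs
```

    The min_codons argument plays the role of the 30-codon cutoff mentioned in the abstract; the COG clustering and the stringency parameter are separate steps that this sketch does not attempt.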

  3. Evolution of Linked Avirulence Effectors in Leptosphaeria maculans Is Affected by Genomic Environment and Exposure to Resistance Genes in Host Plants

    PubMed Central

    Van de Wouw, Angela P.; Cozijnsen, Anton J.; Hane, James K.; Brunner, Patrick C.; McDonald, Bruce A.; Oliver, Richard P.; Howlett, Barbara J.

    2010-01-01

    Brassica napus (canola) cultivars and isolates of the blackleg fungus, Leptosphaeria maculans interact in a ‘gene for gene’ manner whereby plant resistance (R) genes are complementary to pathogen avirulence (Avr) genes. Avirulence genes encode proteins that belong to a class of pathogen molecules known as effectors, which includes small secreted proteins that play a role in disease. In Australia in 2003 canola cultivars with the Rlm1 resistance gene suffered a breakdown of disease resistance, resulting in severe yield losses. This was associated with a large increase in the frequency of virulence alleles of the complementary avirulence gene, AvrLm1, in fungal populations. Surprisingly, the frequency of virulence alleles of AvrLm6 (complementary to Rlm6) also increased dramatically, even though the cultivars did not contain Rlm6. In the L. maculans genome, AvrLm1 and AvrLm6 are linked along with five other genes in a region interspersed with transposable elements that have been degenerated by Repeat-Induced Point (RIP) mutations. Analyses of 295 Australian isolates showed deletions, RIP mutations and/or non-RIP derived amino acid substitutions in the predicted proteins encoded by these seven genes. The degree of RIP mutations within single copy sequences in this region was proportional to their proximity to the degenerated transposable elements. The RIP alleles were monophyletic and were present only in isolates collected after resistance conferred by Rlm1 broke down, whereas deletion alleles belonged to several polyphyletic lineages and were present before and after the resistance breakdown. Thus, genomic environment and exposure to resistance genes in B. napus have affected the evolution of these linked avirulence genes in L. maculans. PMID:21079787

  4. The complete genome sequence and genetic analysis of ΦCA82 a novel uncultured microphage from the turkey gastrointestinal system

    PubMed Central

    2011-01-01

    The genomic DNA sequence of a novel enteric uncultured microphage, ΦCA82 from a turkey gastrointestinal system was determined utilizing metagenomics techniques. The entire circular, single-stranded nucleotide sequence of the genome was 5,514 nucleotides. The ΦCA82 genome is quite different from other microviruses as indicated by comparisons of nucleotide similarity, predicted protein similarity, and functional classifications. Only three genes showed significant similarity to microviral proteins as determined by local alignments using BLAST analysis. ORF1 encoded a predicted phage F capsid protein that was phylogenetically most similar to the Microviridae ΦMH2K member's major coat protein. The ΦCA82 genome also encoded a predicted minor capsid protein (ORF2) and putative replication initiation protein (ORF3) most similar to the microviral bacteriophage SpV4. The distant evolutionary relationship of ΦCA82 suggests that the divergence of this novel turkey microvirus from other microviruses may reflect unique evolutionary pressures encountered within the turkey gastrointestinal system. PMID:21714899

  5. The evolution of genes encoding for green fluorescent proteins: insights from cephalochordates (amphioxus)

    NASA Astrophysics Data System (ADS)

    Yue, Jia-Xing; Holland, Nicholas D.; Holland, Linda Z.; Deheyn, Dimitri D.

    2016-06-01

    Green Fluorescent Protein (GFP) was originally found in cnidarians, and later in copepods and cephalochordates (amphioxus) (Branchiostoma spp). Here, we looked for GFP-encoding genes in Asymmetron, an early-diverged cephalochordate lineage, and found two such genes closely related to some of the Branchiostoma GFPs. Dim fluorescence was found throughout the body in adults of Asymmetron lucayanum, and, as in Branchiostoma floridae, was especially intense in the ripe ovaries. Spectra of the fluorescence were similar between Asymmetron and Branchiostoma. Lineage-specific expansion of GFP-encoding genes in the genus Branchiostoma was observed, largely driven by tandem duplications. Despite such expansion, purifying selection has strongly shaped the evolution of GFP-encoding genes in cephalochordates, with apparent relaxation for highly duplicated clades. All cephalochordate GFP-encoding genes are quite different from those of copepods and cnidarians. Thus, the ancestral cephalochordates probably had GFP, but since GFP appears to be lacking in more early-diverged deuterostomes (echinoderms, hemichordates), it is uncertain whether the ancestral cephalochordates (i.e. the common ancestor of Asymmetron and Branchiostoma) acquired GFP by horizontal gene transfer (HGT) from copepods or cnidarians or inherited it from the common ancestor of copepods and deuterostomes, i.e. the ancestral bilaterians.

  6. Identification and characterization of the gltK gene encoding a membrane-associated glucose transport protein of Pseudomonas aeruginosa.

    PubMed

    Adewoye, L O; Worobec, E A

    2000-08-08

    The Pseudomonas aeruginosa oprB gene encodes the carbohydrate-selective OprB porin, which translocates substrate molecules across the outer membrane to the periplasmic glucose-binding protein. We identified and cloned two open reading frames (ORFs) that flank the oprB gene but are not in an operonic arrangement with it. The downstream ORF encodes a putative polypeptide homologous to members of a family of transcriptional repressors, whereas the oprB gene is preceded by an ORF encoding a putative product that exhibits strong homology to several carbohydrate transport ATP-binding cassette (ABC) proteins. The genomic copy of the upstream ORF was mutagenized by homologous recombination. Analysis of the deletion mutant in comparison with the wild type revealed a significant reduction in [14C] glucose transport activity in the mutant strain, suggesting that this ORF likely encodes the inner membrane component of the glucose ABC transporter. It is thus designated the gltK gene to reflect its homology to the Pseudomonas fluorescens mtlK gene and its involvement in the high-affinity glucose transport system. Multiple alignment analysis revealed that the P. aeruginosa gltK gene product is a member of the MalK subfamily of ABC proteins.

  7. Three New Pierce's Disease Pathogenicity Effectors Identified Using Xylella fastidiosa Biocontrol Strain EB92-1.

    PubMed

    Zhang, Shujian; Chakrabarty, Pranjib K; Fleites, Laura A; Rayside, Patricia A; Hopkins, Donald L; Gabriel, Dean W

    2015-01-01

    Xylella fastidiosa (X. fastidiosa) infects a wide range of plant hosts and causes economically serious diseases, including Pierce's Disease (PD) of grapevines. X. fastidiosa biocontrol strain EB92-1 was isolated from elderberry and is infectious and persistent in grapevines but causes only very slight symptoms under ideal conditions. The draft genome of EB92-1 revealed that it appeared to be missing genes encoding 10 potential PD pathogenicity effectors found in Temecula1. Subsequent PCR and sequencing analyses confirmed that EB92-1 was missing the following predicted effectors found in Temecula1: two type II secreted enzymes, including a lipase (LipA; PD1703) and a serine protease (PD0956); two identical genes encoding proteins similar to Zonula occludens toxins (Zot; PD0915 and PD0928), and at least one relatively short, hemagglutinin-like protein (PD0986). Leaves of tobacco and citrus inoculated with cell-free, crude protein extracts of E. coli BL21(DE3) overexpressing PD1703 exhibited a hypersensitive response (HR) in less than 24 hours. When cloned into shuttle vector pBBR1MCS-5, PD1703 conferred strong secreted lipase activity to Xanthomonas citri, E. coli and X. fastidiosa EB92-1 in plate assays. EB92-1/PD1703 transformants also showed significantly increased disease symptoms on grapevines, characteristic of PD. Genes predicted to encode PD0928 (Zot) and a PD0986 (hemagglutinin) were also cloned into pBBR1MCS-5 and moved into EB92-1; both transformants also showed significantly increased symptoms on V. vinifera vines, characteristic of PD. Together, these results reveal that PD effectors include at least a lipase, two Zot-like toxins and a possibly redundant hemagglutinin, none of which are necessary for parasitic survival of X. fastidiosa populations in grapevines or elderberry.

  8. Three New Pierce's Disease Pathogenicity Effectors Identified Using Xylella fastidiosa Biocontrol Strain EB92-1

    PubMed Central

    Zhang, Shujian; Chakrabarty, Pranjib K.; Fleites, Laura A.; Rayside, Patricia A.; Hopkins, Donald L.; Gabriel, Dean W.

    2015-01-01

    Xylella fastidiosa (X. fastidiosa) infects a wide range of plant hosts and causes economically serious diseases, including Pierce's Disease (PD) of grapevines. X. fastidiosa biocontrol strain EB92-1 was isolated from elderberry and is infectious and persistent in grapevines but causes only very slight symptoms under ideal conditions. The draft genome of EB92-1 revealed that it appeared to be missing genes encoding 10 potential PD pathogenicity effectors found in Temecula1. Subsequent PCR and sequencing analyses confirmed that EB92-1 was missing the following predicted effectors found in Temecula1: two type II secreted enzymes, including a lipase (LipA; PD1703) and a serine protease (PD0956); two identical genes encoding proteins similar to Zonula occludens toxins (Zot; PD0915 and PD0928), and at least one relatively short, hemagglutinin-like protein (PD0986). Leaves of tobacco and citrus inoculated with cell-free, crude protein extracts of E. coli BL21(DE3) overexpressing PD1703 exhibited a hypersensitive response (HR) in less than 24 hours. When cloned into shuttle vector pBBR1MCS-5, PD1703 conferred strong secreted lipase activity to Xanthomonas citri, E. coli and X. fastidiosa EB92-1 in plate assays. EB92-1/PD1703 transformants also showed significantly increased disease symptoms on grapevines, characteristic of PD. Genes predicted to encode PD0928 (Zot) and a PD0986 (hemagglutinin) were also cloned into pBBR1MCS-5 and moved into EB92-1; both transformants also showed significantly increased symptoms on V. vinifera vines, characteristic of PD. Together, these results reveal that PD effectors include at least a lipase, two Zot-like toxins and a possibly redundant hemagglutinin, none of which are necessary for parasitic survival of X. fastidiosa populations in grapevines or elderberry. PMID:26218423

  9. Non-contiguous genome sequence of Mycobacterium simiae strain DSM 44165(T.).

    PubMed

    Sassi, Mohamed; Robert, Catherine; Raoult, Didier; Drancourt, Michel

    2013-01-01

    Mycobacterium simiae is a non-tuberculosis mycobacterium causing pulmonary infections in both immunocompetent and immunocompromised patients. We announce the draft genome sequence of M. simiae DSM 44165(T). The 5,782,968-bp long genome with 65.15% GC content (one chromosome, no plasmid) contains 5,727 open reading frames (33% with unknown function and 11 ORFs larger than 5,000 bp), three rRNA operons, 52 tRNA, one 66-bp tmRNA matching with tmRNA tags from Mycobacterium avium, Mycobacterium tuberculosis, Mycobacterium bovis, Mycobacterium microti, Mycobacterium marinum, and Mycobacterium africanum, and 389 DNA repetitive sequences. When ORFs and size distribution were compared between M. simiae and five other Mycobacterium species, M. simiae clustered with M. abscessus and M. smegmatis. A 40-kb prophage was predicted in addition to two prophage-like elements, 7-kb and 18-kb in size, but no mycobacteriophage was seen after the observation of 10^6 M. simiae cells. Fifteen putative CRISPRs were found. Three genes were predicted to encode resistance to aminoglycosides, beta-lactams and macrolide-lincosamide-streptogramin B. A total of 163 CAZYmes were annotated. M. simiae contains ESX-1 to ESX-5 genes encoding a type-VII secretion system. Availability of the genome sequence may help depict the unique properties of this environmental, opportunistic pathogen.

  10. The transcriptional regulator pool of the marine bacterium Rhodopirellula baltica SH 1T as revealed by whole genome comparisons.

    PubMed

    Lombardot, Thierry; Bauer, Margarete; Teeling, Hanno; Amann, Rudolf; Glöckner, Frank Oliver

    2005-01-01

    Rhodopirellula baltica (strain SH 1T) is a free-living marine representative of the phylogenetically independent and environmentally relevant phylum Planctomycetes. Little is known about the regulatory strategies of free-living bacteria with large (7.15 Mb) genomes. Therefore, a consistent, quantitative and qualitative description was produced by comparing R. baltica's transcriptional regulator pool with that of 123 publicly available bacterial genomes. The overall results are congruous with earlier observations that in Bacteria, the proportion of genes encoding transcriptional regulators generally increases with genome size. However, R. baltica distinctly stands out from this trend with only 2.4% (174) of all genes predicted to encode transcriptional regulators. The qualitative investigation of R. baltica's transcriptional regulators revealed a clear shift towards high numbers of two-component systems (66) as well as high numbers of sigma factors (49), with more than 76% (37) belonging to the extra-cytoplasmic function subfamily of sigma-70. Only one predicted sigma factor showed a relatively close phylogenetic relationship to that of another bacterium, the sigma factor SigZ of Bacillus subtilis. In summary, analysis of the R. baltica genome revealed disparate regulatory mechanisms and a clear bias towards direct environmental sensing. This strategy might provide a selective advantage for organisms living in habitats with frequently changing environmental conditions.

  11. The Lp_3561 and Lp_3562 Enzymes Support a Functional Divergence Process in the Lipase/Esterase Toolkit from Lactobacillus plantarum

    PubMed Central

    Esteban-Torres, María; Reverón, Inés; Santamaría, Laura; Mancheño, José M.; de las Rivas, Blanca; Muñoz, Rosario

    2016-01-01

    Lactobacillus plantarum species is a good source of esterases since both lipolytic and esterase activities have been described for strains of this species. No fundamental biochemical difference exists among esterases and lipases since both share a common catalytic mechanism. L. plantarum WCFS1 possesses a protein, Lp_3561, which is 44% identical to a previously described lipase, Lp_3562. In contrast to Lp_3562, Lp_3561 was unable to degrade esters possessing a chain length higher than C4 and the triglyceride tributyrin. As in other L. plantarum esterases, the electrostatic potential surface around the active site in Lp_3561 is predicted to be basic, whereas it is essentially neutral in the Lp_3562 lipase. The fact that the genes encoding both proteins were located contiguously in the L. plantarum WCFS1 genome suggests that they originated by tandem duplication, and therefore are paralogs as new functions have arisen during evolution. The presence of the contiguous lp_3561 and lp_3562 genes was studied among L. plantarum strains. They are located in an 8,903-bp DNA fragment that encodes proteins involved in the catabolism of sialic acid and are predicted to increase bacterial adaptability under certain growth conditions. PMID:27486450

  12. Gene expression profiling during asexual development of the late blight pathogen Phytophthora infestans reveals a highly dynamic transcriptome.

    PubMed

    Judelson, Howard S; Ah-Fong, Audrey M V; Aux, George; Avrova, Anna O; Bruce, Catherine; Cakir, Cahid; da Cunha, Luis; Grenville-Briggs, Laura; Latijnhouwers, Maita; Ligterink, Wilco; Meijer, Harold J G; Roberts, Samuel; Thurber, Carrie S; Whisson, Stephen C; Birch, Paul R J; Govers, Francine; Kamoun, Sophien; van West, Pieter; Windass, John

    2008-04-01

    Much of the pathogenic success of Phytophthora infestans, the potato and tomato late blight agent, relies on its ability to generate from mycelia large amounts of sporangia, which release zoospores that encyst and form infection structures. To better understand these stages, Affymetrix GeneChips based on 15,650 unigenes were designed and used to profile the life cycle. Approximately half of P. infestans genes were found to exhibit significant differential expression between developmental transitions, with approximately one-tenth being stage-specific and most changes occurring during zoosporogenesis. Quantitative reverse-transcription polymerase chain reaction assays confirmed the robustness of the array results and showed that similar patterns of differential expression were obtained regardless of whether hyphae were from laboratory media or infected tomato. Differentially expressed genes encode potential cellular regulators, especially protein kinases; metabolic enzymes such as those involved in glycolysis, gluconeogenesis, or the biosynthesis of amino acids or lipids; regulators of DNA synthesis; structural proteins, including predicted flagellar proteins; and pathogenicity factors, including cell-wall-degrading enzymes, RXLR effector proteins, and enzymes protecting against plant defense responses. Curiously, some stage-specific transcripts do not appear to encode functional proteins. These findings reveal many new aspects of oomycete biology, as well as potential targets for crop protection chemicals.

  13. PDE1 Encodes a P-Type ATPase Involved in Appressorium-Mediated Plant Infection by the Rice Blast Fungus Magnaporthe grisea

    PubMed Central

    Balhadère, Pascale V.; Talbot, Nicholas J.

    2001-01-01

    Plant infection by the rice blast fungus Magnaporthe grisea is brought about by the action of specialized infection cells called appressoria. These infection cells generate enormous turgor pressure, which is translated into an invasive force that allows a narrow penetration hypha to breach the plant cuticle. The Magnaporthe pde1 mutant was identified previously by restriction enzyme–mediated DNA integration mutagenesis and is impaired in its ability to elaborate penetration hyphae. Here we report that the pde1 mutation is the result of an insertion into the promoter of a P-type ATPase-encoding gene. Targeted gene disruption confirmed the role of PDE1 in penetration hypha development and pathogenicity but highlighted potential differences in PDE1 regulation in different Magnaporthe strains. The predicted PDE1 gene product was most similar to members of the aminophospholipid translocase group of P-type ATPases and was shown to be a functional homolog of the yeast ATPase gene ATC8. Spatial expression studies showed that PDE1 is expressed in germinating conidia and developing appressoria. These findings implicate the action of aminophospholipid translocases in the development of penetration hyphae and the proliferation of the fungus beyond colonization of the first epidermal cell. PMID:11549759

  14. Functional analysis of alternative transcripts of the soybean Rj2 gene that restricts nodulation with specific rhizobial strains.

    PubMed

    Tang, F; Yang, S; Zhu, H

    2016-05-01

    The Rj2 gene is a TIR-NBS-LRR-type resistance gene in soybean (Glycine max) that restricts root nodule symbiosis with a group of Bradyrhizobium japonicum strains including USDA122. Rj2 generates two distinct transcript variants in its expression profile through alternative splicing. Alternative splicing of Rj2 is caused by the retention of the 86-bp intron 4. Inclusion of intron 4 in mature mRNA introduces an in-frame stop codon; as such, the alternative transcript is predicted to encode a truncated protein consisting of the entire portion of the TIR, NBS and LRR domains but missing the C-terminal domain of the full-length Rj2 protein encoded by the regular transcript. Since alternative splicing has been shown to be essential for full activity of several plant R genes, we attempted to test whether the alternative splicing is required for Rj2-mediated nodulation restriction. Here we demonstrated that the Rj2-mediated nodulation restriction does not require the combined presence of the regular and alternative transcripts, and the expression of the regular transcript alone is sufficient to confer nodulation restriction. © 2016 German Botanical Society and The Royal Botanical Society of the Netherlands.

  15. Recombinant DNA encoding a desulfurization biocatalyst

    DOEpatents

    Rambosek, J.; Piddington, C.S.; Kovacevich, B.R.; Young, K.D.; Denome, S.A.

    1994-10-18

    This invention relates to a recombinant DNA molecule containing a gene or genes which encode a biocatalyst capable of desulfurizing a fossil fuel which contains organic sulfur molecules. For example, the present invention encompasses a recombinant DNA molecule containing a gene or genes of a strain of Rhodococcus rhodochrous. 13 figs.

  16. Structure, Function, Interaction, Co-evolution of Rice Blast Resistance Genes

    USDA-ARS's Scientific Manuscript database

    Rice blast disease caused by the fungal pathogen Magnaporthe oryzae is one of the most destructive rice diseases worldwide. Resistance (R) genes to blast encode proteins that detect pathogen signaling molecules encoded by M. oryzae avirulence (AVR) genes. R genes can be a single or a member of clu...

  17. Systematic mapping of two component response regulators to gene targets in a model sulfate reducing bacterium.

    PubMed

    Rajeev, Lara; Luning, Eric G; Dehal, Paramvir S; Price, Morgan N; Arkin, Adam P; Mukhopadhyay, Aindrila

    2011-10-12

    Two component regulatory systems are the primary form of signal transduction in bacteria. Although genomic binding sites have been determined for several eukaryotic and bacterial transcription factors, comprehensive identification of gene targets of two component response regulators remains challenging due to the lack of knowledge of the signals required for their activation. We focused our study on Desulfovibrio vulgaris Hildenborough, a sulfate reducing bacterium that encodes unusually diverse and largely uncharacterized two component signal transduction systems. We report the first systematic mapping of the genes regulated by all transcriptionally acting response regulators in a single bacterium. Our results enabled functional predictions for several response regulators and include key processes of carbon, nitrogen and energy metabolism, cell motility and biofilm formation, and responses to stresses such as nitrite, low potassium and phosphate starvation. Our study also led to the prediction of new genes and regulatory networks, which found corroboration in a compendium of transcriptome data available for D. vulgaris. For several regulators we predicted and experimentally verified the binding site motifs, most of which were discovered as part of this study. The gene targets identified for the response regulators allowed strong functional predictions to be made for the corresponding two component systems. By tracking the D. vulgaris regulators and their motifs outside the Desulfovibrio spp. we provide testable hypotheses regarding the functions of orthologous regulators in other organisms. The in vitro array based method optimized here is generally applicable for the study of such systems in all organisms.

  18. The unusually large Plasmodium telomerase reverse-transcriptase localizes in a discrete compartment associated with the nucleolus

    PubMed Central

    Figueiredo, Luisa M.; Rocha, Eduardo P. C.; Mancio-Silva, Liliana; Prevost, Christine; Hernandez-Verdun, Danièle; Scherf, Artur

    2005-01-01

    Telomerase replicates chromosome ends, a function necessary for maintaining genome integrity. We have identified the gene that encodes the catalytic reverse transcriptase (RT) component of this enzyme in the malaria parasite Plasmodium falciparum (PfTERT) as well as the orthologous genes from two rodent and one simian malaria species. PfTERT is predicted to encode a basic protein that contains the major sequence motifs previously identified in known telomerase RTs (TERTs). At ∼2500 amino acids, PfTERT is three times larger than other characterized TERTs. We observed remarkable sequence diversity between TERT proteins of different Plasmodial species, with conserved domains alternating with hypervariable regions. Immunofluorescence analysis revealed that PfTERT is expressed in asexual blood stage parasites that have begun DNA synthesis. Surprisingly, rather than at telomere clusters, PfTERT typically localizes into a discrete nuclear compartment. We further demonstrate that this compartment is associated with the nucleolus, hereby defined for the first time in P. falciparum. PMID:15722485

  19. Ebolavirus comparative genomics

    DOE PAGES

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; ...

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.
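
    The oligomer frequency analysis mentioned above can be illustrated with a small k-mer profile. This is only a sketch under stated assumptions: the abstract gives neither the oligomer length nor the measure used to compare genomes, so k = 4 and the Euclidean distance below are illustrative choices, and the function names kmer_frequencies and profile_distance are hypothetical.

```python
from collections import Counter
from math import sqrt
from typing import Dict

_BASES = set("ACGT")


def kmer_frequencies(seq: str, k: int = 4) -> Dict[str, float]:
    """Relative frequency of each overlapping k-mer in a nucleotide sequence.

    Windows containing anything other than A, C, G or T (for example N)
    are skipped. k = 4 is an assumed default, not a value from the source.
    """
    seq = seq.upper()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= _BASES
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {kmer: n / total for kmer, n in counts.items()}


def profile_distance(p: Dict[str, float], q: Dict[str, float]) -> float:
    """Euclidean distance between two k-mer frequency profiles (assumed metric)."""
    keys = set(p) | set(q)
    return sqrt(sum((p.get(key, 0.0) - q.get(key, 0.0)) ** 2 for key in keys))
```

    Genomes with similar oligomer usage give small distances, so a matrix of pairwise distances could then be clustered to see which genomes group together.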

  20. Effect of chitinase on resistance to fungal pathogens in sea buckthorn, Hippophae rhamnoides, and cloning of Class I and III chitinase genes.

    PubMed

    Sun, Yan-Lin; Hong, Soon-Kwan

    2012-08-01

    Sea buckthorn (Hippophae rhamnoides L.) is naturally distributed from Asia to Europe. It has been widely planted as an ornamental shrub and is rich in nutritional and medicinal compounds. Fungal pathogens that cause diseases such as dried-shrink disease are threats to the production of this plant. In this study, we isolated the dried-shrink disease pathogen from bark and total chitinase protein from leaves of infected plants. The results of the Oxford Cup experiment suggested that chitinase protein inhibited the growth of this pathogen. To improve pathogen resistance, we cloned chitinase Class I and III genes in H. rhamnoides, designated Hrchi1 and Hrchi3. The full-length cDNA of the open reading frame region of Hrchi1 contained 903 bp encoding 300 amino acids and Hrchi3 contained 894 bp encoding 297 amino acids. Active domain analysis, protein types, and secondary and 3D structures were predicted using online software.
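
    The ORF sizes quoted above follow from simple codon arithmetic: 903 bp is 301 codons, and 894 bp is 298 codons, so the reported 300 and 297 amino acids are consistent with each ORF length including one terminal stop codon. A minimal check, with the hypothetical helper name orf_protein_length:

```python
def orf_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of orf_bp base pairs.

    Assumes the ORF length is a whole number of codons and, by default,
    that the final codon is a stop codon that adds no residue.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


print(orf_protein_length(903))  # Hrchi1
print(orf_protein_length(894))  # Hrchi3
```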
