putative protein coding: Topics by Science.gov

Sample records for putative protein coding

Identification of Putative Nuclear Receptors and Steroidogenic Enzymes in Murray-Darling Rainbowfish (Melanotaenia fluviatilis) Using RNA-Seq and De Novo Transcriptome Assembly.

PubMed

Bain, Peter A; Papanicolaou, Alexie; Kumar, Anupama

2015-01-01

Murray-Darling rainbowfish (Melanotaenia fluviatilis [Castelnau, 1878]; Atheriniformes: Melanotaeniidae) is a small-bodied teleost currently under development in Australasia as a test species for aquatic toxicological studies. To date, efforts towards the development of molecular biomarkers of contaminant exposure have been hindered by the lack of available sequence data. To address this, we sequenced messenger RNA from brain, liver and gonads of mature male and female fish and generated a high-quality draft transcriptome using a de novo assembly approach. 149,742 clusters of putative transcripts were obtained, encompassing 43,841 non-redundant protein-coding regions. Deduced amino acid sequences were annotated by functional inference based on similarity with sequences from manually curated protein sequence databases. The draft assembly contained protein-coding regions homologous to 95.7% of the complete cohort of predicted proteins from the taxonomically related species, Oryzias latipes (Japanese medaka). The mean length of rainbowfish protein-coding sequences relative to their medaka homologues was 92.1%, indicating that despite the limited number of tissues sampled a large proportion of the total expected number of protein-coding genes was captured in the study. Because of our interest in the effects of environmental contaminants on endocrine pathways, we manually curated subsets of coding regions for putative nuclear receptors and steroidogenic enzymes in the rainbowfish transcriptome, revealing 61 candidate nuclear receptors encompassing all known subfamilies, and 41 putative steroidogenic enzymes representing all major steroidogenic enzymes occurring in teleosts. The transcriptome presented here will be a valuable resource for researchers interested in biomarker development, protein structure and function, and contaminant-response genomics in Murray-Darling rainbowfish.
Transcriptional landscapes of Axolotl (Ambystoma mexicanum).

PubMed

Caballero-Pérez, Juan; Espinal-Centeno, Annie; Falcon, Francisco; García-Ortega, Luis F; Curiel-Quesada, Everardo; Cruz-Hernández, Andrés; Bako, Laszlo; Chen, Xuemei; Martínez, Octavio; Alberto Arteaga-Vázquez, Mario; Herrera-Estrella, Luis; Cruz-Ramírez, Alfredo

2018-01-15

The axolotl (Ambystoma mexicanum) is the vertebrate model system with the highest regeneration capacity. Experimental tools established over the past 100 years have been fundamental to start unraveling the cellular and molecular basis of tissue and limb regeneration. In the absence of a reference genome for the Axolotl, transcriptomic analysis become fundamental to understand the genetic basis of regeneration. Here we present one of the most diverse transcriptomic data sets for Axolotl by profiling coding and non-coding RNAs from diverse tissues. We reconstructed a population of 115,906 putative protein coding mRNAs as full ORFs (including isoforms). We also identified 352 conserved miRNAs and 297 novel putative mature miRNAs. Systematic enrichment analysis of gene expression allowed us to identify tissue-specific protein-coding transcripts. We also found putative novel and conserved microRNAs which potentially target mRNAs which are reported as important disease candidates in heart and liver. Copyright © 2017 Elsevier Inc. All rights reserved.
Extreme Sensory Complexity Encoded in the 10-Megabase Draft Genome Sequence of the Chromatically Acclimating Cyanobacterium Tolypothrix sp. PCC 7601

PubMed Central

Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K.; Fryszczyn, Bartlomiej G.; Fox, George E.; Tirumalai, Madhan R.; Liu, Yamei; Kim, Sun

2015-01-01

Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. PMID:25953173
Extreme Sensory Complexity Encoded in the 10-Megabase Draft Genome Sequence of the Chromatically Acclimating Cyanobacterium Tolypothrix sp. PCC 7601.

PubMed

Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K; Fryszczyn, Bartlomiej G; Fox, George E; Tirumalai, Madhan R; Liu, Yamei; Kim, Sun; Kehoe, David M; Weinstock, George M

2015-05-07

Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. Copyright © 2015 Yerrapragada et al.
Emerging Putative Associations between Non-Coding RNAs and Protein-Coding Genes in Neuropathic Pain: Added Value from Reusing Microarray Data.

PubMed

Raju, Hemalatha B; Tsinoremas, Nicholas F; Capobianco, Enrico

2016-01-01

Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein-protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches.
Emerging Putative Associations between Non-Coding RNAs and Protein-Coding Genes in Neuropathic Pain: Added Value from Reusing Microarray Data

PubMed Central

Raju, Hemalatha B.; Tsinoremas, Nicholas F.; Capobianco, Enrico

2016-01-01

Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein–protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches. PMID:27803687
TCOF1 gene encodes a putative nucleolar phosphoprotein that exhibits mutations in Treacher Collins Syndrome throughout its coding region.

PubMed

Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W

1997-04-01

Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.
Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.

PubMed Central

Borodovsky, M; Rudd, K E; Koonin, E V

1994-01-01

The unannotated regions of the Escherichia coli genome DNA sequence from the EcoSeq6 database, totaling 1,278 'intergenic' sequences of the combined length of 359,279 basepairs, were analyzed using computer-assisted methods with the aim of identifying putative unknown genes. The proposed strategy for finding new genes includes two key elements: i) prediction of expressed open reading frames (ORFs) using the GeneMark method based on Markov chain models for coding and non-coding regions of Escherichia coli DNA, and ii) search for protein sequence similarities using programs based on the BLAST algorithm and programs for motif identification. A total of 354 putative expressed ORFs were predicted by GeneMark. Using the BLASTX and TBLASTN programs, it was shown that 208 ORFs located in the unannotated regions of the E. coli chromosome are significantly similar to other protein sequences. Identification of 182 ORFs as probable genes was supported by GeneMark and BLAST, comprising 51.4% of the GeneMark 'hits' and 87.5% of the BLAST 'hits'. 73 putative new genes, comprising 20.6% of the GeneMark predictions, belong to ancient conserved protein families that include both eubacterial and eukaryotic members. This value is close to the overall proportion of highly conserved sequences among eubacterial proteins, indicating that the majority of the putative expressed ORFs that are predicted by GeneMark, but have no significant BLAST hits, nevertheless are likely to be real genes. The majority of the putative genes identified by BLAST search have been described since the release of the EcoSeq6 database, but about 70 genes have not been detected so far. Among these new identifications are genes encoding proteins with a variety of predicted functions including dehydrogenases, kinases, several other metabolic enzymes, ATPases, rRNA methyltransferases, membrane proteins, and different types of regulatory proteins. Images PMID:7984428
A repertoire of the dominant transcripts from the salivary glands of the blood-sucking bug, Triatoma dimidiata, a vector of Chagas disease

PubMed Central

Kato, Hirotomo; Jochim, Ryan C.; Gomez, Eduardo A.; Sakoda, Ryo; Iwata, Hiroyuki; Valenzuela, Jesus G.; Hashiguchi, Yoshihisa

2010-01-01

Triatoma (T.) dimidiata is a hematophagous Hemiptera and a main vector of Chagas disease. The saliva of this and other blood-sucking insects contains potent pharmacologically active components that assist them in counteracting the host hemostatic and inflammatory systems during blood feeding. To describe the repertoire of potential bioactive salivary molecules from this insect, a number of randomly selected transcripts from the salivary gland cDNA library of T. dimidiata were sequenced and analyzed. This analysis showed that 77.5% of the isolated transcripts coded for putative secreted proteins, and 89.9% of these coded for variants of the lipocalin family proteins. The most abundant transcript was a homologue of procalin, the major allergen of T. protracta saliva, and contributed more than 50% of the transcripts coding for putative secreted proteins, suggesting that it may play an important role in the blood-feeding process. Other salivary transcripts encoding lipocalin family proteins had homology to triabin (a thrombin inhibitor), triafestin (an inhibitor of kallikrein–kinin system), pallidipin (an inhibitor of collagen-induced platelet aggregation) and others with unknown function. PMID:19900580
Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

PubMed

Seligmann, Hervé

2013-05-07

GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins

PubMed Central

Delcourt, Vivian; Lucier, Jean-François; Gagnon, Jules; Beaudoin, Maxime C; Vanderperre, Benoît; Breton, Marc-André; Motard, Julie; Jacques, Jean-François; Brunelle, Mylène; Gagnon-Arsenault, Isabelle; Fournier, Isabelle; Ouangraoua, Aida; Hunting, Darel J; Cohen, Alan A; Landry, Christian R; Scott, Michelle S

2017-01-01

Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins. PMID:29083303
Are plant formins integral membrane proteins?

PubMed

Cvrcková, F

2000-01-01

The formin family of proteins has been implicated in signaling pathways of cellular morphogenesis in both animals and fungi; in the latter case, at least, they participate in communication between the actin cytoskeleton and the cell surface. Nevertheless, they appear to be cytoplasmic or nuclear proteins, and it is not clear whether they communicate with the plasma membrane, and if so, how. Because nothing is known about formin function in plants, I performed a systematic search for putative Arabidopsis thaliana formin homologs. I found eight putative formin-coding genes in the publicly available part of the Arabidopsis genome sequence and analyzed their predicted protein sequences. Surprisingly, some of them lack parts of the conserved formin-homology 2 (FH2) domain and the majority of them seem to have signal sequences and putative transmembrane segments that are not found in yeast or animals formins. Plant formins define a distinct subfamily. The presence in most Arabidopsis formins of sequence motifs typical or transmembrane proteins suggests a mechanism of membrane attachment that may be specific to plant formins, and indicates an unexpected evolutionary flexibility of the conserved formin domain.
Network perturbation by recurrent regulatory variants in cancer

PubMed Central

Cho, Ara; Lee, Insuk; Choi, Jung Kyoon

2017-01-01

Cancer driving genes have been identified as recurrently affected by variants that alter protein-coding sequences. However, a majority of cancer variants arise in noncoding regions, and some of them are thought to play a critical role through transcriptional perturbation. Here we identified putative transcriptional driver genes based on combinatorial variant recurrence in cis-regulatory regions. The identified genes showed high connectivity in the cancer type-specific transcription regulatory network, with high outdegree and many downstream genes, highlighting their causative role during tumorigenesis. In the protein interactome, the identified transcriptional drivers were not as highly connected as coding driver genes but appeared to form a network module centered on the coding drivers. The coding and regulatory variants associated via these interactions between the coding and transcriptional drivers showed exclusive and complementary occurrence patterns across tumor samples. Transcriptional cancer drivers may act through an extensive perturbation of the regulatory network and by altering protein network modules through interactions with coding driver genes. PMID:28333928
Putative function of hypothetical proteins expressed by Clostridium perfringens type A strains and their protective efficacy in mouse model.

PubMed

Alam, Syed Imteyaz; Dwivedi, Pratistha

2016-10-01

The whole genome sequencing and annotation of Clostridium perfringens strains revealed several genes coding for proteins of unknown function with no significant similarities to genes in other organisms. Our previous studies clearly demonstrated that hypothetical proteins CPF_2500, CPF_1441, CPF_0876, CPF_0093, CPF_2002, CPF_2314, CPF_1179, CPF_1132, CPF_2853, CPF_0552, CPF_2032, CPF_0438, CPF_1440, CPF_2918, CPF_0656, and CPF_2364 are genuine proteins of C. perfringens expressed in high abundance. This study explored the putative role of these hypothetical proteins using bioinformatic tools and evaluated their potential as putative candidates for prophylaxis. Apart from a group of eight hypothetical proteins (HPs), a putative function was predicted for the rest of the hypothetical proteins using one or more of the algorithms used. The phylogenetic analysis did not suggest an evidence of a horizontal gene transfer event except for HP CPF_0876. HP CPF_2918 is an abundant extracellular protein, unique to C. perfringens species with maximum strain coverage and did not show any significant match in the database. CPF_2918 was cloned, recombinant protein was purified to near homogeneity, and probing with mouse anti-CPF_2918 serum revealed surface localization of the protein in C. perfringens ATCC13124 cultures. The purified recombinant CPF_2918 protein induced antibody production, a mixed Th1 and Th2 kind of response, and provided partial protection to immunized mice in direct C. perfringens challenge. Copyright © 2016 Elsevier B.V. All rights reserved.
A Catalogue of Putative cis-Regulatory Interactions Between Long Non-coding RNAs and Proximal Coding Genes Based on Correlative Analysis Across Diverse Human Tumors.

PubMed

Basu, Swaraj; Larsson, Erik

2018-05-31

Antisense transcripts and other long non-coding RNAs are pervasive in mammalian cells, and some of these molecules have been proposed to regulate proximal protein-coding genes in cis For example, non-coding transcription can contribute to inactivation of tumor suppressor genes in cancer, and antisense transcripts have been implicated in the epigenetic inactivation of imprinted genes. However, our knowledge is still limited and more such regulatory interactions likely await discovery. Here, we make use of available gene expression data from a large compendium of human tumors to generate hypotheses regarding non-coding-to-coding cis -regulatory relationships with emphasis on negative associations, as these are less likely to arise for reasons other than cis -regulation. We document a large number of possible regulatory interactions, including 193 coding/non-coding pairs that show expression patterns compatible with negative cis -regulation. Importantly, by this approach we capture several known cases, and many of the involved coding genes have known roles in cancer. Our study provides a large catalog of putative non-coding/coding cis -regulatory pairs that may serve as a basis for further experimental validation and characterization. Copyright © 2018 Basu and Larsson.
In Silico Pattern-Based Analysis of the Human Cytomegalovirus Genome

PubMed Central

Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T.; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

2003-01-01

More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/). PMID:12634390
In silico pattern-based analysis of the human cytomegalovirus genome.

PubMed

Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

2003-04-01

More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).
Unraveling the molecular mechanisms of nitrogenase conformational protection against oxygen in diazotrophic bacteria.

PubMed

Lery, Letícia M S; Bitar, Mainá; Costa, Mauricio G S; Rössle, Shaila C S; Bisch, Paulo M

2010-12-22

G. diazotrophicus and A. vinelandii are aerobic nitrogen-fixing bacteria. Although oxygen is essential for the survival of these organisms, it irreversibly inhibits nitrogenase, the complex responsible for nitrogen fixation. Both microorganisms deal with this paradox through compensatory mechanisms. In A. vinelandii a conformational protection mechanism occurs through the interaction between the nitrogenase complex and the FeSII protein. Previous studies suggested the existence of a similar system in G. diazotrophicus, but the putative protein involved was not yet described. This study intends to identify the protein coding gene in the recently sequenced genome of G. diazotrophicus and also provide detailed structural information of nitrogenase conformational protection in both organisms. Genomic analysis of G. diazotrophicus sequences revealed a protein coding ORF (Gdia0615) enclosing a conserved "fer2" domain, typical of the ferredoxin family and found in A. vinelandii FeSII. Comparative models of both FeSII and Gdia0615 disclosed a conserved beta-grasp fold. Cysteine residues that coordinate the 2[Fe-S] cluster are in conserved positions towards the metallocluster. Analysis of solvent accessible residues and electrostatic surfaces unveiled an hydrophobic dimerization interface. Dimers assembled by molecular docking presented a stable behaviour and a proper accommodation of regions possibly involved in binding of FeSII to nitrogenase throughout molecular dynamics simulations in aqueous solution. Molecular modeling of the nitrogenase complex of G. diazotrophicus was performed and models were compared to the crystal structure of A. vinelandii nitrogenase. Docking experiments of FeSII and Gdia0615 with its corresponding nitrogenase complex pointed out in both systems a putative binding site presenting shape and charge complementarities at the Fe-protein/MoFe-protein complex interface. The identification of the putative FeSII coding gene in G. diazotrophicus genome represents a large step towards the understanding of the conformational protection mechanism of nitrogenase against oxygen. In addition, this is the first study regarding the structural complementarities of FeSII-nitrogenase interactions in diazotrophic bacteria. The combination of bioinformatic tools for genome analysis, comparative protein modeling, docking calculations and molecular dynamics provided a powerful strategy for the elucidation of molecular mechanisms and structural features of FeSII-nitrogenase interaction.
Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins.

PubMed

Hu, Pingzhao; Janga, Sarath Chandra; Babu, Mohan; Díaz-Mejía, J Javier; Butland, Gareth; Yang, Wenhong; Pogoutse, Oxana; Guo, Xinghua; Phanse, Sadhna; Wong, Peter; Chandran, Shamanta; Christopoulos, Constantine; Nazarians-Armavil, Anaies; Nasseri, Negin Karimi; Musso, Gabriel; Ali, Mehrab; Nazemof, Nazila; Eroukova, Veronika; Golshani, Ashkan; Paccanaro, Alberto; Greenblatt, Jack F; Moreno-Hagelsieb, Gabriel; Emili, Andrew

2009-04-28

One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.

PubMed

Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K

1991-09-15

We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.

From Genomes to Protein Models and Back

NASA Astrophysics Data System (ADS)

Tramontano, Anna; Giorgetti, Alejandro; Orsini, Massimiliano; Raimondo, Domenico

2007-12-01

The alternative splicing mechanism allows genes to generate more than one product. When the splicing events occur within protein coding regions they can modify the biological function of the protein. Alternative splicing has been suggested as one way for explaining the discrepancy between the number of human genes and functional complexity. We analysed the putative structure of the alternatively spliced gene products annotated in the ENCODE pilot project and discovered that many of the potential alternative gene products will be unlikely to produce stable functional proteins.
Venom gland transcriptomic and venom proteomic analyses of the scorpion Megacormus gertschi Díaz-Najera, 1966 (Scorpiones: Euscorpiidae: Megacorminae).

PubMed

Santibáñez-López, Carlos E; Cid-Uribe, Jimena I; Zamudio, Fernando Z; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D

2017-07-01

The soluble venom from the Mexican scorpion Megacormus gertschi of the family Euscorpiidae was obtained and its biological effects were tested in several animal models. This venom is not toxic to mice at doses of 100 μg per 20 g of mouse weight, while being lethal to arthropods (insects and crustaceans), at doses of 20 μg (for crickets) and 100 μg (for shrimps) per animal. Samples of the venom were separated by high performance liquid chromatography and circa 80 distinct chromatographic fractions were obtained from which 67 components have had their molecular weights determined by mass spectrometry analysis. The N-terminal amino acid sequence of seven protein/peptides were obtained by Edman degradation and are reported. Among the high molecular weight components there are enzymes with experimentally-confirmed phospholipase activity. A pair of telsons from this scorpion species was dissected, from which total RNA was extracted and used for cDNA library construction. Massive sequencing by the Illumina protocol, followed by de novo assembly, resulted in a total of 110,528 transcripts. From those, we were able to annotate 182, which putatively code for peptides/proteins with sequence similarity to previously-reported venom components available from different protein databases. Transcripts seemingly coding for enzymes showed the richest diversity, with 52 sequences putatively coding for proteases, 20 for phospholipases, 8 for lipases and 5 for hyaluronidases. The number of different transcripts potentially coding for peptides with sequence similarity to those that affect ion channels was 19, for putative antimicrobial peptides 19, and for protease inhibitor-like peptides, 18. Transcripts seemingly coding for other venom components were identified and described. The LC/MS analysis of a trypsin-digested venom aliquot resulted in 23 matches with the translated transcriptome database, which validates the transcriptome. The proteomic and transcriptomic analyses reported here constitute the first approach to study the venom components from a scorpion species belonging to the family Euscorpiidae. The data certainly show that this venom is different from all the ones described thus far in the literature. Copyright © 2017 Elsevier Ltd. All rights reserved.
Characterization of a Theta-Type Plasmid from Lactobacillus sakei: a Potential Basis for Low-Copy-Number Vectors in Lactobacilli

PubMed Central

Alpert, Carl-Alfred; Crutz-Le Coq, Anne-Marie; Malleret, Christine; Zagorec, Monique

2003-01-01

The complete nucleotide sequence of the 13-kb plasmid pRV500, isolated from Lactobacillus sakei RV332, was determined. Sequence analysis enabled the identification of genes coding for a putative type I restriction-modification system, two genes coding for putative recombinases of the integrase family, and a region likely involved in replication. The structural features of this region, comprising a putative ori segment containing 11- and 22-bp repeats and a repA gene coding for a putative initiator protein, indicated that pRV500 belongs to the pUCL287 subfamily of theta-type replicons. A 3.7-kb fragment encompassing this region was fused to an Escherichia coli replicon to produce the shuttle vector pRV566 and was observed to be functional in L. sakei for plasmid replication. The L. sakei replicon alone could not support replication in E. coli. Plasmid pRV500 and its derivative pRV566 were determined to be at very low copy numbers in L. sakei. pRV566 was maintained at a reasonable rate over 20 generations in several lactobacilli, such as Lactobacillus curvatus, Lactobacillus casei, and Lactobacillus plantarum, in addition to L. sakei, making it an interesting basis for developing vectors. Sequence relationships with other plasmids are described and discussed. PMID:12957947
An operon from Lactobacillus helveticus composed of a proline iminopeptidase gene (pepI) and two genes coding for putative members of the ABC transporter family of proteins.

PubMed

Varmanen, P; Rantanen, T; Palva, A

1996-12-01

A proline iminopeptidase gene (pepI) of an industrial Lactobacillus helveticus strain was cloned and found to be organized in an operon-like structure of three open reading frames (ORF1, ORF2 and ORF3). ORF1 was preceded by a typical prokaryotic promoter region, and a putative transcription terminator was found downstream of ORF3, identified as the pepI gene. Using primer-extension analyses, only one transcription start site, upstream of ORF1, was identifiable in the predicted operon. Although the size of mRNA could not be judged by Northern analysis either with ORF1-, ORF2- or pepI-specific probes, reverse transcription-PCR analyses further supported the operon structure of the three genes. ORF1, ORF2 and ORF3 had coding capacities for 50.7, 24.5 and 33.8 kDa proteins, respectively. The ORF3-encoded PepI protein showed 65% identity with the PepI proteins from Lactobacillus delbrueckii subsp. bulgaricus and Lactobacillus delbrueckii subsp. lactis. The ORF1-encoded protein had significant homology with several members of the ABC transporter family but, with two distinct putative ATP-binding sites, it would represent an unusual type among the bacterial ABC transporters. ORF2 encoded a putative integral membrane protein also characteristic of the ABC transporter family. The pepI gene was overexpressed in Escherichia coli. Purified PepI hydrolysed only di and tripeptides with proline in the first position. Optimum PepI activity was observed at pH 7.5 and 40 degrees C. A gel filtration analysis indicated that PepI is a dimer of M(r) 53,000. PepI was shown to be a metal-independent serine peptidase having thiol groups at or near the active site. Kinetic studies with proline-p-nitroanilide as substrate revealed Km and Vmax values of 0.8 mM and 350 mmol min-1 mg-1, respectively, and a very high turnover number of 135,000 s-1.
Deciphering the Genome Sequences of the Hydrophobic Cyanobacterium Scytonema tolypothrichoides VB-61278

PubMed Central

Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma

2015-01-01

Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. PMID:25838486
TCOF1 gene encodes a putative nucleolar phosphoprotein that exhibits mutations in Treacher Collins Syndrome throughout its coding region

PubMed Central

Wise, Carol A.; Chiang, Lydia C.; Paznekas, William A.; Sharma, Mridula; Musy, Maurice M.; Ashley, Jennifer A.; Lovett, Michael; Jabs, Ethylin W.

1997-01-01

Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development. PMID:9096354
A global analysis of protein expression profiles in Sinorhizobium meliloti: discovery of new genes for nodule occupancy and stress adaptation.

PubMed

Djordjevic, Michael A; Chen, Han Cai; Natera, Siria; Van Noorden, Giel; Menzel, Christian; Taylor, Scott; Renard, Clotilde; Geiger, Otto; Weiller, Georg F

2003-06-01

A proteomic examination of Sinorhizobium meliloti strain 1021 was undertaken using a combination of 2-D gel electrophoresis, peptide mass fingerprinting, and bioinformatics. Our goal was to identify (i) putative symbiosis- or nutrient-stress-specific proteins, (ii) the biochemical pathways active under different conditions, (iii) potential new genes, and (iv) the extent of posttranslational modifications of S. meliloti proteins. In total, we identified the protein products of 810 genes (13.1% of the genome's coding capacity). The 810 genes generated 1,180 gene products, with chromosomal genes accounting for 78% of the gene products identified (18.8% of the chromosome's coding capacity). The activity of 53 metabolic pathways was inferred from bioinformatic analysis of proteins with assigned Enzyme Commission numbers. Of the remaining proteins that did not encode enzymes, ABC-type transporters composed 12.7% and regulatory proteins 3.4% of the total. Proteins with up to seven transmembrane domains were identified in membrane preparations. A total of 27 putative nodule-specific proteins and 35 nutrient-stress-specific proteins were identified and used as a basis to define genes and describe processes occurring in S. meliloti cells in nodules and under stress. Several nodule proteins from the plant host were present in the nodule bacteria preparations. We also identified seven potentially novel proteins not predicted from the DNA sequence. Post-translational modifications such as N-terminal processing could be inferred from the data. The posttranslational addition of UMP to the key regulator of nitrogen metabolism, PII, was demonstrated. This work demonstrates the utility of combining mass spectrometry with protein arraying or separation techniques to identify candidate genes involved in important biological processes and niche occupations that may be intransigent to other methods of gene expression profiling.
Genetic and molecular characterization of a gene encoding a wide specificity purine permease of Aspergillus nidulans reveals a novel family of transporters conserved in prokaryotes and eukaryotes.

PubMed

Diallinas, G; Gorfinkiel, L; Arst, H N; Cecchetto, G; Scazzocchio, C

1995-04-14

In Aspergillus nidulans, loss-of-function mutations in the uapA and azgA genes, encoding the major uric acid-xanthine and hypoxanthine-adenine-guanine permeases, respectively, result in impaired utilization of these purines as sole nitrogen sources. The residual growth of the mutant strains is due to the activity of a broad specificity purine permease. We have identified uapC, the gene coding for this third permease through the isolation of both gain-of-function and loss-of-function mutations. Uptake studies with wild-type and mutant strains confirmed the genetic analysis and showed that the UapC protein contributes 30% and 8-10% to uric acid and hypoxanthine transport rates, respectively. The uapC gene was cloned, its expression studied, its sequence and transcript map established, and the sequence of its putative product analyzed. uapC message accumulation is: (i) weakly induced by 2-thiouric acid; (ii) repressed by ammonium; (iii) dependent on functional uaY and areA regulatory gene products (mediating uric acid induction and nitrogen metabolite repression, respectively); (iv) increased by uapC gain-of-function mutations which specifically, but partially, suppress a leucine to valine mutation in the zinc finger of the protein coded by the areA gene. The putative uapC gene product is a highly hydrophobic protein of 580 amino acids (M(r) = 61,251) including 12-14 putative transmembrane segments. The UapC protein is highly similar (58% identity) to the UapA permease and significantly similar (23-34% identity) to a number of bacterial transporters. Comparisons of the sequences and hydropathy profiles of members of this novel family of transporters yield insights into their structure, functionally important residues, and possible evolutionary relationships.
Genomic Organization and Molecular Analysis of Virulent Bacteriophage 2972 Infecting an Exopolysaccharide-Producing Streptococcus thermophilus Strain

PubMed Central

Lévesque, Céline; Duplessis, Martin; Labonté, Jessica; Labrie, Steve; Fremaux, Christophe; Tremblay, Denise; Moineau, Sylvain

2005-01-01

The Streptococcus thermophilus virulent pac-type phage 2972 was isolated from a yogurt made in France in 1999. It is a representative of several phages that have emerged with the industrial use of the exopolysaccharide-producing S. thermophilus strain RD534. The genome of phage 2972 has 34,704 bp with an overall G+C content of 40.15%, making it the shortest S. thermophilus phage genome analyzed so far. Forty-four open reading frames (ORFs) encoding putative proteins of 40 or more amino acids were identified, and bioinformatic analyses led to the assignment of putative functions to 23 ORFs. Comparative genomic analysis of phage 2972 with the six other sequenced S. thermophilus phage genomes confirmed that the replication module is conserved and that cos- and pac-type phages have distinct structural and packaging genes. Two group I introns were identified in the genome of 2972. They interrupted the genes coding for the putative endolysin and the terminase large subunit. Phage mRNA splicing was demonstrated for both introns, and the secondary structures were predicted. Eight structural proteins were also identified by N-terminal sequencing and/or matrix-assisted laser desorption ionization—time-of-flight mass spectrometry. Detailed analysis of the putative minor tail proteins ORF19 and ORF21 as well as the putative receptor-binding protein ORF20 showed the following interesting features: (i) ORF19 is a hybrid protein, because it displays significant identity with both pac- and cos-type phages; (ii) ORF20 is unique; and (iii) a protein similar to ORF21 of 2972 was also found in the structure of the cos-type phage DT1, indicating that this structural protein is present in both S. thermophilus phage groups. The implications of these findings for phage classification are discussed. PMID:16000821
Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins.

PubMed

Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro

2008-01-03

The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Tenebrio molitor antifreeze protein gene identification and regulation.

PubMed

Qin, Wensheng; Walker, Virginia K

2006-02-15

The yellow mealworm, Tenebrio molitor, is a freeze susceptible, stored product pest. Its winter survival is facilitated by the accumulation of antifreeze proteins (AFPs), encoded by a small gene family. We have now isolated 11 different AFP genomic clones from 3 genomic libraries. All the clones had a single coding sequence, with no evidence of intervening sequences. Three genomic clones were further characterized. All have putative TATA box sequences upstream of the coding regions and multiple potential poly(A) signal sequences downstream of the coding regions. A TmAFP regulatory region, B1037, conferred transcriptional activity when ligated to a luciferase reporter sequence and after transfection into an insect cell line. A 143 bp core promoter including a TATA box sequence was identified. Its promoter activity was increased 4.4 times by inserting an exotic 245 bp intron into the construct, similar to the enhancement of transgenic expression seen in several other systems. The addition of a duplication of the first 120 bp sequence from the 143 bp core promoter decreased promoter activity by half. Although putative hormonal response sequences were identified, none of the five hormones tested enhanced reporter activity. These studies on the mechanisms of AFP transcriptional control are important for the consideration of any transfer of freeze-resistance phenotypes to beneficial hosts.
Deciphering the Genome Sequences of the Hydrophobic Cyanobacterium Scytonema tolypothrichoides VB-61278.

PubMed

Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma; Adhikary, Siba Prasad; Tripathy, Sucheta

2015-04-02

Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. Copyright © 2015 Das et al.
The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)

Treesearch

G.A. Tuskan; S. DiFazio; S. Jansson; J. Bohlmann; I. Grigoriev; U. Hellsten; N. Putnam; S. Ralph; S. Rombauts; A. Salamov; J. Schein; L. Sterck; A. Aerts; R.R. Bhalerao; R.P. Bhalerao; D. Blaudez; W. Boerjan; A. Brun; A. Brunner; V. Busov; M. Campbell; J. Carlson; M. Chalot; J. Chapman; G.-L. Chen; D. Cooper; P.M. Coutinho; J. Couturier; S. Covert; Q. Cronk; R. Cunningham; J. Davis; S. Degroeve; A. Dejardin; C. dePamphilis; J. Detter; B. Dirks; U. Dubchak; S. Duplessis; J. Ehlting; B. Ellis; K. Gendler; D. Goodstein; M. Gribskov; J. Grimwood; A. Groover; L. Gunter; B. Hamberger; B. Heinze; Y. Helariutta; B. Henrissat; D. Holligan; R. Holt; W. Huang; N. Islam-Faridi; S. Jones; M. Jones-Rhoades; R. Jorgensen; C. Joshi; J. Kangasjarvi; J. Karlsson; C. Kelleher; R. Kirkpatrick; M. Kirst; A. Kohler; U. Kalluri; F. Larimer; J. Leebens-Mack; J.-C. Leple; P. Locascio; Y. Lou; S. Lucas; F. Martin; B. Montanini; C. Napoli; D.R. Nelson; C. Nelson; K. Nieminen; O. Nilsson; V. Pereda; G. Peter; R. Philippe; G. Pilate; A. Poliakov; J. Razumovskaya; P. Richardson; C. Rinaldi; K. Ritland; P. Rouze; D. Ryaboy; J. Schumtz; J. Schrader; B. Segerman; H. Shin; A. Siddiqui; F. Sterky; A. Terry; C.-J. Tsai; E. Uberbacher; P. Unneberg; J. Vahala; K. Wall; S. Wessler; G. Yang; T. Yin; C. Douglas; M. Marra; G. Sandberg; Y. Van de Peer; D. Rokhsar

2006-01-01

We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs...
Cloning, characterization and sequence comparison of the gene coding for IMP dehydrogenase from Pyrococcus furiosus.

PubMed

Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E

1996-10-03

We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Pyrococcus furiosus (Pf), a hyperthermophillic archeon. Sequence analysis of the Pf gene indicated an open reading frame specifying a protein of 485 amino acids (aa) with a calculated M(r) of 52900. Canonical Archaea promoter elements, Box A and Box B, are located -49 and -17 nucleotides (nt), respectively, upstream of the putative start codon. The sequence of the putative active-site region conforms to the IMPDH signature motif and contains a putative active-site cysteine. Phylogenetic relationships derived by using all available IMPDH sequences are consistent with trees developed for other molecules; they do not precisely resolve the history of Pf IMPDH but indicate a close similarity to bacterial IMPDH proteins. The phylogenetic analysis indicates that a gene duplication occurred prior to the division between rodents and humans, accounting for the Type I and II isoforms identified in mice and humans.
mTOR referees memory and disease through mRNA repression and competition.

PubMed

Raab-Graham, Kimberly F; Niere, Farr

2017-06-01

Mammalian target of rapamycin (mTOR) activity is required for memory and is dysregulated in disease. Activation of mTOR promotes protein synthesis; however, new studies are demonstrating that mTOR activity also represses the translation of mRNAs. Almost three decades ago, Kandel and colleagues hypothesised that memory was due to the induction of positive regulators and removal of negative constraints. Are these negative constraints repressed mRNAs that code for proteins that block memory formation? Herein, we will discuss the mRNAs coded by putative memory suppressors, how activation/inactivation of mTOR repress protein expression at the synapse, how mTOR activity regulates RNA binding proteins, mRNA stability, and translation, and what the possible implications of mRNA repression are to memory and neurodegenerative disorders. © 2017 Federation of European Biochemical Societies.
Prediction of plant lncRNA by ensemble machine learning classifiers.

PubMed

Simopoulos, Caitlin M A; Weretilnyk, Elizabeth A; Golding, G Brian

2018-05-02

In plants, long non-protein coding RNAs are believed to have essential roles in development and stress responses. However, relative to advances on discerning biological roles for long non-protein coding RNAs in animal systems, this RNA class in plants is largely understudied. With comparatively few validated plant long non-coding RNAs, research on this potentially critical class of RNA is hindered by a lack of appropriate prediction tools and databases. Supervised learning models trained on data sets of mostly non-validated, non-coding transcripts have been previously used to identify this enigmatic RNA class with applications largely focused on animal systems. Our approach uses a training set comprised only of empirically validated long non-protein coding RNAs from plant, animal, and viral sources to predict and rank candidate long non-protein coding gene products for future functional validation. Individual stochastic gradient boosting and random forest classifiers trained on only empirically validated long non-protein coding RNAs were constructed. In order to use the strengths of multiple classifiers, we combined multiple models into a single stacking meta-learner. This ensemble approach benefits from the diversity of several learners to effectively identify putative plant long non-coding RNAs from transcript sequence features. When the predicted genes identified by the ensemble classifier were compared to those listed in GreeNC, an established plant long non-coding RNA database, overlap for predicted genes from Arabidopsis thaliana, Oryza sativa and Eutrema salsugineum ranged from 51 to 83% with the highest agreement in Eutrema salsugineum. Most of the highest ranking predictions from Arabidopsis thaliana were annotated as potential natural antisense genes, pseudogenes, transposable elements, or simply computationally predicted hypothetical protein. Due to the nature of this tool, the model can be updated as new long non-protein coding transcripts are identified and functionally verified. This ensemble classifier is an accurate tool that can be used to rank long non-protein coding RNA predictions for use in conjunction with gene expression studies. Selection of plant transcripts with a high potential for regulatory roles as long non-protein coding RNAs will advance research in the elucidation of long non-protein coding RNA function.
Isolation and molecular identification of Sunshine virus, a novel paramyxovirus found in Australian snakes.

PubMed

Hyndman, Timothy H; Marschang, Rachel E; Wellehan, James F X; Nicholls, Philip K

2012-10-01

This paper describes the isolation and molecular identification of a novel paramyxovirus found during an investigation of an outbreak of neurorespiratory disease in a collection of Australian pythons. Using Illumina® high-throughput sequencing, a 17,187 nucleotide sequence was assembled from RNA extracts from infected viper heart cells (VH2) displaying widespread cytopathic effects in the form of multinucleate giant cells. The sequence appears to contain all the coding regions of the genome, including the following predicted paramyxoviral open reading frames (ORFs): 3'--Nucleocapsid (N)--putative Phosphoprotein (P)--Matrix (M)--Fusion (F)--putative attachment protein--Polymerase (L)--5'. There is also a 540 nucleotide ORF between the N and putative P genes that may be an additional coding region. Phylogenetic analyses of the complete N, M, F and L genes support the clustering of this virus within the family Paramyxoviridae but outside both of the current subfamilies: Paramyxovirinae and Pneumovirinae. We propose to name this new virus, Sunshine virus, after the geographic origin of the first isolate--the Sunshine Coast of Queensland, Australia. Copyright © 2012 Elsevier B.V. All rights reserved.
Transcription of the cottontail rabbit papillomavirus early region and identification of two E6 polypeptides in COS-7 cells.

PubMed Central

Barbosa, M S; Wettstein, F O

1987-01-01

Cottontail rabbit papillomavirus (CRPV) early proteins are present at very low levels in virus-induced tumors and cannot be detected by immunological methods. Furthermore, cells in culture are not readily transformed by the virus. To overcome these difficulties in identifying and characterizing the putative transforming protein(s) coded by the E6 open reading frame, the early cottontail rabbit papillomavirus region was expressed under the control of the late simian virus 40 promoter. Mapping of the transcripts in transiently transfected COS-7 cells indicated that transcription was initiated in the late region of simian virus 40. Two E6-coded polypeptides were identified, representing translation products initiated at the first and second AUG codons. Images PMID:3039182
Two novel heat shock genes encoding proteins produced in response to heterologous protein expression in Escherichia coli.

PubMed Central

Allen, S P; Polazzi, J O; Gierse, J K; Easton, A M

1992-01-01

In Escherichia coli high-level production of some heterologous proteins (specifically, human prorenin, renin, and bovine insulin-like growth factor 2) resulted in the induction of two new E. coli heat shock proteins, both of which have molecular masses of 16 kDa and are tightly associated with inclusion bodies formed during heterologous protein production. We named these inclusion body-associated proteins IbpA and IbpB. The coding sequences for IbpA and IbpB were identified and isolated from the Kohara E. coli gene bank. The genes for these proteins (ibpA and ibpB) are located at 82.5 min on the chromosome. Nucleotide sequencing of the two genes revealed that they are transcribed in the same direction and are separated by 110 bp. Putative Shine-Dalgarno sequences are located upstream from the initiation codons of both genes. A putative heat shock promoter is located upstream from ibpA, and a putative transcription terminator is located downstream from ibpB. A temperature upshift experiment in which we used a wild-type E. coli strain and an isogenic rpoH mutant strain indicated that a sigma 32-containing RNA polymerase is involved in the regulation of expression of these genes. There is 57.5% identity between the genes at the nucleotide level and 52.2% identity at the amino acid level. A search of the protein data bases showed that both of these 16-kDa proteins exhibit low levels of homology to low-molecular-weight heat shock proteins from eukaryotic species. Images PMID:1356969
Molecular cloning and evolutionary analysis of the calcium-modulated contractile protein, centrin, in green algae and land plants.

PubMed

Bhattacharya, D; Steinkötter, J; Melkonian, M

1993-12-01

Centrin (= caltractin) is a ubiquitous, cytoskeletal protein which is a member of the EF-hand superfamily of calcium-binding proteins. A centrin-coding cDNA was isolated and characterized from the prasinophyte green alga Scherffelia dubia. Centrin PCR amplification primers were used to isolate partial, homologous cDNA sequences from the green algae Tetraselmis striata and Spermatozopsis similis. Annealing analyses suggested that centrin is a single-copy-coding region in T. striata and S. similis and other green algae studied. Centrin-coding regions from S. dubia, S. similis and T. striata encode four colinear EF-hand domains which putatively bind calcium. Phylogenetic analyses, including homologous sequences from Chlamydomonas reinhardtii and the land plant Atriplex nummularia, demonstrate that the domains of centrins are congruent and arose from the two-fold duplication of an ancestral EF hand with Domains 1+3 and Domains 2+4 clustering. The domains of centrins are also congruent with those of calmodulins demonstrating that, like calmodulin, centrin is an ancient protein which arose within the ancestor of all eukaryotes via gene duplication. Phylogenetic relationships inferred from centrin-coding region comparisons mirror results of small subunit ribosomal RNA sequence analyses suggesting that centrin-coding regions are useful evolutionary markers within the green algae.

A comparative genomics perspective on the genetic content of the alkaliphilic haloarchaeon Natrialba magadii ATCC 43099T

PubMed Central

2012-01-01

Background Natrialba magadii is an aerobic chemoorganotrophic member of the Euryarchaeota and is a dual extremophile requiring alkaline conditions and hypersalinity for optimal growth. The genome sequence of Nab. magadii type strain ATCC 43099 was deciphered to obtain a comprehensive insight into the genetic content of this haloarchaeon and to understand the basis of some of the cellular functions necessary for its survival. Results The genome of Nab. magadii consists of four replicons with a total sequence of 4,443,643 bp and encodes 4,212 putative proteins, some of which contain peptide repeats of various lengths. Comparative genome analyses facilitated the identification of genes encoding putative proteins involved in adaptation to hypersalinity, stress response, glycosylation, and polysaccharide biosynthesis. A proton-driven ATP synthase and a variety of putative cytochromes and other proteins supporting aerobic respiration and electron transfer were encoded by one or more of Nab. magadii replicons. The genome encodes a number of putative proteases/peptidases as well as protein secretion functions. Genes encoding putative transcriptional regulators, basal transcription factors, signal perception/transduction proteins, and chemotaxis/phototaxis proteins were abundant in the genome. Pathways for the biosynthesis of thiamine, riboflavin, heme, cobalamin, coenzyme F420 and other essential co-factors were deduced by in depth sequence analyses. However, approximately 36% of Nab. magadii protein coding genes could not be assigned a function based on Blast analysis and have been annotated as encoding hypothetical or conserved hypothetical proteins. Furthermore, despite extensive comparative genomic analyses, genes necessary for survival in alkaline conditions could not be identified in Nab. magadii. Conclusions Based on genomic analyses, Nab. magadii is predicted to be metabolically versatile and it could use different carbon and energy sources to sustain growth. Nab. magadii has the genetic potential to adapt to its milieu by intracellular accumulation of inorganic cations and/or neutral organic compounds. The identification of Nab. magadii genes involved in coenzyme biosynthesis is a necessary step toward further reconstruction of the metabolic pathways in halophilic archaea and other extremophiles. The knowledge gained from the genome sequence of this haloalkaliphilic archaeon is highly valuable in advancing the applications of extremophiles and their enzymes. PMID:22559199
Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

DOE PAGES

Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...

2014-10-02

Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui

Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
Decoding sORF translation - from small proteins to gene regulation.

PubMed

Cabrera-Quio, Luis Enrique; Herberg, Sarah; Pauli, Andrea

2016-11-01

Translation is best known as the fundamental mechanism by which the ribosome converts a sequence of nucleotides into a string of amino acids. Extensive research over many years has elucidated the key principles of translation, and the majority of translated regions were thought to be known. The recent discovery of wide-spread translation outside of annotated protein-coding open reading frames (ORFs) came therefore as a surprise, raising the intriguing possibility that these newly discovered translated regions might have unrecognized protein-coding or gene-regulatory functions. Here, we highlight recent findings that provide evidence that some of these newly discovered translated short ORFs (sORFs) encode functional, previously missed small proteins, while others have regulatory roles. Based on known examples we will also speculate about putative additional roles and the potentially much wider impact that these translated regions might have on cellular homeostasis and gene regulation.
Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.

PubMed

Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R

1999-12-16

The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
The Human Cell Surfaceome of Breast Tumors

PubMed Central

da Cunha, Júlia Pinheiro Chagas; Galante, Pedro Alexandre Favoretto; de Souza, Jorge Estefano Santana; Pieprzyk, Martin; Carraro, Dirce Maria; Old, Lloyd J.; Camargo, Anamaria Aranha; de Souza, Sandro José

2013-01-01

Introduction. Cell surface proteins are ideal targets for cancer therapy and diagnosis. We have identified a set of more than 3700 genes that code for transmembrane proteins believed to be at human cell surface. Methods. We used a high-throuput qPCR system for the analysis of 573 cell surface protein-coding genes in 12 primary breast tumors, 8 breast cell lines, and 21 normal human tissues including breast. To better understand the role of these genes in breast tumors, we used a series of bioinformatics strategies to integrates different type, of the datasets, such as KEGG, protein-protein interaction databases, ONCOMINE, and data from, literature. Results. We found that at least 77 genes are overexpressed in breast primary tumors while at least 2 of them have also a restricted expression pattern in normal tissues. We found common signaling pathways that may be regulated in breast tumors through the overexpression of these cell surface protein-coding genes. Furthermore, a comparison was made between the genes found in this report and other genes associated with features clinically relevant for breast tumorigenesis. Conclusions. The expression profiling generated in this study, together with an integrative bioinformatics analysis, allowed us to identify putative targets for breast tumors. PMID:24195083
Pharmacophore screening of the protein data bank for specific binding site chemistry.

PubMed

Campagna-Slater, Valérie; Arrowsmith, Andrew G; Zhao, Yong; Schapira, Matthieu

2010-03-22

A simple computational approach was developed to screen the Protein Data Bank (PDB) for putative pockets possessing a specific binding site chemistry and geometry. The method employs two commonly used 3D screening technologies, namely identification of cavities in protein structures and pharmacophore screening of chemical libraries. For each protein structure, a pocket finding algorithm is used to extract potential binding sites containing the correct types of residues, which are then stored in a large SDF-formatted virtual library; pharmacophore filters describing the desired binding site chemistry and geometry are then applied to screen this virtual library and identify pockets matching the specified structural chemistry. As an example, this approach was used to screen all human protein structures in the PDB and identify sites having chemistry similar to that of known methyl-lysine binding domains that recognize chromatin methylation marks. The selected genes include known readers of the histone code as well as novel binding pockets that may be involved in epigenetic signaling. Putative allosteric sites were identified on the structures of TP53BP1, L3MBTL3, CHEK1, KDM4A, and CREBBP.
Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.

PubMed

Zhang, Chun-Ting; Wang, Ju; Zhang, Ren

2002-02-01

The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or eukaryotic genomes with less introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is less than or equal to 5579 only, about 3.8-7.0% less than 5800-6000, which is currently widely accepted. The base compositions at three codon positions are analyzed in details using a graphic method. The result shows that the preference codons adopted by yeast genes are of the RGW type, where R, G and W indicate the bases of purine, non-G and A/T, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism

PubMed Central

Yin, Ling; An, Yunhe; Qu, Junjie; Li, Xinlong; Zhang, Yali; Dry, Ian; Wu, Huijuan; Lu, Jiang

2017-01-01

Plasmopara viticola causes downy mildew disease of grapevine which is one of the most devastating diseases of viticulture worldwide. Here we report a 101.3 Mb whole genome sequence of P. viticola isolate ‘JL-7-2’ obtained by a combination of Illumina and PacBio sequencing technologies. The P. viticola genome contains 17,014 putative protein-coding genes and has ~26% repetitive sequences. A total of 1,301 putative secreted proteins, including 100 putative RXLR effectors and 90 CRN effectors were identified in this genome. In the secretome, 261 potential pathogenicity genes and 95 carbohydrate-active enzymes were predicted. Transcriptional analysis revealed that most of the RXLR effectors, pathogenicity genes and carbohydrate-active enzymes were significantly up-regulated during infection. Comparative genomic analysis revealed that P. viticola evolved independently from the Arabidopsis downy mildew pathogen Hyaloperonospora arabidopsidis. The availability of the P. viticola genome provides a valuable resource not only for comparative genomic analysis and evolutionary studies among oomycetes, but also enhance our knowledge on the mechanism of interactions between this biotrophic pathogen and its host. PMID:28417959
Transcriptional Profiles of Mating-Responsive Genes from Testes and Male Accessory Glands of the Mediterranean Fruit Fly, Ceratitis capitata

PubMed Central

Scolari, Francesca; Gomulski, Ludvik M.; Ribeiro, José M. C.; Siciliano, Paolo; Meraldi, Alice; Falchetto, Marco; Bonomi, Angelica; Manni, Mosè; Gabrieli, Paolo; Malovini, Alberto; Bellazzi, Riccardo; Aksoy, Serap; Gasperi, Giuliano; Malacrida, Anna R.

2012-01-01

Background Insect seminal fluid is a complex mixture of proteins, carbohydrates and lipids, produced in the male reproductive tract. This seminal fluid is transferred together with the spermatozoa during mating and induces post-mating changes in the female. Molecular characterization of seminal fluid proteins in the Mediterranean fruit fly, Ceratitis capitata, is limited, although studies suggest that some of these proteins are biologically active. Methodology/Principal Findings We report on the functional annotation of 5914 high quality expressed sequence tags (ESTs) from the testes and male accessory glands, to identify transcripts encoding putative secreted peptides that might elicit post-mating responses in females. The ESTs were assembled into 3344 contigs, of which over 33% produced no hits against the nr database, and thus may represent novel or rapidly evolving sequences. Extraction of the coding sequences resulted in a total of 3371 putative peptides. The annotated dataset is available as a hyperlinked spreadsheet. Four hundred peptides were identified with putative secretory activity, including odorant binding proteins, protease inhibitor domain-containing peptides, antigen 5 proteins, mucins, and immunity-related sequences. Quantitative RT-PCR-based analyses of a subset of putative secretory protein-encoding transcripts from accessory glands indicated changes in their abundance after one or more copulations when compared to virgin males of the same age. These changes in abundance, particularly evident after the third mating, may be related to the requirement to replenish proteins to be transferred to the female. Conclusions/Significance We have developed the first large-scale dataset for novel studies on functions and processes associated with the reproductive biology of Ceratitis capitata. The identified genes may help study genome evolution, in light of the high adaptive potential of the medfly. In addition, studies of male recovery dynamics in terms of accessory gland gene expression profiles and correlated remating inhibition mechanisms may permit the improvement of pest management approaches. PMID:23071645
Putative Nonribosomal Peptide Synthetase and Cytochrome P450 Genes Responsible for Tentoxin Biosynthesis in Alternaria alternata ZJ33.

PubMed

Li, You-Hai; Han, Wen-Jin; Gui, Xi-Wu; Wei, Tao; Tang, Shuang-Yan; Jin, Jian-Ming

2016-08-02

Tentoxin, a cyclic tetrapeptide produced by several Alternaria species, inhibits the F₁-ATPase activity of chloroplasts, resulting in chlorosis in sensitive plants. In this study, we report two clustered genes, encoding a putative non-ribosome peptide synthetase (NRPS) TES and a cytochrome P450 protein TES1, that are required for tentoxin biosynthesis in Alternaria alternata strain ZJ33, which was isolated from blighted leaves of Eupatorium adenophorum. Using a pair of primers designed according to the consensus sequences of the adenylation domain of NRPSs, two fragments containing putative adenylation domains were amplified from A. alternata ZJ33, and subsequent PCR analyses demonstrated that these fragments belonged to the same NRPS coding sequence. With no introns, TES consists of a single 15,486 base pair open reading frame encoding a predicted 5161 amino acid protein. Meanwhile, the TES1 gene is predicted to contain five introns and encode a 506 amino acid protein. The TES protein is predicted to be comprised of four peptide synthase modules with two additional N-methylation domains, and the number and arrangement of the modules in TES were consistent with the number and arrangement of the amino acid residues of tentoxin, respectively. Notably, both TES and TES1 null mutants generated via homologous recombination failed to produce tentoxin. This study provides the first evidence concerning the biosynthesis of tentoxin in A. alternata.
Search for protein partners of mitochondrial single-stranded DNA-binding protein Rim1p using a yeast two-hybrid system.

PubMed

Kucejová, B; Foury, F

2003-01-01

RIM1 is a nuclear gene of the yeast Saccharomyces cerevisiae coding for a protein with single-stranded DNA-binding activity that is essential for mitochondrial genome maintenance. No protein partners of Rim1p have been described so far in yeast. To better understand the role of this protein in mitochondrial DNA replication and recombination, a search for protein interactors by the yeast two-hybrid system was performed. This approach led to the identification of several candidates, including a putative transcription factor, Azf1p, and Mph1p, a protein with an RNA helicase domain which is known to influence the mutation rate of nuclear and mitochondrial genomes.
Mapping the membrane proteome of anaerobic gut fungi identifies a wealth of carbohydrate binding proteins and transporters

DOE PAGES

Seppala, Susanna; Solomon, Kevin V.; Gilmore, Sean P.; ...

2016-12-20

Here, engineered cell factories that convert biomass into value-added compounds are emerging as a timely alternative to petroleum-based industries. Although often overlooked, integral membrane proteins such as solute transporters are pivotal for engineering efficient microbial chassis. Anaerobic gut fungi, adapted to degrade raw plant biomass in the intestines of herbivores, are a potential source of valuable transporters for biotechnology, yet very little is known about the membrane constituents of these non-conventional organisms. Here, we mined the transcriptome of three recently isolated strains of anaerobic fungi to identify membrane proteins responsible for sensing and transporting biomass hydrolysates within a competitive andmore » rather extreme environment. Using sequence analyses and homology, we identified membrane protein-coding sequences from assembled transcriptomes from three strains of anaerobic gut fungi: Neocallimastix californiae, Anaeromyces robustus, and Piromyces finnis. We identified nearly 2000 transporter components: about half of these are involved in the general secretory pathway and intracellular sorting of proteins; the rest are predicted to be small-solute transporters. Unexpectedly, we found a number of putative sugar binding proteins that are associated with prokaryotic uptake systems; and approximately 100 class C G-protein coupled receptors (GPCRs) with non-canonical putative sugar binding domains. In conclusion, we report the first comprehensive characterization of the membrane protein machinery of biotechnologically relevant anaerobic gut fungi. Apart from identifying conserved machinery for protein sorting and secretion, we identify a large number of putative solute transporters that are of interest for biotechnological applications. Notably, our data suggests that the fungi display a plethora of carbohydrate binding domains at their surface, perhaps as a means to sense and sequester some of the sugars that their biomass degrading, extracellular enzymes produce.« less
Mapping the membrane proteome of anaerobic gut fungi identifies a wealth of carbohydrate binding proteins and transporters.

PubMed

Seppälä, Susanna; Solomon, Kevin V; Gilmore, Sean P; Henske, John K; O'Malley, Michelle A

2016-12-20

Engineered cell factories that convert biomass into value-added compounds are emerging as a timely alternative to petroleum-based industries. Although often overlooked, integral membrane proteins such as solute transporters are pivotal for engineering efficient microbial chassis. Anaerobic gut fungi, adapted to degrade raw plant biomass in the intestines of herbivores, are a potential source of valuable transporters for biotechnology, yet very little is known about the membrane constituents of these non-conventional organisms. Here, we mined the transcriptome of three recently isolated strains of anaerobic fungi to identify membrane proteins responsible for sensing and transporting biomass hydrolysates within a competitive and rather extreme environment. Using sequence analyses and homology, we identified membrane protein-coding sequences from assembled transcriptomes from three strains of anaerobic gut fungi: Neocallimastix californiae, Anaeromyces robustus, and Piromyces finnis. We identified nearly 2000 transporter components: about half of these are involved in the general secretory pathway and intracellular sorting of proteins; the rest are predicted to be small-solute transporters. Unexpectedly, we found a number of putative sugar binding proteins that are associated with prokaryotic uptake systems; and approximately 100 class C G-protein coupled receptors (GPCRs) with non-canonical putative sugar binding domains. We report the first comprehensive characterization of the membrane protein machinery of biotechnologically relevant anaerobic gut fungi. Apart from identifying conserved machinery for protein sorting and secretion, we identify a large number of putative solute transporters that are of interest for biotechnological applications. Notably, our data suggests that the fungi display a plethora of carbohydrate binding domains at their surface, perhaps as a means to sense and sequester some of the sugars that their biomass degrading, extracellular enzymes produce.
Probability of coding of a DNA sequence: an algorithm to predict translated reading frames from their thermodynamic characteristics.

PubMed Central

Tramontano, A; Macchiato, M F

1986-01-01

An algorithm to determine the probability that a reading frame codifies for a protein is presented. It is based on the results of our previous studies on the thermodynamic characteristics of a translated reading frame. We also develop a prediction procedure to distinguish between coding and non-coding reading frames. The procedure is based on the characteristics of the putative product of the DNA sequence and not on periodicity characteristics of the sequence, so the prediction is not biased by the presence of overlapping translated reading frames or by the presence of translated reading frames on the complementary DNA strand. PMID:3753761
Biodegradation of the Organic Disulfide 4,4′-Dithiodibutyric Acid by Rhodococcus spp.

PubMed Central

Khairy, Heba; Wübbeler, Jan Hendrik

2015-01-01

Four Rhodococcus spp. exhibited the ability to use 4,4′-dithiodibutyric acid (DTDB) as a sole carbon source for growth. The most important step for the production of a novel polythioester (PTE) using DTDB as a precursor substrate is the initial cleavage of DTDB. Thus, identification of the enzyme responsible for this step was mandatory. Because Rhodococcus erythropolis strain MI2 serves as a model organism for elucidation of the biodegradation of DTDB, it was used to identify the genes encoding the enzymes involved in DTDB utilization. To identify these genes, transposon mutagenesis of R. erythropolis MI2 was carried out using transposon pTNR-TA. Among 3,261 mutants screened, 8 showed no growth with DTDB as the sole carbon source. In five mutants, the insertion locus was mapped either within a gene coding for a polysaccharide deacetyltransferase, a putative ATPase, or an acetyl coenzyme A transferase, 1 bp upstream of a gene coding for a putative methylase, or 176 bp downstream of a gene coding for a putative kinase. In another mutant, the insertion was localized between genes encoding a putative transcriptional regulator of the TetR family (noxR) and an NADH:flavin oxidoreductase (nox). Moreover, in two other mutants, the insertion loci were mapped within a gene encoding a hypothetical protein in the vicinity of noxR and nox. The interruption mutant generated, R. erythropolis MI2 noxΩtsr, was unable to grow with DTDB as the sole carbon source. Subsequently, nox was overexpressed and purified, and its activity with DTDB was measured. The specific enzyme activity of Nox amounted to 1.2 ± 0.15 U/mg. Therefore, we propose that Nox is responsible for the initial cleavage of DTDB into 2 molecules of 4-mercaptobutyric acid (4MB). PMID:26407888
DOE Office of Scientific and Technical Information (OSTI.GOV)

Seppala, Susanna; Solomon, Kevin V.; Gilmore, Sean P.

Here, engineered cell factories that convert biomass into value-added compounds are emerging as a timely alternative to petroleum-based industries. Although often overlooked, integral membrane proteins such as solute transporters are pivotal for engineering efficient microbial chassis. Anaerobic gut fungi, adapted to degrade raw plant biomass in the intestines of herbivores, are a potential source of valuable transporters for biotechnology, yet very little is known about the membrane constituents of these non-conventional organisms. Here, we mined the transcriptome of three recently isolated strains of anaerobic fungi to identify membrane proteins responsible for sensing and transporting biomass hydrolysates within a competitive andmore » rather extreme environment. Using sequence analyses and homology, we identified membrane protein-coding sequences from assembled transcriptomes from three strains of anaerobic gut fungi: Neocallimastix californiae, Anaeromyces robustus, and Piromyces finnis. We identified nearly 2000 transporter components: about half of these are involved in the general secretory pathway and intracellular sorting of proteins; the rest are predicted to be small-solute transporters. Unexpectedly, we found a number of putative sugar binding proteins that are associated with prokaryotic uptake systems; and approximately 100 class C G-protein coupled receptors (GPCRs) with non-canonical putative sugar binding domains. In conclusion, we report the first comprehensive characterization of the membrane protein machinery of biotechnologically relevant anaerobic gut fungi. Apart from identifying conserved machinery for protein sorting and secretion, we identify a large number of putative solute transporters that are of interest for biotechnological applications. Notably, our data suggests that the fungi display a plethora of carbohydrate binding domains at their surface, perhaps as a means to sense and sequester some of the sugars that their biomass degrading, extracellular enzymes produce.« less
Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

PubMed

Seligmann, Hervé

2013-03-01

Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Avian sarcoma virus 17 carries the jun oncogene.

PubMed Central

Maki, Y; Bos, T J; Davis, C; Starbuck, M; Vogt, P K

1987-01-01

Biologically active molecular clones of avian sarcoma virus 17 (ASV 17) contain a replication-defective proviral genome of 3.5 kilobases (kb). The genome retains partial gag and env sequences, which flank a cell-derived putative oncogene of 0.93 kb, termed jun. The jun gene lacks preserved coding domains of tyrosine-specific protein kinases. It also shows no significant nucleic acid homology with other known oncogenes. The probable transformation-specific protein in ASV 17-transformed cells is a 55-kDa gag-jun fusion product. Images PMID:3033666
Enzymes involved in the anaerobic degradation of ortho-phthalate by the nitrate-reducing bacterium Azoarcus sp. strain PA01.

PubMed

Junghare, Madan; Spiteller, Dieter; Schink, Bernhard

2016-09-01

The pathway of anaerobic degradation of o-phthalate was studied in the nitrate-reducing bacterium Azoarcus sp. strain PA01. Differential two-dimensional protein gel profiling allowed the identification of specifically induced proteins in o-phthalate-grown compared to benzoate-grown cells. The genes encoding o-phthalate-induced proteins were found in a 9.9 kb gene cluster in the genome of Azoarcus sp. strain PA01. The o-phthalate-induced gene cluster codes for proteins homologous to a dicarboxylic acid transporter, putative CoA-transferases and a UbiD-like decarboxylase that were assigned to be specifically involved in the initial steps of anaerobic o-phthalate degradation. We propose that o-phthalate is first activated to o-phthalyl-CoA by a putative succinyl-CoA-dependent succinyl-CoA:o-phthalate CoA-transferase, and o-phthalyl-CoA is subsequently decarboxylated to benzoyl-CoA by a putative o-phthalyl-CoA decarboxylase. Results from in vitro enzyme assays with cell-free extracts of o-phthalate-grown cells demonstrated the formation of o-phthalyl-CoA from o-phthalate and succinyl-CoA as CoA donor, and its subsequent decarboxylation to benzoyl-CoA. The putative succinyl-CoA:o-phthalate CoA-transferase showed high substrate specificity for o-phthalate and did not accept isophthalate, terephthalate or 3-fluoro-o-phthalate whereas the putative o-phthalyl-CoA decarboxylase converted fluoro-o-phthalyl-CoA to fluoro-benzoyl-CoA. No decarboxylase activity was observed with isophthalyl-CoA or terephthalyl-CoA. Both enzyme activities were oxygen-insensitive and inducible only after growth with o-phthalate. Further degradation of benzoyl-CoA proceeds analogous to the well-established anaerobic benzoyl-CoA degradation pathway of nitrate-reducing bacteria. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.

The complete mitogenome of the whale shark parasitic copepod Pandarus rhincodonicus norman, Newbound & Knott (Crustacea; Siphonostomatoida; Pandaridae)--a new gene order for the copepoda.

PubMed

Austin, Christopher M; Tan, Mun Hua; Lee, Yin Peng; Croft, Laurence J; Meekan, Mark G; Pierce, Simon J; Gan, Han Ming

2016-01-01

The complete mitochondrial genome of the parasitic copepod Pandarus rhincodonicus was obtained from a partial genome scan using the HiSeq sequencing system. The Pandarus rhincodonicus mitogenome has 14,480 base pairs (62% A+T content) made up of 12 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs, and a putative 384 bp non-coding AT-rich region. This Pandarus mitogenome sequence is the first for the family Pandaridae, the second for the order Siphonostomatoida and the sixth for the Copepoda.
Cloning and identification of bacteriophage T4 gene 2 product gp2 and action of gp2 on infecting DNA in vivo.

PubMed Central

Lipinska, B; Rao, A S; Bolten, B M; Balakrishnan, R; Goldberg, E B

1989-01-01

We sequenced bacteriophage T4 genes 2 and 3 and the putative C-terminal portion of gene 50. They were found to have appropriate open reading frames directed counterclockwise on the T4 map. Mutations in genes 2 and 64 were shown to be in the same open reading frame, which we now call gene 2. This gene codes for a protein of 27,068 daltons. The open reading frame corresponding to gene 3 codes for a protein of 20,634 daltons. Appropriate bands on polyacrylamide gels were identified at 30 and 20 kilodaltons, respectively. We found that the product of the cloned gene 2 can protect T4 DNA double-stranded ends from exonuclease V action. Images PMID:2644202
Molecular cloning of chitinase 33 (chit33) gene from Trichoderma atroviride

PubMed Central

Matroudi, S.; Zamani, M.R.; Motallebi, M.

2008-01-01

In this study Trichoderma atroviride was selected as over producer of chitinase enzyme among 30 different isolates of Trichoderma sp. on the basis of chitinase specific activity. From this isolate the genomic and cDNA clones encoding chit33 have been isolated and sequenced. Comparison of genomic and cDNA sequences for defining gene structure indicates that this gene contains three short introns and also an open reading frame coding for a protein of 321 amino acids. The deduced amino acid sequence includes a 19 aa putative signal peptide. Homology between this sequence and other reported Trichoderma Chit33 proteins are discussed. The coding sequence of chit33 gene was cloned in pEt26b(+) expression vector and expressed in E. coli. PMID:24031242
Brome mosaic virus, good for an RNA virologist's basic needs.

PubMed

Kao, C C; Sivakumaran, K

2000-03-01

Abstract Taxonomic relationship: Type member of the Bromovirus genus, family Bromoviridae. A member of the alphavirus-like supergroup of positive-sense single-stranded RNA viruses. Physical properties: Virions are nonenveloped icosahedrals made up of 180 coat protein subunits (Fig. 1). The particles are 26 nm in diameter and contain 22% nucleic acid and 78% protein. The BMV genome is composed of three positive-sense, capped RNAs: RNA1 (3.2 kb), RNA2 (2.9 kb), RNA3 (2.1 kb) (Fig. 2). Viral proteins: RNA1 encodes protein 1a, containing capping and putative RNA helicase activities. RNA2 encodes protein 2a, a putative RNA-dependent RNA polymerase. RNA3 codes for two proteins: 3a, which is required for cell-to-cell movement, and the capsid protein. The capsid is translated from a subgenomic RNA, RNA4 (1.2 kb). Hosts: Monocots in the Poacea family, including Bromus inermis, Zea mays and Hordeum vulgare, in which BMV causes brown streaks. BMV can also infect the dicots Nicotiana benthamiana and several Chenopodium species. In N. benthamiana, the infection is asymptomatic while infection of Chenopodium can cause either necrotic or chlorotic lesions. Useful website:http://www4.ncbi.nlm.nih.gov/ICTVdb/ICTVdB/10030001.htm.
Characterisation of the genomes of four putative vesiculoviruses: tench rhabdovirus, grass carp rhabdovirus, perch rhabdovirus and eel rhabdovirus European X.

PubMed

Stone, David M; Kerr, Rose C; Hughes, Margaret; Radford, Alan D; Darby, Alistair C

2013-11-01

The complete coding sequences were determined for four putative vesiculoviruses isolated from fish. Sequence alignment and phylogenetic analysis based on the predicted amino acid sequences of the five main proteins assigned tench rhabdovirus and grass carp rhabdovirus together with spring viraemia of carp and pike fry rhabdovirus to a lineage that was distinct from the mammalian vesiculoviruses. Perch rhabdovirus, eel virus European X, lake trout rhabdovirus 903/87 and sea trout virus were placed in a second lineage that was also distinct from the recognised genera in the family Rhabdoviridae. Establishment of two new rhabdovirus genera, "Perhabdovirus" and "Sprivivirus", is discussed.
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE PAGES

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...

2018-05-16

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Mutant phenotypes for thousands of bacterial genes of unknown function

DOE Office of Scientific and Technical Information (OSTI.GOV)

Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan

One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Multiplexed pyrosequencing of nine sea anemone (Cnidaria: Anthozoa: Hexacorallia: Actiniaria) mitochondrial genomes.

PubMed

Foox, Jonathan; Brugler, Mercer; Siddall, Mark Edward; Rodríguez, Estefanía

2016-07-01

Six complete and three partial actiniarian mitochondrial genomes were amplified in two semi-circles using long-range PCR and pyrosequenced in a single run on a 454 GS Junior, doubling the number of complete mitogenomes available within the order. Typical metazoan mtDNA features included circularity, 13 protein-coding genes, 2 ribosomal RNA genes, and length ranging from 17,498 to 19,727 bp. Several typical anthozoan mitochondrial genome features were also observed including the presence of only two transfer RNA genes, elevated A + T richness ranging from 54.9 to 62.4%, large intergenic regions, and group 1 introns interrupting NADH dehydrogenase subunit 5 and cytochrome c oxidase subunit I, the latter of which possesses a homing endonuclease gene. Within the sea anemone Alicia sansibarensis, we report the first mitochondrial gene order rearrangement within the Actiniaria, as well as putative novel non-canonical protein-coding genes. Phylogenetic analyses of all 13 protein-coding and 2 ribosomal genes largely corroborated current hypotheses of sea anemone interrelatedness, with a few lower-level differences.
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript

PubMed Central

Rose, Dominic; Stadler, Peter F.

2011-01-01

Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Accumulation of multiple mutations in linezolid-resistant Staphylococcus epidermidis causing bloodstream infections; in silico analysis of L3 amino acid substitutions that might confer high-level linezolid resistance.

PubMed

Ikonomidis, Alexandros; Grapsa, Anastasia; Pavlioglou, Charikleia; Demiri, Antonia; Batarli, Alexandra; Panopoulou, Maria

2016-12-01

Fifty-six Staphylococcus epidermidis clinical isolates, showing high-level linezolid resistance and causing bacteremia in critically ill patients, were studied. All isolates belonged to ST22 clone and carried the T2504A and C2534T mutations in gene coding for 23SrRNA as well as the C189A, G208A, C209T and G384C missense mutations in L3 protein which resulted in Asp159Tyr, Gly152Asp and Leu94Val substitutions. Other silent mutations were also detected in genes coding for ribosomal proteins L3 and L22. In silico analysis of missense mutations showed that although L3 protein retained the sequence of secondary motifs, the tertiary structure was influenced. The observed alteration in L3 protein folding provides an indication on the putative role of L3-coding gene mutations in high-level linezolid resistance. Furthermore, linezolid pressure in health care settings where linezolid consumption is of high rates might lead to the selection of resistant mutants possessing L3 mutations that might confer high-level linezolid resistance.
The Salivary Protein Repertoire of the Polyphagous Spider Mite Tetranychus urticae: A Quest for Effectors.

PubMed

Jonckheere, Wim; Dermauw, Wannes; Zhurov, Vladimir; Wybouw, Nicky; Van den Bulcke, Jan; Villarroel, Carlos A; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C; Tirry, Luc; Baggerman, Geert; Clark, Richard M; Kant, Merijn R; Vanholme, Bartel; Menschaert, Gerben; Van Leeuwen, Thomas

2016-12-01

The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants-bean, maize, soy, and tomato-was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belonging to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
The Salivary Protein Repertoire of the Polyphagous Spider Mite Tetranychus urticae: A Quest for Effectors*

PubMed Central

Jonckheere, Wim; Zhurov, Vladimir; Villarroel, Carlos A.; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C.; Tirry, Luc; Kant, Merijn R.; Vanholme, Bartel

2016-01-01

The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants—bean, maize, soy, and tomato—was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belonging to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. PMID:27703040
Analysis of the Genome of the Sexually Transmitted Insect Virus Helicoverpa zea Nudivirus 2

PubMed Central

Burand, John P.; Kim, Woojin; Afonso, Claudio L.; Tulman, Edan R.; Kutish, Gerald F.; Lu, Zhiqiang; Rock, Daniel L.

2012-01-01

The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea. PMID:22355451
Analysis of the genome of the sexually transmitted insect virus Helicoverpa zea nudivirus 2.

PubMed

Burand, John P; Kim, Woojin; Afonso, Claudio L; Tulman, Edan R; Kutish, Gerald F; Lu, Zhiqiang; Rock, Daniel L

2012-01-01

The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea.
Complete mitochondrial DNA sequence of the Eastern keelback mullet Liza affinis.

PubMed

Gong, Xiaoling; Zhu, Wenjia; Bao, Baolong

2016-05-01

Eastern keelback mullet (Liza affinis) inhabits inlet waters and estuaries of rivers. In this paper, we initially determined the complete mitochondrial genome of Liza affinis. The entire mtDNA sequence is 16,831 bp in length, including 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes and 1 putative control region. Its order and numbers of genes are similar to most bony fishes.
Protein and gene structure of a blue laccase from Pleurotus ostreatus1.

PubMed Central

Giardina, P; Palmieri, G; Scaloni, A; Fontanella, B; Faraco, V; Cennamo, G; Sannia, G

1999-01-01

A new laccase isoenzyme (POXA1b, where POX is phenol oxidase), produced by Pleurotus ostreatus in cultures supplemented with copper sulphate, has been purified and fully characterized. The main characteristics of this protein (molecular mass in native and denaturing conditions, pI and catalytic properties) are almost identical to the previously studied laccase POXA1w. However, POXA1b contains four copper atoms per molecule instead of one copper, two zinc and one iron atom per molecule of POXA1w. Furthermore, POXA1b shows an unusually high stability at alkaline pH. The gene and cDNA coding for POXA1b have been cloned and sequenced. The gene coding sequence contains 1599 bp, interrupted by 15 introns. Comparison of the structure of the poxa1b gene with the two previously studied P. ostreatus laccase genes (pox1 and poxc) suggests that these genes belong to two different subfamilies. The amino acid sequence of POXA1b deduced from the cDNA sequence has been almost completely verified by means of matrix-assisted laser desorption ionization MS. It has been demonstrated that three out of six putative glycosylation sites are post-translationally modified and the structure of the bound glycosidic moieties has been determined, whereas two other putative glycosylation sites are unmodified. PMID:10417329
Network analysis of S. aureus response to ramoplanin reveals modules for virulence factors and resistance mechanisms and characteristic novel genes.

PubMed

Subramanian, Devika; Natarajan, Jeyakumar

2015-12-10

Staphylococcus aureus is a major human pathogen and ramoplanin is an antimicrobial attributed for effective treatment. The goal of this study was to examine the transcriptomic profiles of ramoplanin sensitive and resistant S. aureus to identify putative modules responsible for virulence and resistance-mechanisms and its characteristic novel genes. The dysregulated genes were used to reconstruct protein functional association networks for virulence-factors and resistance-mechanisms individually. Strong link between metabolic-pathways and development of virulence/resistance is suggested. We identified 15 putative modules of virulence factors. Six hypothetical genes were annotated with novel virulence activity among which SACOL0281 was discovered to be an essential virulence factor EsaD. The roles of MazEF toxin-antitoxin system, SACOL0202/SACOL0201 two-component system and that of amino-sugar and nucleotide-sugar metabolism in virulence are also suggested. In addition, 14 putative modules of resistance mechanisms including modules of ribosomal protein-coding genes and metabolic pathways such as biotin-synthesis, TCA-cycle, riboflavin-biosynthesis, peptidoglycan-biosynthesis etc. are also indicated. Copyright © 2015 Elsevier B.V. All rights reserved.
In vivo identification of tumor suppressive PTEN ceRNAs in an oncogenic BRAF-induced mouse model of melanoma

PubMed Central

Karreth, Florian A.; Tay, Yvonne; Perna, Daniele; Ala, Ugo; Tan, Shen Mynn; Rust, Alistair G.; DeNicola, Gina; Webster, Kaitlyn A.; Weiss, Dror; Perez-Mancera, Pedro A.; Krauthammer, Michael; Halaban, Ruth; Provero, Paolo; Adams, David J.; Tuveson, David A.; Pandolfi, Pier Paolo

2011-01-01

Summary We recently proposed that competitive endogenous RNAs (ceRNAs) sequester microRNAs to regulate mRNA transcripts containing common microRNA recognition elements (MREs). However, the functional role of ceRNAs in cancer remains unknown. Loss of PTEN, a tumor suppressor regulated by ceRNA activity, frequently occurs in melanoma. Here, we report the discovery of significant enrichment of putative PTEN ceRNAs among genes whose loss accelerates tumorigenesis following Sleeping Beauty insertional mutagenesis in a mouse model of melanoma. We validated several putative PTEN ceRNAs and further characterized one, the ZEB2 transcript. We show that ZEB2 modulates PTEN protein levels in a microRNA-dependent, protein coding-independent manner. Attenuation of ZEB2 expression activates the PI3K/AKT pathway, enhances cell transformation, and commonly occurs in human melanomas and other cancers expressing low PTEN levels. Our study genetically identifies multiple putative microRNA decoys for PTEN, validates ZEB2 mRNA as a bona fide PTEN ceRNA, and demonstrates that abrogated ZEB2 expression cooperates with BRAFV600E to promote melanomagenesis. PMID:22000016
The LacI family protein GlyR3 co-regulates the celC operon and manB in Clostridium thermocellum

DOE PAGES

Choi, Jinlyung; Klingeman, Dawn M.; Brown, Steven D.; ...

2017-06-24

In this paper, we demonstrate that the GlyR3 protein mediates the regulation of manB. We first identify putative GlyR3 binding sites within or just upstream of the coding regions of manB and celT. Using an electrophoretic mobility shift assay (EMSA), we determined that a higher concentration of GlyR3 is required to effectively bind to the putative manB site in comparison to the celC site. Neither the putative celT site nor random DNA significantly binds GlyR3. While laminaribiose interfered with GlyR3 binding to the celC binding site, binding to the manB site was unaffected. In the presence of laminaribiose, in vivomore » transcription of the celC–glyR3–licA gene cluster increases, while manB expression is repressed, compared to in the absence of laminaribiose, consistent with the results from the EMSA. An in vitro transcription assay demonstrated that GlyR3 and laminaribiose interactions were responsible for the observed patters of in vivo transcription.« less
Putative Nonribosomal Peptide Synthetase and Cytochrome P450 Genes Responsible for Tentoxin Biosynthesis in Alternaria alternata ZJ33

PubMed Central

Li, You-Hai; Han, Wen-Jin; Gui, Xi-Wu; Wei, Tao; Tang, Shuang-Yan; Jin, Jian-Ming

2016-01-01

Tentoxin, a cyclic tetrapeptide produced by several Alternaria species, inhibits the F1-ATPase activity of chloroplasts, resulting in chlorosis in sensitive plants. In this study, we report two clustered genes, encoding a putative non-ribosome peptide synthetase (NRPS) TES and a cytochrome P450 protein TES1, that are required for tentoxin biosynthesis in Alternaria alternata strain ZJ33, which was isolated from blighted leaves of Eupatorium adenophorum. Using a pair of primers designed according to the consensus sequences of the adenylation domain of NRPSs, two fragments containing putative adenylation domains were amplified from A. alternata ZJ33, and subsequent PCR analyses demonstrated that these fragments belonged to the same NRPS coding sequence. With no introns, TES consists of a single 15,486 base pair open reading frame encoding a predicted 5161 amino acid protein. Meanwhile, the TES1 gene is predicted to contain five introns and encode a 506 amino acid protein. The TES protein is predicted to be comprised of four peptide synthase modules with two additional N-methylation domains, and the number and arrangement of the modules in TES were consistent with the number and arrangement of the amino acid residues of tentoxin, respectively. Notably, both TES and TES1 null mutants generated via homologous recombination failed to produce tentoxin. This study provides the first evidence concerning the biosynthesis of tentoxin in A. alternata. PMID:27490569

Draft Genome Sequences of Two Bacillus thuringiensis Strains and Characterization of a Putative 41.9-kDa Insecticidal Toxin

PubMed Central

Palma, Leopoldo; Muñoz, Delia; Berry, Colin; Murillo, Jesús; Caballero, Primitivo

2014-01-01

In this work, we report the genome sequencing of two Bacillus thuringiensis strains using Illumina next-generation sequencing technology (NGS). Strain Hu4-2, toxic to many lepidopteran pest species and to some mosquitoes, encoded genes for two insecticidal crystal (Cry) proteins, cry1Ia and cry9Ea, and a vegetative insecticidal protein (Vip) gene, vip3Ca2. Strain Leapi01 contained genes coding for seven Cry proteins (cry1Aa, cry1Ca, cry1Da, cry2Ab, cry9Ea and two cry1Ia gene variants) and a vip3 gene (vip3Aa10). A putative novel insecticidal protein gene 1143 bp long was found in both strains, whose sequences exhibited 100% nucleotide identity. The predicted protein showed 57 and 100% pairwise identity to protein sequence 72 from a patented Bt strain (US8318900) and to a putative 41.9-kDa insecticidal toxin from Bacillus cereus, respectively. The 41.9-kDa protein, containing a C-terminal 6× HisTag fusion, was expressed in Escherichia coli and tested for the first time against four lepidopteran species (Mamestra brassicae, Ostrinia nubilalis, Spodoptera frugiperda and S. littoralis) and the green-peach aphid Myzus persicae at doses as high as 4.8 µg/cm2 and 1.5 mg/mL, respectively. At these protein concentrations, the recombinant 41.9-kDa protein caused no mortality or symptoms of impaired growth against any of the insects tested, suggesting that these species are outside the protein’s target range or that the protein may not, in fact, be toxic. While the use of the polymerase chain reaction has allowed a significant increase in the number of Bt insecticidal genes characterized to date, novel NGS technologies promise a much faster, cheaper and efficient screening of Bt pesticidal proteins. PMID:24784323
Induction of multixenobiotic defense mechanisms in resistant Daphnia magna clones as a general cellular response to stress.

PubMed

Jordão, Rita; Campos, Bruno; Lemos, Marco F L; Soares, Amadeu M V M; Tauler, Romà; Barata, Carlos

2016-06-01

Multixenobiotic resistance mechanisms (MXR) were recently identified in Daphnia magna. Previous results characterized gene transcripts of genes encoding and efflux activities of four putative ABCB1 and ABCC transporters that were chemically induced but showed low specificity against model transporter substrates and inhibitors, thus preventing us from distinguishing between activities of different efflux transporter types. In this study we report on the specificity of induction of ABC transporters and of the stress protein hsp70 in clones selected to be genetically resistant to ABCB1 chemical substrates. Clones resistant to mitoxantrone, ivermectin and pentachlorophenol showed distinctive transcriptional responses of transporter protein coding genes and of putative transporter dye activities. Expression of hsp70 proteins also varied across resistant clones. Clones resistant to mitoxantrone and pentachlorophenol showed high constitutive levels of hsp70. Transcriptional levels of the abcb1 gene transporter and of putative dye transporter activity were also induced to a greater extent in the pentachlorophenol resistant clone. Observed higher dye transporter activities in individuals from clones resistant to mitoxantrone and ivermectin were unrelated with transcriptional levels of the studied four abcc and abcb1 transporter genes. These findings suggest that Abcb1 induction in D. magna may be a part of a general cellular stress response. Copyright © 2016 Elsevier B.V. All rights reserved.
The spectrum of low molecular weight alpha-amylase/protease inhibitor genes expressed in the US bread wheat cultivar Butte 86

PubMed Central

2011-01-01

Background Wheat grains accumulate a variety of low molecular weight proteins that are inhibitors of alpha-amylases and proteases and play an important protective role in the grain. These proteins have more balanced amino acid compositions than the major wheat gluten proteins and contribute important reserves for both seedling growth and human nutrition. The alpha-amylase/protease inhibitors also are of interest because they cause IgE-mediated occupational and food allergies and thereby impact human health. Results The complement of genes encoding alpha-amylase/protease inhibitors expressed in the US bread wheat Butte 86 was characterized by analysis of expressed sequence tags (ESTs). Coding sequences for 19 distinct proteins were identified. These included two monomeric (WMAI), four dimeric (WDAI), and six tetrameric (WTAI) inhibitors of exogenous alpha-amylases, two inhibitors of endogenous alpha-amylases (WASI), four putative trypsin inhibitors (CMx and WTI), and one putative chymotrypsin inhibitor (WCI). A number of the encoded proteins were identical or very similar to proteins in the NCBI database. Sequences not reported previously included variants of WTAI-CM3, three CMx inhibitors and WTI. Within the WDAI group, two different genes encoded the same mature protein. Based on numbers of ESTs, transcripts for WTAI-CM3 Bu-1, WMAI Bu-1 and WTAI-CM16 Bu-1 were most abundant in Butte 86 developing grain. Coding sequences for 16 of the inhibitors were unequivocally associated with specific proteins identified by tandem mass spectrometry (MS/MS) in a previous proteomic analysis of milled white flour from Butte 86. Proteins corresponding to WDAI Bu-1/Bu-2, WMAI Bu-1 and the WTAI subunits CM2 Bu-1, CM3 Bu-1 and CM16 Bu-1 were accumulated to the highest levels in flour. Conclusions Information on the spectrum of alpha-amylase/protease inhibitor genes and proteins expressed in a single wheat cultivar is central to understanding the importance of these proteins in both plant defense mechanisms and human allergies and facilitates both breeding and biotechnology approaches for manipulating the composition of these proteins in plants. PMID:21774824
Draft Genome Sequence of a Novel Chitinophaga sp. Strain, MD30, Isolated from a Biofilm in an Air Conditioner Condensate Pipe

PubMed Central

Darris, Maxwell

2017-01-01

ABSTRACT Most of the 24 known Chitinophaga species were originally isolated from soils. We report the draft genome sequence of a putatively novel Chitinophaga sp. from a biofilm in an air conditioner condensate pipe. The genome comprises 7,661,303 bp in one scaffold, 5,694 predicted protein-coding sequences, and a G+C content of 47.6%. PMID:29051259
Complete mitochondrial genome of Platevindex sp. (Gastropoda: Pulmonata: Systellommatophora: Onchidiidae).

PubMed

Liu, Chen; Shen, He Ding; Zhou, Na

2016-01-01

The complete mitochondrial genome sequence of Platevindex sp. is firstly described in the article. The mitogenome (13,908 bp) contains 22 tRNA genes, 2 ribosomal RNA genes and 13 protein-coding genes, and 1 putative control region (CR). CR is not well characterized due to lack of discrete conserved sequence blocks. This characteristic is similar with CRs of other invertebrate mitochondrial genomes. The characteristic is the typical bivalvia mitochondrial gene composition.
Draft Genome Sequence of Janthinobacterium sp. Strain ROICE36, a Putative Secondary Metabolite-Synthesizing Bacterium Isolated from Antarctic Snow

PubMed Central

Chiriac, Cecilia; Baricz, Andreea

2018-01-01

ABSTRACT The draft genome assembly of Janthinobacterium sp. strain ROICE36 has 207 contigs, with a total genome size of 5,977,006 bp and a G+C content of 62%. Preliminary genome analysis identified 5,363 protein-coding genes and a total of 7 secondary metabolic gene clusters (encoding bacteriocins, nonribosomal peptide-synthetase [NRPS], terpene, hserlactone, and other ketide synthases). PMID:29650588
SORL1 variants across Alzheimer's disease European American cohorts.

PubMed

Fernández, Maria Victoria; Black, Kathleen; Carrell, David; Saef, Ben; Budde, John; Deming, Yuetiva; Howells, Bill; Del-Aguila, Jorge L; Ma, Shengmei; Bi, Catherine; Norton, Joanne; Chasse, Rachel; Morris, John; Goate, Alison; Cruchaga, Carlos

2016-12-01

The accumulation of the toxic Aβ peptide in Alzheimer's disease (AD) largely relies upon an efficient recycling of amyloid precursor protein (APP). Recent genetic association studies have described rare variants in SORL1 with putative pathogenic consequences in the recycling of APP. In this work, we examine the presence of rare coding variants in SORL1 in three different European American cohorts: early-onset, late-onset AD (LOAD) and familial LOAD.
Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes.

PubMed

Seligmann, Hervé; Warthi, Ganesh

2017-01-01

A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Mu-Like Prophage in Serogroup B Neisseria meningitidis Coding for Surface-Exposed Antigens

PubMed Central

Masignani, Vega; Giuliani, Marzia Monica; Tettelin, Hervé; Comanducci, Maurizio; Rappuoli, Rino; Scarlato, Vincenzo

2001-01-01

Sequence analysis of the genome of Neisseria meningititdis serogroup B revealed the presence of an ∼35-kb region inserted within a putative gene coding for an ABC-type transporter. The region contains 46 open reading frames, 29 of which are colinear and homologous to the genes of Escherichia coli Mu phage. Two prophages with similar organizations were also found in serogroup A meningococcus, and one was found in Haemophilus influenzae. Early and late phage functions are well preserved in this family of Mu-like prophages. Several regions of atypical nucleotide content were identified. These likely represent genes acquired by horizontal transfer. Three of the acquired genes are shown to code for surface-associated antigens, and the encoded proteins are able to induce bactericidal antibodies. PMID:11254622
Diversity and Divergence of Dinoflagellate Histone Proteins

PubMed Central

Marinov, Georgi K.; Lynch, Michael

2015-01-01

Histone proteins and the nucleosomal organization of chromatin are near-universal eukaroytic features, with the exception of dinoflagellates. Previous studies have suggested that histones do not play a major role in the packaging of dinoflagellate genomes, although several genomic and transcriptomic surveys have detected a full set of core histone genes. Here, transcriptomic and genomic sequence data from multiple dinoflagellate lineages are analyzed, and the diversity of histone proteins and their variants characterized, with particular focus on their potential post-translational modifications and the conservation of the histone code. In addition, the set of putative epigenetic mark readers and writers, chromatin remodelers and histone chaperones are examined. Dinoflagellates clearly express the most derived set of histones among all autonomous eukaryote nuclei, consistent with a combination of relaxation of sequence constraints imposed by the histone code and the presence of numerous specialized histone variants. The histone code itself appears to have diverged significantly in some of its components, yet others are conserved, implying conservation of the associated biochemical processes. Specifically, and with major implications for the function of histones in dinoflagellates, the results presented here strongly suggest that transcription through nucleosomal arrays happens in dinoflagellates. Finally, the plausible roles of histones in dinoflagellate nuclei are discussed. PMID:26646152
[Cloning, sequencing and prokaryotic expression of cDNAs for the antifreeze protein family from the beetle Tenebrio molitor].

PubMed

Liu, Zhong-Yuan; Wang, Yun; Lü, Guo-Dong; Wang, Xian-Lei; Zhang, Fu-Chun; Ma, Ji

2006-12-01

The partial cDNA sequence coding for the antifreeze proteins in the Tenebrio molitor was obtained by RT-PCR. Sequence analysis revealed nine putative cDNAs with a high degree of homology to Tenebrio molitor antifreeze proteins. The recombinant pGEX-4T-1-tmafp-XJ430 was introduced into E. coli BL21 to induce a GST fusion protein by IPTG. SDS-PAGE of the fusion protein demonstrated that the antifreeze protein migrated at a size of 38 kDa. The immunization was performed by intra-muscular injection of pCDNA3-tmafp-XJ430, and then antiserum was detected by ELISA. The titer of the antibody was 1:2,000. Western blotting analysis showed the antiserum was specific against the antifreeze protein. This finding could lead to further investigation of the properties and function of antifreeze proteins.
Biodegradation of the organic disulfide 4,4'-dithiodibutyric acid by Rhodococcus spp.

PubMed

Khairy, Heba; Wübbeler, Jan Hendrik; Steinbüchel, Alexander

2015-12-01

Four Rhodococcus spp. exhibited the ability to use 4,4'-dithiodibutyric acid (DTDB) as a sole carbon source for growth. The most important step for the production of a novel polythioester (PTE) using DTDB as a precursor substrate is the initial cleavage of DTDB. Thus, identification of the enzyme responsible for this step was mandatory. Because Rhodococcus erythropolis strain MI2 serves as a model organism for elucidation of the biodegradation of DTDB, it was used to identify the genes encoding the enzymes involved in DTDB utilization. To identify these genes, transposon mutagenesis of R. erythropolis MI2 was carried out using transposon pTNR-TA. Among 3,261 mutants screened, 8 showed no growth with DTDB as the sole carbon source. In five mutants, the insertion locus was mapped either within a gene coding for a polysaccharide deacetyltransferase, a putative ATPase, or an acetyl coenzyme A transferase, 1 bp upstream of a gene coding for a putative methylase, or 176 bp downstream of a gene coding for a putative kinase. In another mutant, the insertion was localized between genes encoding a putative transcriptional regulator of the TetR family (noxR) and an NADH:flavin oxidoreductase (nox). Moreover, in two other mutants, the insertion loci were mapped within a gene encoding a hypothetical protein in the vicinity of noxR and nox. The interruption mutant generated, R. erythropolis MI2 noxΩtsr, was unable to grow with DTDB as the sole carbon source. Subsequently, nox was overexpressed and purified, and its activity with DTDB was measured. The specific enzyme activity of Nox amounted to 1.2 ± 0.15 U/mg. Therefore, we propose that Nox is responsible for the initial cleavage of DTDB into 2 molecules of 4-mercaptobutyric acid (4MB). Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Improving the genome annotation of the acarbose producer Actinoplanes sp. SE50/110 by sequencing enriched 5'-ends of primary transcripts.

PubMed

Schwientek, Patrick; Neshat, Armin; Kalinowski, Jörn; Klein, Andreas; Rückert, Christian; Schneiker-Bekel, Susanne; Wendler, Sergej; Stoye, Jens; Pühler, Alfred

2014-11-20

Actinoplanes sp. SE50/110 is the producer of the alpha-glucosidase inhibitor acarbose, which is an economically relevant and potent drug in the treatment of type-2 diabetes mellitus. In this study, we present the detection of transcription start sites on this genome by sequencing enriched 5'-ends of primary transcripts. Altogether, 1427 putative transcription start sites were initially identified. With help of the annotated genome sequence, 661 transcription start sites were found to belong to the leader region of protein-coding genes with the surprising result that roughly 20% of these genes rank among the class of leaderless transcripts. Next, conserved promoter motifs were identified for protein-coding genes with and without leader sequences. The mapped transcription start sites were finally used to improve the annotation of the Actinoplanes sp. SE50/110 genome sequence. Concerning protein-coding genes, 41 translation start sites were corrected and 9 novel protein-coding genes could be identified. In addition to this, 122 previously undetermined non-coding RNA (ncRNA) genes of Actinoplanes sp. SE50/110 were defined. Focusing on antisense transcription start sites located within coding genes or their leader sequences, it was discovered that 96 of those ncRNA genes belong to the class of antisense RNA (asRNA) genes. The remaining 26 ncRNA genes were found outside of known protein-coding genes. Four chosen examples of prominent ncRNA genes, namely the transfer messenger RNA gene ssrA, the ribonuclease P class A RNA gene rnpB, the cobalamin riboswitch RNA gene cobRS, and the selenocysteine-specific tRNA gene selC, are presented in more detail. This study demonstrates that sequencing of enriched 5'-ends of primary transcripts and the identification of transcription start sites are valuable tools for advanced genome annotation of Actinoplanes sp. SE50/110 and most probably also for other bacteria. Copyright © 2014 Elsevier B.V. All rights reserved.
Influence of putative exopolysaccharide genes on Pseudomonas putida KT2440 biofilm stability.

PubMed

Nilsson, Martin; Chiang, Wen-Chi; Fazli, Mustafa; Gjermansen, Morten; Givskov, Michael; Tolker-Nielsen, Tim

2011-05-01

We report a study of the role of putative exopolysaccharide gene clusters in the formation and stability of Pseudomonas putida KT2440 biofilm. Two novel putative exopolysaccharide gene clusters, pea and peb, were identified, and evidence is provided that they encode products that stabilize P. putida KT2440 biofilm. The gene clusters alg and bcs, which code for proteins mediating alginate and cellulose biosynthesis, were found to play minor roles in P. putida KT2440 biofilm formation and stability under the conditions tested. A P. putida KT2440 derivative devoid of any identifiable exopolysaccharide genes was found to form biofilm with a structure similar to wild-type biofilm, but with a stability lower than that of wild-type biofilm. Based on our data, we suggest that the formation of structured P. putida KT2440 biofilm can occur in the absence of exopolysaccharides; however, exopolysaccharides play a role as structural stabilizers. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
LncRNAs in Secondary Hair Follicle of Cashmere Goat: Identification, Expression, and Their Regulatory Network in Wnt Signaling Pathway.

PubMed

Bai, Wen L; Zhao, Su J; Wang, Ze Y; Zhu, Yu B; Dang, Yun L; Cong, Yu Y; Xue, Hui L; Wang, Wei; Deng, Liang; Guo, Dan; Wang, Shi Q; Zhu, Yan X; Yin, Rong H

2018-07-03

Long noncoding RNAs (lncRNAs) are a novel class of eukaryotic transcripts. They are thought to act as a critical regulator of protein-coding gene expression. Herein, we identified and characterized 13 putative lncRNAs from the expressed sequence tags from secondary hair follicle of Cashmere goat. Furthermore, we investigated their transcriptional pattern in secondary hair follicle of Liaoning Cashmere goat during telogen and anagen phases. Also, we generated intracellular regulatory networks of upregulated lncRNAs at anagen in Wnt signaling pathway based on bioinformatics analysis. The relative expression of six putative lncRNAs (lncRNA-599618, -599556, -599554, -599547, -599531, and -599509) at the anagen phase is significantly higher than that at telogen. Compared with anagen, the relative expression of four putative lncRNAs (lncRNA-599528, -599518, -599511, and -599497) was found to be significantly upregulated at telogen phase. The network generated showed that a rich and complex regulatory relationship of the putative lncRNAs and related miRNAs with their target genes in Wnt signaling pathway. Our results from the present study provided a foundation for further elucidating the functional and regulatory mechanisms of these putative lncRNAs in the development of secondary hair follicle and cashmere fiber growth of Cashmere goat.
Draft Genome Sequence of a Novel Chitinophaga sp. Strain, MD30, Isolated from a Biofilm in an Air Conditioner Condensate Pipe.

PubMed

Wan, Xuehua; Darris, Maxwell; Hou, Shaobin; Donachie, Stuart P

2017-10-19

Most of the 24 known Chitinophaga species were originally isolated from soils. We report the draft genome sequence of a putatively novel Chitinophaga sp. from a biofilm in an air conditioner condensate pipe. The genome comprises 7,661,303 bp in one scaffold, 5,694 predicted protein-coding sequences, and a G+C content of 47.6%. Copyright © 2017 Wan et al.
Molecular Cloning, Characterization, and Differential Expression of a Glucoamylase Gene from the Basidiomycetous Fungus Lentinula edodes

PubMed Central

Zhao, J.; Chen, Y. H.; Kwan, H. S.

2000-01-01

The complete nucleotide sequence of putative glucoamylase gene gla1 from the basidiomycetous fungus Lentinula edodes strain L54 is reported. The coding region of the genomic glucoamylase sequence, which is preceded by eukaryotic promoter elements CAAT and TATA, spans 2,076 bp. The gla1 gene sequence codes for a putative polypeptide of 571 amino acids and is interrupted by seven introns. The open reading frame sequence of the gla1 gene shows strong homology with those of other fungal glucoamylase genes and encodes a protein with an N-terminal catalytic domain and a C-terminal starch-binding domain. The similarity between the Gla1 protein and other fungal glucoamylases is from 45 to 61%, with the region of highest conservation found in catalytic domains and starch-binding domains. We compared the kinetics of glucoamylase activity and levels of gene expression in L. edodes strain L54 grown on different carbon sources (glucose, starch, cellulose, and potato extract) and in various developmental stages (mycelium growth, primordium appearance, and fruiting body formation). Quantitative reverse transcription PCR utilizing pairs of primers specific for gla1 gene expression shows that expression of gla1 was induced by starch and increased during the process of fruiting body formation, which indicates that glucoamylases may play an important role in the morphogenesis of the basidiomycetous fungus. PMID:10831434
Biochemical Characterization of Putative Adenylate Dimethylallyltransferase and Cytokinin Dehydrogenase from Nostoc sp. PCC 7120.

PubMed

Frébortová, Jitka; Greplová, Marta; Seidl, Michael F; Heyl, Alexander; Frébort, Ivo

2015-01-01

Cytokinins, a class of phytohormones, are adenine derivatives common to many different organisms. In plants, these play a crucial role as regulators of plant development and the reaction to abiotic and biotic stress. Key enzymes in the cytokinin synthesis and degradation in modern land plants are the isopentyl transferases and the cytokinin dehydrogenases, respectively. Their encoding genes have been probably introduced into the plant lineage during the primary endosymbiosis. To shed light on the evolution of these proteins, the genes homologous to plant adenylate isopentenyl transferase and cytokinin dehydrogenase were amplified from the genomic DNA of cyanobacterium Nostoc sp. PCC 7120 and expressed in Escherichia coli. The putative isopentenyl transferase was shown to be functional in a biochemical assay. In contrast, no enzymatic activity was detected for the putative cytokinin dehydrogenase, even though the principal domains necessary for its function are present. Several mutant variants, in which conserved amino acids in land plant cytokinin dehydrogenases had been restored, were inactive. A combination of experimental data with phylogenetic analysis indicates that adenylate-type isopentenyl transferases might have evolved several times independently. While the Nostoc genome contains a gene coding for protein with characteristics of cytokinin dehydrogenase, the organism is not able to break down cytokinins in the way shown for land plants.
Biochemical Characterization of Putative Adenylate Dimethylallyltransferase and Cytokinin Dehydrogenase from Nostoc sp. PCC 7120

PubMed Central

Frébortová, Jitka; Greplová, Marta; Seidl, Michael F.; Heyl, Alexander; Frébort, Ivo

2015-01-01

Cytokinins, a class of phytohormones, are adenine derivatives common to many different organisms. In plants, these play a crucial role as regulators of plant development and the reaction to abiotic and biotic stress. Key enzymes in the cytokinin synthesis and degradation in modern land plants are the isopentyl transferases and the cytokinin dehydrogenases, respectively. Their encoding genes have been probably introduced into the plant lineage during the primary endosymbiosis. To shed light on the evolution of these proteins, the genes homologous to plant adenylate isopentenyl transferase and cytokinin dehydrogenase were amplified from the genomic DNA of cyanobacterium Nostoc sp. PCC 7120 and expressed in Escherichia coli. The putative isopentenyl transferase was shown to be functional in a biochemical assay. In contrast, no enzymatic activity was detected for the putative cytokinin dehydrogenase, even though the principal domains necessary for its function are present. Several mutant variants, in which conserved amino acids in land plant cytokinin dehydrogenases had been restored, were inactive. A combination of experimental data with phylogenetic analysis indicates that adenylate-type isopentenyl transferases might have evolved several times independently. While the Nostoc genome contains a gene coding for protein with characteristics of cytokinin dehydrogenase, the organism is not able to break down cytokinins in the way shown for land plants. PMID:26376297
Molecular characterization and genomic distribution of Isis: a new retrotransposon of Drosophila buzzatii.

PubMed

García Guerreiro, M P; Fontdevila, A

2007-01-01

A new transposable element, Isis, is identified as a LTR retrotransposon in Drosophila buzzatii. DNA sequence analysis shows that Isis contains three long ORFs similar to gag, pol and env genes of retroviruses. The ORF1 exhibits sequence homology to matrix, capsid and nucleocapsid gag proteins and ORF2 encodes a putative protease (PR), a reverse transcriptase (RT), an Rnase H (RH) and an integrase (IN) region. The analysis of a putative env product, encoded by the env ORF3, shows a degenerated protein containing several stop codons. The molecular study of the putative proteins coded by this new element shows striking similarities to both Ulysses and Osvaldo elements, two LTR retrotransposons, present in D. virilis and D. buzzatii, respectively. Comparisons of the predicted Isis RT to several known retrotransposons show strong phylogenetic relationships to gypsy-like elements, particulary to Ulysses retrotransposon. Studies of Isis chromosomal distribution show a strong hybridization signal in centromeric and pericentromeric regions, and a scattered distribution along all chromosomal arms. The existence of insertional polymorphisms between different strains and high molecular weight bands by Southern blot suggests the existence of full-sized copies that have been active recently. The presence of euchromatic insertion sites coincident between Isis and Osvaldo could indicate preferential insertion sites of Osvaldo element into Isis sequence or vice versa. Moreover, the presence of Isis in different species of the buzzatii complex indicates the ancient origin of this element.

Cloning, overexpression and interaction of recombinant Fur from the cyanobacterium Anabaena PCC 7119 with isiB and its own promoter.

PubMed

Bes, M T; Hernández, J A; Peleato, M L; Fillat, M F

2001-01-15

A gene coding for a Fur (ferric uptake regulation) protein from the cyanobacterium Anabaena PCC 7119 has been cloned and overexpressed in Escherichia coli. DNA sequence analysis confirmed the presence of a 151-amino-acid open reading frame that showed homology with the Fur proteins reported for the unicellular cyanobacteria Synechococcus 7942 and Synechocystis PCC 6803. Two putative Fur-binding sites were detected in the promoter regions of the fur gene from Anabaena. Partially purified recombinant Fur binds to the flavodoxin promoter as well as its own promoter. This suggests that the Fur gene is autoregulated in Anabaena.
Complete coding sequence characterization and comparative analysis of the putative novel human rhinovirus (HRV) species C and B

PubMed Central

2011-01-01

Background Human Rhinoviruses (HRVs) are well recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. Result To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as nonstructural proteins 3C and 3 D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated to their replication and capability to adapt to the high temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211) with a crossing over having taken place at the boundary of VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. Conclusion This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution. PMID:21214911
Complete coding sequence characterization and comparative analysis of the putative novel human rhinovirus (HRV) species C and B.

PubMed

Linsuwanon, Piyada; Payungporn, Sunchai; Suwannakarn, Kamol; Chieochansin, Thaweesak; Theamboonlers, Apiradee; Poovorawan, Yong

2011-01-07

Human Rhinoviruses (HRVs) are well recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as nonstructural proteins 3C and 3 D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated to their replication and capability to adapt to the high temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211) with a crossing over having taken place at the boundary of VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution.
De Novo ORFs in Drosophila Are Important to Organismal Fitness and Evolved Rapidly from Previously Non-coding Sequences

PubMed Central

Reinhardt, Josephine A.; Wanjiru, Betty M.; Brant, Alicia T.; Saelao, Perot; Begun, David J.; Jones, Corbin D.

2013-01-01

How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important. PMID:24146629
Applying functional metagenomics to search for novel lignocellulosic enzymes in a microbial consortium derived from a thermophilic composting phase of sugarcane bagasse and cow manure.

PubMed

Colombo, Lívia Tavares; de Oliveira, Marcelo Nagem Valério; Carneiro, Deisy Guimarães; de Souza, Robson Assis; Alvim, Mariana Caroline Tocantins; Dos Santos, Josenilda Carlos; da Silva, Cynthia Canêdo; Vidigal, Pedro Marcus Pereira; da Silveira, Wendel Batista; Passos, Flávia Maria Lopes

2016-09-01

Environments where lignocellulosic biomass is naturally decomposed are sources for discovery of new hydrolytic enzymes that can reduce the high cost of enzymatic cocktails for second-generation ethanol production. Metagenomic analysis was applied to discover genes coding carbohydrate-depleting enzymes from a microbial laboratory subculture using a mix of sugarcane bagasse and cow manure in the thermophilic composting phase. From a fosmid library, 182 clones had the ability to hydrolyse carbohydrate. Sequencing of 30 fosmids resulted in 12 contigs encoding 34 putative carbohydrate-active enzymes belonging to 17 glycosyl hydrolase (GH) families. One third of the putative proteins belong to the GH3 family, which includes β-glucosidase enzymes known to be important in the cellulose-deconstruction process but present with low activity in commercial enzyme preparations. Phylogenetic analysis of the amino acid sequences of seven selected proteins, including three β-glucosidases, showed low relatedness with protein sequences deposited in databases. These findings highlight microbial consortia obtained from a mixture of decomposing biomass residues, such as sugar cane bagasse and cow manure, as a rich resource of novel enzymes potentially useful in biotechnology for saccharification of lignocellulosic substrate.
Identification and Characterization of Long Non-Coding RNAs Related to Mouse Embryonic Brain Development from Available Transcriptomic Data

PubMed Central

He, Hongjuan; Xiu, Youcheng; Guo, Jing; Liu, Hui; Liu, Qi; Zeng, Tiebo; Chen, Yan; Zhang, Yan; Wu, Qiong

2013-01-01

Long non-coding RNAs (lncRNAs) as a key group of non-coding RNAs have gained widely attention. Though lncRNAs have been functionally annotated and systematic explored in higher mammals, few are under systematical identification and annotation. Owing to the expression specificity, known lncRNAs expressed in embryonic brain tissues remain still limited. Considering a large number of lncRNAs are only transcribed in brain tissues, studies of lncRNAs in developmental brain are therefore of special interest. Here, publicly available RNA-sequencing (RNA-seq) data in embryonic brain are integrated to identify thousands of embryonic brain lncRNAs by a customized pipeline. A significant proportion of novel transcripts have not been annotated by available genomic resources. The putative embryonic brain lncRNAs are shorter in length, less spliced and show less conservation than known genes. The expression of putative lncRNAs is in one tenth on average of known coding genes, while comparable with known lncRNAs. From chromatin data, putative embryonic brain lncRNAs are associated with active chromatin marks, comparable with known lncRNAs. Embryonic brain expressed lncRNAs are also indicated to have expression though not evident in adult brain. Gene Ontology analysis of putative embryonic brain lncRNAs suggests that they are associated with brain development. The putative lncRNAs are shown to be related to possible cis-regulatory roles in imprinting even themselves are deemed to be imprinted lncRNAs. Re-analysis of one knockdown data suggests that four regulators are associated with lncRNAs. Taken together, the identification and systematic analysis of putative lncRNAs would provide novel insights into uncharacterized mouse non-coding regions and the relationships with mammalian embryonic brain development. PMID:23967161
The putative drug efflux systems of the Bacillus cereus group

PubMed Central

Elbourne, Liam D. H.; Vörös, Aniko; Kroeger, Jasmin K.; Simm, Roger; Tourasse, Nicolas J.; Finke, Sarah; Henderson, Peter J. F.; Økstad, Ole Andreas; Paulsen, Ian T.; Kolstø, Anne-Brit

2017-01-01

The Bacillus cereus group of bacteria includes seven closely related species, three of which, B. anthracis, B. cereus and B. thuringiensis, are pathogens of humans, animals and/or insects. Preliminary investigations into the transport capabilities of different bacterial lineages suggested that genes encoding putative efflux systems were unusually abundant in the B. cereus group compared to other bacteria. To explore the drug efflux potential of the B. cereus group all putative efflux systems were identified in the genomes of prototypical strains of B. cereus, B. anthracis and B. thuringiensis using our Transporter Automated Annotation Pipeline. More than 90 putative drug efflux systems were found within each of these strains, accounting for up to 2.7% of their protein coding potential. Comparative analyses demonstrated that the efflux systems are highly conserved between these species; 70–80% of the putative efflux pumps were shared between all three strains studied. Furthermore, 82% of the putative efflux system proteins encoded by the prototypical B. cereus strain ATCC 14579 (type strain) were found to be conserved in at least 80% of 169 B. cereus group strains that have high quality genome sequences available. However, only a handful of these efflux pumps have been functionally characterized. Deletion of individual efflux pump genes from B. cereus typically had little impact to drug resistance phenotypes or the general fitness of the strains, possibly because of the large numbers of alternative efflux systems that may have overlapping substrate specificities. Therefore, to gain insight into the possible transport functions of efflux systems in B. cereus, we undertook large-scale qRT-PCR analyses of efflux pump gene expression following drug shocks and other stress treatments. Clustering of gene expression changes identified several groups of similarly regulated systems that may have overlapping drug resistance functions. In this article we review current knowledge of the small molecule efflux pumps encoded by the B. cereus group and suggest the likely functions of numerous uncharacterised pumps. PMID:28472044
Characterization of mitochondrial genome of sea cucumber Stichopus horrens: a novel gene arrangement in Holothuroidea.

PubMed

Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing

2011-05-01

The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16257 bps long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA ( Ser(UCN) ), tRNA ( Gln ), tRNA ( Ala ), tRNA ( Val ), tRNA ( Asp )) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew=0.025; GC skew=-0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein coding genes except nad3 and nad4 that use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.
Plasmid AZOBR_p1-borne fabG gene for putative 3-oxoacyl-[acyl-carrier protein] reductase is essential for proper assembly and work of the dual flagellar system in the alphaproteobacterium Azospirillum brasilense Sp245.

PubMed

Filip'echeva, Yulia A; Shelud'ko, Andrei V; Prilipov, Alexei G; Burygin, Gennady L; Telesheva, Elizaveta M; Yevstigneyeva, Stella S; Chernyshova, Marina P; Petrova, Lilia P; Katsy, Elena I

2018-02-01

Azospirillum brasilense can swim and swarm owing to the activity of a constitutive polar flagellum (Fla) and inducible lateral flagella (Laf), respectively. Experimental data on the regulation of the Fla and Laf assembly in azospirilla are scarce. Here, the coding sequence (CDS) AZOBR_p1160043 (fabG1) for a putative 3-oxoacyl-[acyl-carrier protein (ACP)] reductase was found essential for the construction of both types of flagella. In an immotile leaky Fla - Laf - fabG1::Omegon-Km mutant, Sp245.1610, defects in flagellation and motility were fully complemented by expressing the CDS AZOBR_p1160043 from plasmid pRK415. When pRK415 with the cloned CDS AZOBR_p1160045 (fliC) for a putative 65.2 kDa Sp245 Fla flagellin was transferred into the Sp245.1610 cells, the bacteria also became able to assemble a motile single flagellum. Some cells, however, had unusual swimming behavior, probably because of the side location of the organelle. Although the assembly of Laf was not restored in Sp245.1610 (pRK415-p1160045), this strain was somewhat capable of swarming motility. We propose that the putative 3-oxoacyl-[ACP] reductase encoded by the CDS AZOBR_p1160043 plays a role in correct flagellar location in the cell envelope and (or) in flagellar modification(s), which are also required for the inducible construction of Laf and for proper swimming and swarming motility of A. brasilense Sp245.
Integrative analyses of transcriptome sequencing identify novel functional lncRNAs in esophageal squamous cell carcinoma.

PubMed

Li, C-Q; Huang, G-W; Wu, Z-Y; Xu, Y-J; Li, X-C; Xue, Y-J; Zhu, Y; Zhao, J-M; Li, M; Zhang, J; Wu, J-Y; Lei, F; Wang, Q-Y; Li, S; Zheng, C-P; Ai, B; Tang, Z-D; Feng, C-C; Liao, L-D; Wang, S-H; Shen, J-H; Liu, Y-J; Bai, X-F; He, J-Z; Cao, H-H; Wu, B-L; Wang, M-R; Lin, D-C; Koeffler, H P; Wang, L-D; Li, X; Li, E-M; Xu, L-Y

2017-02-13

Long non-coding RNAs (lncRNAs) have a critical role in cancer initiation and progression, and thus may mediate oncogenic or tumor suppressing effects, as well as be a new class of cancer therapeutic targets. We performed high-throughput sequencing of RNA (RNA-seq) to investigate the expression level of lncRNAs and protein-coding genes in 30 esophageal samples, comprised of 15 esophageal squamous cell carcinoma (ESCC) samples and their 15 paired non-tumor tissues. We further developed an integrative bioinformatics method, denoted URW-LPE, to identify key functional lncRNAs that regulate expression of downstream protein-coding genes in ESCC. A number of known onco-lncRNA and many putative novel ones were effectively identified by URW-LPE. Importantly, we identified lncRNA625 as a novel regulator of ESCC cell proliferation, invasion and migration. ESCC patients with high lncRNA625 expression had significantly shorter survival time than those with low expression. LncRNA625 also showed specific prognostic value for patients with metastatic ESCC. Finally, we identified E1A-binding protein p300 (EP300) as a downstream executor of lncRNA625-induced transcriptional responses. These findings establish a catalog of novel cancer-associated functional lncRNAs, which will promote our understanding of lncRNA-mediated regulation in this malignancy.
The human fatty acid-binding protein family: Evolutionary divergences and functions

PubMed Central

2011-01-01

Fatty acid-binding proteins (FABPs) are members of the intracellular lipid-binding protein (iLBP) family and are involved in reversibly binding intracellular hydrophobic ligands and trafficking them throughout cellular compartments, including the peroxisomes, mitochondria, endoplasmic reticulum and nucleus. FABPs are small, structurally conserved cytosolic proteins consisting of a water-filled, interior-binding pocket surrounded by ten anti-parallel beta sheets, forming a beta barrel. At the superior surface, two alpha-helices cap the pocket and are thought to regulate binding. FABPs have broad specificity, including the ability to bind long-chain (C16-C20) fatty acids, eicosanoids, bile salts and peroxisome proliferators. FABPs demonstrate strong evolutionary conservation and are present in a spectrum of species including Drosophila melanogaster, Caenorhabditis elegans, mouse and human. The human genome consists of nine putatively functional protein-coding FABP genes. The most recently identified family member, FABP12, has been less studied. PMID:21504868
Hyperactive antifreeze proteins from longhorn beetles: some structural insights.

PubMed

Kristiansen, Erlend; Wilkens, Casper; Vincents, Bjarne; Friis, Dennis; Lorentzen, Anders Blomkild; Jenssen, Håvard; Løbner-Olesen, Anders; Ramløv, Hans

2012-11-01

This study reports on structural characteristics of hyperactive antifreeze proteins (AFPs) from two species of longhorn beetles. In Rhagium mordax, eight unique mRNAs coding for five different mature AFPs were identified from cold-hardy individuals. These AFPs are apparently homologues to a previously characterized AFP from the closely related species Rhagium inquisitor, and consist of six identifiable repeats of a putative ice binding motif TxTxTxT spaced irregularly apart by segments varying in length from 13 to 20 residues. Circular dichroism spectra show that the AFPs from both species have a high content of β-sheet and low levels of α-helix and random coil. Theoretical predictions of residue-specific secondary structure locate these β-sheets within the putative ice-binding motifs and the central parts of the segments separating them, consistent with an overall β-helical structure with the ice-binding motifs stacked in a β-sheet on one side of the coil. Molecular dynamics models based on these findings show that these AFPs would be energetically stable in a β-helical conformation. Copyright © 2012 Elsevier Ltd. All rights reserved.
A germin-like protein with superoxide dismutase activity in pea nodules with high protein sequence identity to a putative rhicadhesin receptor.

PubMed

Gucciardo, Sébastian; Wisniewski, Jean-Pierre; Brewin, Nicholas J; Bornemann, Stephen

2007-01-01

The cDNAs encoding three germin-like proteins (PsGER1, PsGER2a, and PsGER2b) were isolated from Pisum sativum. The coding sequence of PsGER1 transiently expressed in tobacco leaves gave a protein with superoxide dismutase activity but no detectable oxalate oxidase activity according to in-gel activity stains. The transient expression of wheat germin gf-2.8 oxalate oxidase showed oxalate oxidase but no superoxide dismutase activity under the same conditions. The superoxide dismutase activity of PsGER1 was resistant to high temperature, denaturation by detergent, and high concentrations of hydrogen peroxide. In salt-stressed pea roots, a heat-resistant superoxide dismutase activity was observed with an electrophoretic mobility similar to that of the PsGER1 protein, but this activity was below the detection limit in non-stressed or H(2)O(2)-stressed pea roots. Oxalate oxidase activity was not detected in either pea roots or nodules. Following in situ hybridization in developing pea nodules, PsGER1 transcript was detected in expanding cells just proximal to the meristematic zone and also in the epidermis, but to a lesser extent. PsGER1 is the first known germin-like protein with superoxide dismutase activity to be associated with nodules. It shared protein sequence identity with the N-terminal sequence of a putative plant receptor for rhicadhesin, a bacterial attachment protein. However, its primary location in nodules suggests functional roles other than as a rhicadhesin receptor required for the first stage of bacterial attachment to root hairs.
Discovery of numerous novel small genes in the intergenic regions of the Escherichia coli O157:H7 Sakai genome

PubMed Central

Hücker, Sarah M.; Ardern, Zachary; Goldberg, Tatyana; Schafferhans, Andrea; Bernhofer, Michael; Vestergaard, Gisle; Nelson, Chase W.; Schloter, Michael; Rost, Burkhard; Scherer, Siegfried

2017-01-01

In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set. PMID:28902868
Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection

PubMed Central

Mesquita, Rafael D.; Vionette-Amaral, Raquel J.; Lowenberger, Carl; Rivera-Pomar, Rolando; Monteiro, Fernando A.; Minx, Patrick; Spieth, John; Carvalho, A. Bernardo; Panzera, Francisco; Lawson, Daniel; Torres, André Q.; Ribeiro, Jose M. C.; Sorgine, Marcos H. F.; Waterhouse, Robert M.; Abad-Franch, Fernando; Alves-Bezerra, Michele; Amaral, Laurence R.; Araujo, Helena M.; Aravind, L.; Atella, Georgia C.; Azambuja, Patricia; Berni, Mateus; Bittencourt-Cunha, Paula R.; Braz, Gloria R. C.; Calderón-Fernández, Gustavo; Carareto, Claudia M. A.; Christensen, Mikkel B.; Costa, Igor R.; Costa, Samara G.; Dansa, Marilvia; Daumas-Filho, Carlos R. O.; De-Paula, Iron F.; Dias, Felipe A.; Dimopoulos, George; Emrich, Scott J.; Esponda-Behrens, Natalia; Fampa, Patricia; Fernandez-Medina, Rita D.; da Fonseca, Rodrigo N.; Fontenele, Marcio; Fronick, Catrina; Fulton, Lucinda A.; Gandara, Ana Caroline; Garcia, Eloi S.; Genta, Fernando A.; Giraldo-Calderón, Gloria I.; Gomes, Bruno; Gondim, Katia C.; Granzotto, Adriana; Guarneri, Alessandra A.; Guigó, Roderic; Harry, Myriam; Hughes, Daniel S. T.; Jablonka, Willy; Jacquin-Joly, Emmanuelle; Juárez, M. Patricia; Koerich, Leonardo B.; Lange, Angela B.; Latorre-Estivalis, José Manuel; Lavore, Andrés; Lawrence, Gena G.; Lazoski, Cristiano; Lazzari, Claudio R.; Lopes, Raphael R.; Lorenzo, Marcelo G.; Lugon, Magda D.; Marcet, Paula L.; Mariotti, Marco; Masuda, Hatisaburo; Megy, Karine; Missirlis, Fanis; Mota, Theo; Noriega, Fernando G.; Nouzova, Marcela; Nunes, Rodrigo D.; Oliveira, Raquel L. L.; Oliveira-Silveira, Gilbert; Ons, Sheila; Orchard, Ian; Pagola, Lucia; Paiva-Silva, Gabriela O.; Pascual, Agustina; Pavan, Marcio G.; Pedrini, Nicolás; Peixoto, Alexandre A.; Pereira, Marcos H.; Pike, Andrew; Polycarpo, Carla; Prosdocimi, Francisco; Ribeiro-Rodrigues, Rodrigo; Robertson, Hugh M.; Salerno, Ana Paula; Salmon, Didier; Santesmasses, Didac; Schama, Renata; Seabra-Junior, Eloy S.; Silva-Cardoso, Livia; Silva-Neto, Mario A. C.; Souza-Gomes, Matheus; Sterkel, Marcos; Taracena, Mabel L.; Tojo, Marta; Tu, Zhijian Jake; Tubio, Jose M. C.; Ursic-Bedoya, Raul; Venancio, Thiago M.; Walter-Nuno, Ana Beatriz; Wilson, Derek; Warren, Wesley C.; Wilson, Richard K.; Huebner, Erwin; Dotson, Ellen M.; Oliveira, Pedro L.

2015-01-01

Rhodnius prolixus not only has served as a model organism for the study of insect physiology, but also is a major vector of Chagas disease, an illness that affects approximately seven million people worldwide. We sequenced the genome of R. prolixus, generated assembled sequences covering 95% of the genome (∼702 Mb), including 15,456 putative protein-coding genes, and completed comprehensive genomic analyses of this obligate blood-feeding insect. Although immune-deficiency (IMD)-mediated immune responses were observed, R. prolixus putatively lacks key components of the IMD pathway, suggesting a reorganization of the canonical immune signaling network. Although both Toll and IMD effectors controlled intestinal microbiota, neither affected Trypanosoma cruzi, the causal agent of Chagas disease, implying the existence of evasion or tolerance mechanisms. R. prolixus has experienced an extensive loss of selenoprotein genes, with its repertoire reduced to only two proteins, one of which is a selenocysteine-based glutathione peroxidase, the first found in insects. The genome contained actively transcribed, horizontally transferred genes from Wolbachia sp., which showed evidence of codon use evolution toward the insect use pattern. Comparative protein analyses revealed many lineage-specific expansions and putative gene absences in R. prolixus, including tandem expansions of genes related to chemoreception, feeding, and digestion that possibly contributed to the evolution of a blood-feeding lifestyle. The genome assembly and these associated analyses provide critical information on the physiology and evolution of this important vector species and should be instrumental for the development of innovative disease control methods. PMID:26627243
Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection.

PubMed

Mesquita, Rafael D; Vionette-Amaral, Raquel J; Lowenberger, Carl; Rivera-Pomar, Rolando; Monteiro, Fernando A; Minx, Patrick; Spieth, John; Carvalho, A Bernardo; Panzera, Francisco; Lawson, Daniel; Torres, André Q; Ribeiro, Jose M C; Sorgine, Marcos H F; Waterhouse, Robert M; Montague, Michael J; Abad-Franch, Fernando; Alves-Bezerra, Michele; Amaral, Laurence R; Araujo, Helena M; Araujo, Ricardo N; Aravind, L; Atella, Georgia C; Azambuja, Patricia; Berni, Mateus; Bittencourt-Cunha, Paula R; Braz, Gloria R C; Calderón-Fernández, Gustavo; Carareto, Claudia M A; Christensen, Mikkel B; Costa, Igor R; Costa, Samara G; Dansa, Marilvia; Daumas-Filho, Carlos R O; De-Paula, Iron F; Dias, Felipe A; Dimopoulos, George; Emrich, Scott J; Esponda-Behrens, Natalia; Fampa, Patricia; Fernandez-Medina, Rita D; da Fonseca, Rodrigo N; Fontenele, Marcio; Fronick, Catrina; Fulton, Lucinda A; Gandara, Ana Caroline; Garcia, Eloi S; Genta, Fernando A; Giraldo-Calderón, Gloria I; Gomes, Bruno; Gondim, Katia C; Granzotto, Adriana; Guarneri, Alessandra A; Guigó, Roderic; Harry, Myriam; Hughes, Daniel S T; Jablonka, Willy; Jacquin-Joly, Emmanuelle; Juárez, M Patricia; Koerich, Leonardo B; Lange, Angela B; Latorre-Estivalis, José Manuel; Lavore, Andrés; Lawrence, Gena G; Lazoski, Cristiano; Lazzari, Claudio R; Lopes, Raphael R; Lorenzo, Marcelo G; Lugon, Magda D; Majerowicz, David; Marcet, Paula L; Mariotti, Marco; Masuda, Hatisaburo; Megy, Karine; Melo, Ana C A; Missirlis, Fanis; Mota, Theo; Noriega, Fernando G; Nouzova, Marcela; Nunes, Rodrigo D; Oliveira, Raquel L L; Oliveira-Silveira, Gilbert; Ons, Sheila; Orchard, Ian; Pagola, Lucia; Paiva-Silva, Gabriela O; Pascual, Agustina; Pavan, Marcio G; Pedrini, Nicolás; Peixoto, Alexandre A; Pereira, Marcos H; Pike, Andrew; Polycarpo, Carla; Prosdocimi, Francisco; Ribeiro-Rodrigues, Rodrigo; Robertson, Hugh M; Salerno, Ana Paula; Salmon, Didier; Santesmasses, Didac; Schama, Renata; Seabra-Junior, Eloy S; Silva-Cardoso, Livia; Silva-Neto, Mario A C; Souza-Gomes, Matheus; Sterkel, Marcos; Taracena, Mabel L; Tojo, Marta; Tu, Zhijian Jake; Tubio, Jose M C; Ursic-Bedoya, Raul; Venancio, Thiago M; Walter-Nuno, Ana Beatriz; Wilson, Derek; Warren, Wesley C; Wilson, Richard K; Huebner, Erwin; Dotson, Ellen M; Oliveira, Pedro L

2015-12-01

Rhodnius prolixus not only has served as a model organism for the study of insect physiology, but also is a major vector of Chagas disease, an illness that affects approximately seven million people worldwide. We sequenced the genome of R. prolixus, generated assembled sequences covering 95% of the genome (∼ 702 Mb), including 15,456 putative protein-coding genes, and completed comprehensive genomic analyses of this obligate blood-feeding insect. Although immune-deficiency (IMD)-mediated immune responses were observed, R. prolixus putatively lacks key components of the IMD pathway, suggesting a reorganization of the canonical immune signaling network. Although both Toll and IMD effectors controlled intestinal microbiota, neither affected Trypanosoma cruzi, the causal agent of Chagas disease, implying the existence of evasion or tolerance mechanisms. R. prolixus has experienced an extensive loss of selenoprotein genes, with its repertoire reduced to only two proteins, one of which is a selenocysteine-based glutathione peroxidase, the first found in insects. The genome contained actively transcribed, horizontally transferred genes from Wolbachia sp., which showed evidence of codon use evolution toward the insect use pattern. Comparative protein analyses revealed many lineage-specific expansions and putative gene absences in R. prolixus, including tandem expansions of genes related to chemoreception, feeding, and digestion that possibly contributed to the evolution of a blood-feeding lifestyle. The genome assembly and these associated analyses provide critical information on the physiology and evolution of this important vector species and should be instrumental for the development of innovative disease control methods.
Molecular modelling of the Norrie disease protein predicts a cystine knot growth factor tertiary structure.

PubMed

Meitinger, T; Meindl, A; Bork, P; Rost, B; Sander, C; Haasemann, M; Murken, J

1993-12-01

The X-lined gene for Norrie disease, which is characterized by blindness, deafness and mental retardation has been cloned recently. This gene has been thought to code for a putative extracellular factor; its predicted amino acid sequence is homologous to the C-terminal domain of diverse extracellular proteins. Sequence pattern searches and three-dimensional modelling now suggest that the Norrie disease protein (NDP) has a tertiary structure similar to that of transforming growth factor beta (TGF beta). Our model identifies NDP as a member of an emerging family of growth factors containing a cystine knot motif, with direct implications for the physiological role of NDP. The model also sheds light on sequence related domains such as the C-terminal domain of mucins and of von Willebrand factor.
An insight into the sialome of the blood-sucking bug Triatoma infestans, a vector of Chagas' disease

PubMed Central

Assumpção, Teresa C. F.; Francischetti, Ivo M. B.; Andersen, John F.; Schwarz, Alexandra; Santana, Jaime M.; Ribeiro, José M. C.

2008-01-01

Triatoma infestans is a hemiptera, vector of Chagas’ disease, that feeds exclusively on vertebrate blood in all life stages. Hematophagous insects’ salivary glands (SG) produce potent pharmacological compounds that counteract host hemostasis, including anti-clotting, anti-platelet, and vasodilatory molecules. To obtain a further insight into the salivary biochemical and pharmacological complexity of this insect, a cDNA library from its salivary glands was randomly sequenced. Also, salivary proteins were submitted to two dimentional gel (2D-gel) electrophoresis followed by MS analysis. We present the analysis of a set of 1,534 (SG) cDNA sequences, 645 of which coded for proteins of a putative secretory nature. Most salivary proteins described as lipocalins matched peptide sequences obtained from proteomic results. PMID:18207082
Long non-coding RNA discovery across the genus anopheles reveals conserved secondary structures within and beyond the Gambiae complex.

PubMed

Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T

2015-04-23

Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus

PubMed Central

2010-01-01

Background Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity we carried out a high-throughput sequence analysis of freshly collected B. azoricus transcriptome using gills tissues as the primary source of immune transcripts given its strategic role in filtering the surrounding waterborne potentially infectious microorganisms. Additionally, a substantial EST data set was produced and from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent" the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology. Results A normalized cDNA library from gills tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high quality reads resulted in 75,407 contigs of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino-sequences of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned with Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative quantitative Reverse-Transcription-Polymerase Chain Reactions (RT-PCR) and their RNA transcription level by quantitative PCR (qPCR) experiments. Conclusions We have established the first tissue transcriptional analysis of a deep-sea hydrothermal vent animal and generated a searchable catalog of genes that provides a direct method of identifying and retrieving vast numbers of novel coding sequences which can be applied in gene expression profiling experiments from a non-conventional model organism. This provides the most comprehensive sequence resource for identifying novel genes currently available for a deep-sea vent organism, in particular, genes putatively involved in immune and inflammatory reactions in vent mussels. The characterization of the B. azoricus transcriptome will facilitate research into biological processes underlying physiological adaptations to hydrothermal vent environments and will provide a basis for expanding our understanding of genes putatively involved in adaptations processes during post-capture long term acclimatization experiments, at "sea-level" conditions, using B. azoricus as a model organism. PMID:20937131

Identification and analysis of unitary loss of long-established protein-coding genes in Poaceae shows evidences for biased gene loss and putatively functional transcription of relics.

PubMed

Zhao, Yi; Tang, Liang; Li, Zhe; Jin, Jinpu; Luo, Jingchu; Gao, Ge

2015-04-18

Long-established protein-coding genes may lose their coding potential during evolution ("unitary gene loss"). Members of the Poaceae family are a major food source and represent an ideal model clade for plant evolution research. However, the global pattern of unitary gene loss in Poaceae genomes as well as the evolutionary fate of lost genes are still less-investigated and remain largely elusive. Using a locally developed pipeline, we identified 129 unitary gene loss events for long-established protein-coding genes from four representative species of Poaceae, i.e. brachypodium, rice, sorghum and maize. Functional annotation suggested that the lost genes in all or most of Poaceae species are enriched for genes involved in development and response to endogenous stimulus. We also found that 44 mutated genomic loci of lost genes, which we referred as relics, were still actively transcribed, and of which 84% (37 of 44) showed significantly differential expression across different tissues. More interestingly, we found that there were totally five expressed relics may function as competitive endogenous RNA in brachypodium, rice and sorghum genome. Based on comparative genomics and transcriptome data, we firstly compiled a comprehensive catalogue of unitary gene loss events in Poaceae species and characterized a statistically significant functional preference for these lost genes as well showed the potential of relics functioning as competitive endogenous RNAs in Poaceae genomes.
Crystal structure of the YDR533c S. cerevisiae protein, a class II member of the Hsp31 family.

PubMed

Graille, Marc; Quevillon-Cheruel, Sophie; Leulliot, Nicolas; Zhou, Cong-Zhao; Li de la Sierra Gallay, Ines; Jacquamet, Lilian; Ferrer, Jean-Luc; Liger, Dominique; Poupon, Anne; Janin, Joel; van Tilbeurgh, Herman

2004-05-01

The ORF YDR533c from Saccharomyces cerevisiae codes for a 25.5 kDa protein of unknown biochemical function. Transcriptome analysis of yeast has shown that this gene is activated in response to various stress conditions together with proteins belonging to the heat shock family. In order to clarify its biochemical function, we determined the crystal structure of YDR533c to 1.85 A resolution by the single anomalous diffraction method. The protein possesses an alpha/beta hydrolase fold and a putative Cys-His-Glu catalytic triad common to a large enzyme family containing proteases, amidotransferases, lipases, and esterases. The protein has strong structural resemblance with the E. coli Hsp31 protein and the intracellular protease I from Pyrococcus horikoshii, which are considered class I and class III members of the Hsp31 family, respectively. Detailed structural analysis strongly suggests that the YDR533c protein crystal structure is the first one of a class II member of the Hsp31 family.
Comparison of the protein-coding gene content of Chlamydia trachomatis and Protochlamydia amoebophila using a Raspberry Pi computer.

PubMed

Robson, James F; Barker, Daniel

2015-10-13

To demonstrate the bioinformatics capabilities of a low-cost computer, the Raspberry Pi, we present a comparison of the protein-coding gene content of two species in phylum Chlamydiae: Chlamydia trachomatis, a common sexually transmitted infection of humans, and Candidatus Protochlamydia amoebophila, a recently discovered amoebal endosymbiont. Identifying species-specific proteins and differences in protein families could provide insights into the unique phenotypes of the two species. Using a Raspberry Pi computer, sequence similarity-based protein families were predicted across the two species, C. trachomatis and P. amoebophila, and their members counted. Examples include nine multi-protein families unique to C. trachomatis, 132 multi-protein families unique to P. amoebophila and one family with multiple copies in both. Most families unique to C. trachomatis were polymorphic outer-membrane proteins. Additionally, multiple protein families lacking functional annotation were found. Predicted functional interactions suggest one of these families is involved with the exodeoxyribonuclease V complex. The Raspberry Pi computer is adequate for a comparative genomics project of this scope. The protein families unique to P. amoebophila may provide a basis for investigating the host-endosymbiont interaction. However, additional species should be included; and further laboratory research is required to identify the functions of unknown or putative proteins. Multiple outer membrane proteins were found in C. trachomatis, suggesting importance for host evasion. The tyrosine transport protein family is shared between both species, with four proteins in C. trachomatis and two in P. amoebophila. Shared protein families could provide a starting point for discovery of wide-spectrum drugs against Chlamydiae.
The complete mitochondrial genome of Sika deer Cervus nippon hortulorum (Artiodactyla: Cervidae) and phylogenetic studies.

PubMed

Liu, Yan-Hua; Liu, Xin-Xin; Zhang, Ming-Hai

2016-07-01

Sika deer (Cervus nippon Temminck 1836) are classified in the order Artiodactyla, family Cervidae, subfamily Cervinae. At present, the phylogenetic studies of C. nippon are problematic. In this study, we first determined and described the complete mitochondrial sequence of the wild C. nippon hortulorum. The complete mitogenome sequence is 16 566 bp in length, including 13 protein-coding genes, two rRNA genes, 22 tRNA genes, a putative control region (CR) and a light-strand replication origin (OL). The overall base composition was 33.4% A, 28.6% T, 24.5% C, 13.5% G, with a 62.0% AT bias. The 13 protein-coding genes encode 3782 amino acids in total. To further validate the new determined sequences and phylogeny of Sika deer, phylogenetic trees involving 15 most closely related species available in GenBank database were constructed. These results are expected to provide useful molecular data for deer species identification and further phylogenetic studies of Artiodactyla.
Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).

PubMed

Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang

2016-07-01

The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.
Long non-coding RNAs are associated with spatiotemporal gene expression profiles in the marine gastropod Tegula atra.

PubMed

Détrée, Camille; Núñez-Acuña, Gustavo; Tapia, Fabian; Gallardo-Escárate, Cristian

2017-06-01

Increasing evidence suggests that long non-coding RNAs (lncRNAs) play diverse roles in cellular processes, including in the regulation of embryogenesis and growth. However, little is known about the role of lncRNAs in marine invertebrates inhabiting changing environments. Therefore, the aim of this study was to present the first characterization of lncRNAs in an intertidal marine gastropod. Specifically, Tegula atra individuals were sampled in four sites of the central-northern Chilean coastline (28-31°) during summer and winter. A pipeline was constructed, and 3524 putative lncRNAs were identified from transcriptome databases specific to T. atra. These lncRNAs exhibited characteristics common to known lncRNAs, including a length shorter than coding sequences, low GC-content, and low sequence conservation. Expression analyses revealed that lncRNAs varied more in the summer. Furthermore, a majority of the differentially expressed lncRNAs were found in the southernmost population, the seasonal temperatures of which varied the greatest among all groups. Additionally, co-expression analysis found some lncRNAs strongly correlated with coding genes involved in the environmental stress response, such as heat shock proteins and metalloproteins. In contrast, other lncRNA expressions were strongly uncorrelated with genes involved in lipid/carbohydrates metabolism and cell-cell communication. This study provides the first large-scale characterization of lncRNAs in a marine gastropod, with results suggesting a putative role of lncRNAs in thermal tolerance, as well as an association with molecular mechanisms involved in the local adaptations of marine invertebrate populations. Copyright © 2017 Elsevier B.V. All rights reserved.
Positional cloning of a gene responsible for the cts mutation of the silkworm, Bombyx mori.

PubMed

Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Shimada, Toru; Yamamoto, Kimiko; Mita, Kazuei; Kadono-Okuda, Keiko

2012-07-01

The larval head cuticle and anal plates of the silkworm mutant cheek and tail spot (cts) have chocolate-colored spots, unlike the entirely white appearance of the wild-type (WT) strain. We report the identification and characterization of the gene responsible for the cts mutation. Positional cloning revealed a cts candidate on chromosome 16, designated BmMFS, based on the high similarity of the deduced amino acid sequence between the candidate gene from the WT strain and the major facilitator superfamily (MFS) protein. BmMFS likely encodes a membrane protein with 11 putative transmembrane domains, while the putative structure deduced from the cts-type allele possesses only 10-pass transmembrane domains owing to a deletion in its coding region. Quantitative RT-PCR analysis showed that BmMFS mRNA was strongly expressed in the integument of the head and tail, where the cts phenotype is observed; expression markedly increased at the molting and newly ecdysed stages. These results indicate that the novel BmMFS gene is cts and the membrane structure of its protein accounts for the cts phenotype. These expression profiles and the cts phenotype are quite similar to those of melanin-related genes, such as Bmyellow-e and Bm-iAANAT, suggesting that BmMFS is involved in the melanin synthesis pathway.
PopF1 and PopF2, Two Proteins Secreted by the Type III Protein Secretion System of Ralstonia solanacearum, Are Translocators Belonging to the HrpF/NopX Family†

PubMed Central

Meyer, Damien; Cunnac, Sébastien; Guéneron, Mareva; Declercq, Céline; Van Gijsegem, Frédérique; Lauber, Emmanuelle; Boucher, Christian; Arlat, Matthieu

2006-01-01

Ralstonia solanacearum GMI1000 is a gram-negative plant pathogen which contains an hrp gene cluster which codes for a type III protein secretion system (TTSS). We identified two novel Hrp-secreted proteins, called PopF1 and PopF2, which display similarity to one another and to putative TTSS translocators, HrpF and NopX, from Xanthomonas spp. and rhizobia, respectively. They also show similarities with TTSS translocators of the YopB family from animal-pathogenic bacteria. Both popF1 and popF2 belong to the HrpB regulon and are required for the interaction with plants, but PopF1 seems to play a more important role in virulence and hypersensitive response (HR) elicitation than PopF2 under our experimental conditions. PopF1 and PopF2 are not necessary for the secretion of effector proteins, but they are required for the translocation of AvrA avirulence protein into tobacco cells. We conclude that PopF1 and PopF2 are type III translocators belonging to the HrpF/NopX family. The hrpF gene of Xanthomonas campestris pv. campestris partially restored HR-inducing ability to popF1 popF2 mutants of R. solanacearum, suggesting that translocators of R. solanacearum and Xanthomonas are functionally conserved. Finally, R. solanacearum strain UW551, which does not belong to the same phylotype as GMI1000, also possesses two putative translocator proteins. However, although one of these proteins is clearly related to PopF1 and PopF2, the other seems to be different and related to NopX proteins, thus showing that translocators might be variable in R. solanacearum. PMID:16788199
Biotin protein ligase from Corynebacterium glutamicum: role for growth and L: -lysine production.

PubMed

Peters-Wendisch, P; Stansen, K C; Götker, S; Wendisch, V F

2012-03-01

Corynebacterium glutamicum is a biotin auxotrophic Gram-positive bacterium that is used for large-scale production of amino acids, especially of L-glutamate and L-lysine. It is known that biotin limitation triggers L-glutamate production and that L-lysine production can be increased by enhancing the activity of pyruvate carboxylase, one of two biotin-dependent proteins of C. glutamicum. The gene cg0814 (accession number YP_225000) has been annotated to code for putative biotin protein ligase BirA, but the protein has not yet been characterized. A discontinuous enzyme assay of biotin protein ligase activity was established using a 105aa peptide corresponding to the carboxyterminus of the biotin carboxylase/biotin carboxyl carrier protein subunit AccBC of the acetyl CoA carboxylase from C. glutamicum as acceptor substrate. Biotinylation of this biotin acceptor peptide was revealed with crude extracts of a strain overexpressing the birA gene and was shown to be ATP dependent. Thus, birA from C. glutamicum codes for a functional biotin protein ligase (EC 6.3.4.15). The gene birA from C. glutamicum was overexpressed and the transcriptome was compared with the control strain revealing no significant gene expression changes of the bio-genes. However, biotin protein ligase overproduction increased the level of the biotin-containing protein pyruvate carboxylase and entailed a significant growth advantage in glucose minimal medium. Moreover, birA overexpression resulted in a twofold higher L-lysine yield on glucose as compared with the control strain.
Picornavirus Modification of a Host mRNA Decay Protein

PubMed Central

Rozovics, Janet M.; Chase, Amanda J.; Cathcart, Andrea L.; Chou, Wayne; Gershon, Paul D.; Palusa, Saiprasad; Wilusz, Jeffrey; Semler, Bert L.

2012-01-01

ABSTRACT Due to the limited coding capacity of picornavirus genomic RNAs, host RNA binding proteins play essential roles during viral translation and RNA replication. Here we describe experiments suggesting that AUF1, a host RNA binding protein involved in mRNA decay, plays a role in the infectious cycle of picornaviruses such as poliovirus and human rhinovirus. We observed cleavage of AUF1 during poliovirus or human rhinovirus infection, as well as interaction of this protein with the 5′ noncoding regions of these viral genomes. Additionally, the picornavirus proteinase 3CD, encoded by poliovirus or human rhinovirus genomic RNAs, was shown to cleave all four isoforms of recombinant AUF1 at a specific N-terminal site in vitro. Finally, endogenous AUF1 was found to relocalize from the nucleus to the cytoplasm in poliovirus-infected HeLa cells to sites adjacent to (but distinct from) putative viral RNA replication complexes. PMID:23131833
Predicted secondary structure similarity in the absence of primary amino acid sequence homology: hepatitis B virus open reading frames.

PubMed Central

Schaeffer, E; Sninsky, J J

1984-01-01

Proteins that are related evolutionarily may have diverged at the level of primary amino acid sequence while maintaining similar secondary structures. Computer analysis has been used to compare the open reading frames of the hepatitis B virus to those of the woodchuck hepatitis virus at the level of amino acid sequence, and to predict the relative hydrophilic character and the secondary structure of putative polypeptides. Similarity is seen at the levels of relative hydrophilicity and secondary structure, in the absence of sequence homology. These data reinforce the proposal that these open reading frames encode viral proteins. Computer analysis of this type can be more generally used to establish structural similarities between proteins that do not share obvious sequence homology as well as to assess whether an open reading frame is fortuitous or codes for a protein. PMID:6585835
The complete mitochondrial genome sequence of Aesopia cornuta (Pleuronectiformes: Soleidae).

PubMed

Wang, Shu-Ying; Shi, Wei; Wang, Zhong-Ming; Gong, Li; Kong, Xiao-Yu

2015-02-01

Aesopia cornuta belongs to the family Soleidae of Pleuronectiformes, and the morphological characters are much similar to those of Zebrias. In this article, we sequenced, characterized, and compared the complete mitogenome of A. cornuta for the first time. The genome is 16,737 base pairs in length, and is typically consist of 37 genes, including 13 protein-coding genes, two ribosomal RNA, 22 transfer RNA, as well as a putative L-strand replication origin and a putative control region. The gene organization is identical to that of typical bony fishes. The overall base composition is 29.1, 28.3, 26.8 and 15.8% for C, A, T and G, respectively, with a slight AT bias of 55.1%. This result is expected to contribute to understanding the systematic evolution of the genus Aesopia and further taxonomic and phylogenetic studies of Soleidae and Pleuronectiformes.
An insight into the sialome of the horse fly, Tabanus bromius

PubMed Central

Ribeiro, José M.C.; Kazimirova, Maria; Takac, Peter; Andersen, John F.; Francischetti, Ivo M.B.

2015-01-01

Blood feeding animals face their host's defenses against tissue injury and blood loss while attempting to feed. One adaptation to surmount these barriers involves the evolution of a salivary potion that disarms their host's inflammatory and anti-hemostatic processes. The composition of the peptide moiety of this potion, or sialome (from the Greek sialo=saliva), can be deducted in part by proper interpretation of the blood feeder' sialotranscriptome. In this work we disclose the sialome of the blood feeding adult female Tabanus bromius. Following assembly of over 75 million Illumina reads (101 nt long) 16,683 contigs were obtained from which 4,078 coding sequences were extracted. From these, 320 were assigned as coding for putative secreted proteins. These 320 contigs mapped 85% of the reads. The antigen-5 proteins family was studied in detail, indicating three Tabanus specific clades with and without disintegrin domains, as well as with and without leukotriene binding domains. Defensins were also detailed; a clade of salivary tabanid peptides was found lacking the propeptide domain ending in the KR dipeptide signaling furin cleavage. Novel protein families were also disclosed. Viral transcripts were identified closely matching the Kotonkan virus capsid proteins. Full length Mariner transposases were also identified. A total of 3,043 coding sequences and their protein products were deposited in Genbank. Hyperlinked excel spreadsheets containing the coding sequences and their annotation are available at http://exon.niaid.nih.gov/transcriptome/T_bromius/Tbromius-web.xlsx (hyperlinked excel spreadsheet, 11 MB) and http://exon.niaid.nih.gov/transcriptome/T_bromius/Tbromius-SA.zip (Standalone excel with all local links, 360 MB). These sequences provide for a platform from which further proteomic studies may be designed to identify salivary proteins from T. bromius that are of pharmacological interest or used as immunological markers of host exposure. PMID:26369729
A global transcriptional analysis of Plasmodium falciparum malaria reveals a novel family of telomere-associated lncRNAs

PubMed Central

2011-01-01

Background Mounting evidence suggests a major role for epigenetic feedback in Plasmodium falciparum transcriptional regulation. Long non-coding RNAs (lncRNAs) have recently emerged as a new paradigm in epigenetic remodeling. We therefore set out to investigate putative roles for lncRNAs in P. falciparum transcriptional regulation. Results We used a high-resolution DNA tiling microarray to survey transcriptional activity across 22.6% of the P. falciparum strain 3D7 genome. We identified 872 protein-coding genes and 60 putative P. falciparum lncRNAs under developmental regulation during the parasite's pathogenic human blood stage. Further characterization of lncRNA candidates led to the discovery of an intriguing family of lncRNA telomere-associated repetitive element transcripts, termed lncRNA-TARE. We have quantified lncRNA-TARE expression at 15 distinct chromosome ends and mapped putative transcriptional start and termination sites of lncRNA-TARE loci. Remarkably, we observed coordinated and stage-specific expression of lncRNA-TARE on all chromosome ends tested, and two dominant transcripts of approximately 1.5 kb and 3.1 kb transcribed towards the telomere. Conclusions We have characterized a family of 22 telomere-associated lncRNAs in P. falciparum. Homologous lncRNA-TARE loci are coordinately expressed after parasite DNA replication, and are poised to play an important role in P. falciparum telomere maintenance, virulence gene regulation, and potentially other processes of parasite chromosome end biology. Further study of lncRNA-TARE and other promising lncRNA candidates may provide mechanistic insight into P. falciparum transcriptional regulation. PMID:21689454
Genome-Wide Analysis of Mycoplasma bovirhinis GS01 Reveals Potential Virulence Factors and Phylogenetic Relationships.

PubMed

Chen, Shengli; Hao, Huafang; Zhao, Ping; Liu, Yongsheng; Chu, Yuefeng

2018-05-04

Mycoplasma bovirhinis is a significant etiology in bovine pneumonia and mastitis, but our knowledge about the genetic and pathogenic mechanisms of M. bovirhinis is very limited. In this study, we sequenced the complete genome of M. bovirhinis strain GS01 isolated from the nasal swab of pneumonic calves in Gansu, China, and we found that its genome forms a 847,985 bp single circular chromosome with a GC content of 27.57% and with 707 protein-coding genes. The putative virulence determinants of M. bovirhinis were then analyzed. Results showed that three genomic islands and 16 putative virulence genes, including one adhesion gene enolase, seven surface lipoproteins, proteins involved in glycerol metabolism, and cation transporters, might be potential virulence factors. Glycerol and pyruvate metabolic pathways were defective. Comparative analysis revealed remarkable genome variations between GS01 and a recently reported HAZ141_2 strain, and extremely low homology with others mycoplasma species. Phylogenetic analysis demonstrated that M. bovirhinis was most genetically close to M. canis , distant from other bovine Mycoplasma species. Genomic dissection may provide useful information on the pathogenic mechanisms and genetics of M. bovirhinis . Copyright © 2018 Chen et al.
Molecular cloning and characterization of a novel RING zinc-finger protein gene up-regulated under in vitro salt stress in cassava.

PubMed

dos Reis, Sávio Pinho; Tavares, Liliane de Souza Conceição; Costa, Carinne de Nazaré Monteiro; Brígida, Aílton Borges Santa; de Souza, Cláudia Regina Batista

2012-06-01

Cassava (Manihot esculenta Crantz) is one of the world's most important food crops. It is cultivated mainly in developing countries of tropics, since its root is a major source of calories for low-income people due to its high productivity and resistance to many abiotic and biotic factors. A previous study has identified a partial cDNA sequence coding for a putative RING zinc finger in cassava storage root. The RING zinc finger protein is a specialized type of zinc finger protein found in many organisms. Here, we isolated the full-length cDNA sequence coding for M. esculenta RZF (MeRZF) protein by a combination of 5' and 3' RACE assays. BLAST analysis showed that its deduced amino acid sequence has a high level of similarity to plant proteins of RZF family. MeRZF protein contains a signature sequence motif for a RING zinc finger at its C-terminal region. In addition, this protein showed a histidine residue at the fifth coordination site, likely belonging to the RING-H2 subgroup, as confirmed by our phylogenetic analysis. There is also a transmembrane domain in its N-terminal region. Finally, semi-quantitative RT-PCR assays showed that MeRZF expression is increased in detached leaves treated with sodium chloride. Here, we report the first evidence of a RING zinc finger gene of cassava showing potential role in response to salt stress.
Cloning and characterization of a basic phospholipase A2 homologue from Micrurus corallinus (coral snake) venom gland.

PubMed

de Oliveira, Ursula Castro; Assui, Alessandra; da Silva, Alvaro Rossan de Brandão Prieto; de Oliveira, Jane Silveira; Ho, Paulo Lee

2003-09-01

During the cloning of abundant cDNAs expressed in the Micrurus corallinus coral snake venom gland, several putative toxins, including a phospholipase A2 homologue cDNA (clone V2), were identified. The V2 cDNA clone codes for a potential coral snake toxin with a signal peptide of 27 amino acid residues plus a predicted mature protein with 119 amino acid residues. The deduced protein is highly similar to known phospholipases A2, with seven deduced S-S bridges at the same conserved positions. This protein was expressed in Escherichia coli as a His-tagged protein that allowed the rapid purification of the recombinant protein. This protein was used to generate antibodies, which recognized the recombinant protein in Western blot. This antiserum was used to screen a large number of venoms, showing a ubiquitous distribution of immunorelated proteins in all elapidic venoms but not in the viperidic Bothrops jararaca venom. This is the first description of a complete primary structure of a phospholipase A2 homologue deduced by cDNA cloning from a coral snake.
The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins.

PubMed Central

Fanning, T; Singer, M

1987-01-01

Recent work suggests that one or more members of the highly repeated LINE-1 (L1) DNA family found in all mammals may encode one or more proteins. Here we report the sequence of a portion of an L1 cloned from the domestic cat (Felis catus). These data permit comparison of the L1 sequences in four mammalian orders (Carnivore, Lagomorph, Rodent and Primate) and the comparison supports the suggested coding potential. In two separate, noncontiguous regions in the carboxy terminal half of the proteins predicted from the DNA sequences, there are several strongly conserved segments. In one region, these share homology with known or suspected reverse transcriptases, as described by others in rodents and primates. In the second region, closer to the carboxy terminus, the strongly conserved segments are over 90% homologous among the four orders. One of the latter segments is cysteine rich and resembles the putative metal binding domains of nucleic acid binding proteins, including those of TFIIIA and retroviruses. PMID:3562227
Gene 2 of the sigma rhabdovirus genome encodes the P protein, and gene 3 encodes a protein related to the reverse transcriptase of retroelements.

PubMed

Landès-Devauchelle, C; Bras, F; Dezélée, S; Teninges, D

1995-11-10

The nucleotide sequence of the genes 2 and 3 of the Drosophila rhabdovirus sigma was determined from cDNAs to viral genome and poly(A)+ mRNAs. Gene 2 comprises 1032 nucleotides and contains a long ORF encoding a molecular weight 35,208 polypeptide present in infected cells and in virions which migrates in SDS-PAGE as a doublet of M(r) about 60 kDa. The distribution of acidic charges as well as the electrophoretic properties of the protein are characteristic of the rhabdovirus P proteins. Gene 3 comprises 923 nucleotides and contains a long ORF capable of coding a polypeptide of 298 amino acids of MW 33,790. The putative protein (PP3) is similar in size to a minor component of the virions. Computer analysis shows that the sequence of PP3 contains three motifs related to the conserved motifs of reverse transcriptases.
The Glucuronic Acid Utilization Gene Cluster from Bacillus stearothermophilus T-6

PubMed Central

Shulami, Smadar; Gat, Orit; Sonenshein, Abraham L.; Shoham, Yuval

1999-01-01

A λ-EMBL3 genomic library of Bacillus stearothermophilus T-6 was screened for hemicellulolytic activities, and five independent clones exhibiting β-xylosidase activity were isolated. The clones overlap each other and together represent a 23.5-kb chromosomal segment. The segment contains a cluster of xylan utilization genes, which are organized in at least three transcriptional units. These include the gene for the extracellular xylanase, xylanase T-6; part of an operon coding for an intracellular xylanase and a β-xylosidase; and a putative 15.5-kb-long transcriptional unit, consisting of 12 genes involved in the utilization of α-d-glucuronic acid (GlcUA). The first four genes in the potential GlcUA operon (orf1, -2, -3, and -4) code for a putative sugar transport system with characteristic components of the binding-protein-dependent transport systems. The most likely natural substrate for this transport system is aldotetraouronic acid [2-O-α-(4-O-methyl-α-d-glucuronosyl)-xylotriose] (MeGlcUAXyl3). The following two genes code for an intracellular α-glucuronidase (aguA) and a β-xylosidase (xynB). Five more genes (kdgK, kdgA, uxaC, uxuA, and uxuB) encode proteins that are homologous to enzymes involved in galacturonate and glucuronate catabolism. The gene cluster also includes a potential regulatory gene, uxuR, the product of which resembles repressors of the GntR family. The apparent transcriptional start point of the cluster was determined by primer extension analysis and is located 349 bp from the initial ATG codon. The potential operator site is a perfect 12-bp inverted repeat located downstream from the promoter between nucleotides +170 and +181. Gel retardation assays indicated that UxuR binds specifically to this sequence and that this binding is efficiently prevented in vitro by MeGlcUAXyl3, the most likely molecular inducer. PMID:10368143

Genomic evidence for genes encoding leucine-rich repeat receptors linked to resistance against the eukaryotic extra- and intracellular Brassica napus pathogens Leptosphaeria maculans and Plasmodiophora brassicae.

PubMed

Stotz, Henrik U; Harvey, Pascoe J; Haddadi, Parham; Mashanova, Alla; Kukol, Andreas; Larkan, Nicholas J; Borhan, M Hossein; Fitt, Bruce D L

2018-01-01

Genes coding for nucleotide-binding leucine-rich repeat (LRR) receptors (NLRs) control resistance against intracellular (cell-penetrating) pathogens. However, evidence for a role of genes coding for proteins with LRR domains in resistance against extracellular (apoplastic) fungal pathogens is limited. Here, the distribution of genes coding for proteins with eLRR domains but lacking kinase domains was determined for the Brassica napus genome. Predictions of signal peptide and transmembrane regions divided these genes into 184 coding for receptor-like proteins (RLPs) and 121 coding for secreted proteins (SPs). Together with previously annotated NLRs, a total of 720 LRR genes were found. Leptosphaeria maculans-induced expression during a compatible interaction with cultivar Topas differed between RLP, SP and NLR gene families; NLR genes were induced relatively late, during the necrotrophic phase of pathogen colonization. Seven RLP, one SP and two NLR genes were found in Rlm1 and Rlm3/Rlm4/Rlm7/Rlm9 loci for resistance against L. maculans on chromosome A07 of B. napus. One NLR gene at the Rlm9 locus was positively selected, as was the RLP gene on chromosome A10 with LepR3 and Rlm2 alleles conferring resistance against L. maculans races with corresponding effectors AvrLm1 and AvrLm2, respectively. Known loci for resistance against L. maculans (extracellular hemi-biotrophic fungus), Sclerotinia sclerotiorum (necrotrophic fungus) and Plasmodiophora brassicae (intracellular, obligate biotrophic protist) were examined for presence of RLPs, SPs and NLRs in these regions. Whereas loci for resistance against P. brassicae were enriched for NLRs, no such signature was observed for the other pathogens. These findings demonstrate involvement of (i) NLR genes in resistance against the intracellular pathogen P. brassicae and a putative NLR gene in Rlm9-mediated resistance against the extracellular pathogen L. maculans.
Isolation and sequencing of the gene encoding Sp23, a structural protein of spermatophore of the mealworm beetle, Tenebrio molitor.

PubMed

Feng, X; Happ, G M

1996-11-14

The cDNA for Sp23, a structural protein of the spermatophore of Tenebrio molitor, had been previously cloned and characterized (Paesen, G.C., Schwartz, M.B., Peferoen, M., Weyda, F. and Happ, G.M. (1992a) Amino acid sequence of Sp23, a structure protein of the spermatophore of the mealworm beetle, Tenebrio molitor. J. Biol. Chem. 257, 18852-18857). Using the labeled cDNA for Sp23 as a probe to screen a library of genomic DNA from Tenebrio molitor, we isolated a genomic clone for Sp23. A 5373-base pair (bp) restriction fragment containing the Sp23 gene was sequenced. The coding region is separated by a 55-bp intron which is located close to the translation start site. Three putative ecdysone response elements (EcRE) are identified in the 5' flanking region of the Sp23 gene. Comparison of the flanking regions of the Sp23 gene with those of the D-protein gene expressed in the accessory glands of Tenebrio reveals similar sequences present in the flanking regions of the two genes. The genomic organization of the coding region of the Sp23 gene shares similarities with that of the D-protein gene, three Drosophila accessory gland genes and two Drosophila 20-OH ecdysone-responsive genes.
A Deeper Examination of Thorellius atrox Scorpion Venom Components with Omic Techonologies.

PubMed

Romero-Gutierrez, Teresa; Peguero-Sanchez, Esteban; Cevallos, Miguel A; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D

2017-12-12

This communication reports a further examination of venom gland transcripts and venom composition of the Mexican scorpion Thorellius atrox using RNA-seq and tandem mass spectrometry. The RNA-seq, which was performed with the Illumina protocol, yielded more than 20,000 assembled transcripts. Following a database search and annotation strategy, 160 transcripts were identified, potentially coding for venom components. A novel sequence was identified that potentially codes for a peptide with similarity to spider ω-agatoxins, which act on voltage-gated calcium channels, not known before to exist in scorpion venoms. Analogous transcripts were found in other scorpion species. They could represent members of a new scorpion toxin family, here named omegascorpins. The mass fingerprint by LC-MS identified 135 individual venom components, five of which matched with the theoretical masses of putative peptides translated from the transcriptome. The LC-MS/MS de novo sequencing allowed to reconstruct and identify 42 proteins encoded by assembled transcripts, thus validating the transcriptome analysis. Earlier studies conducted with this scorpion venom permitted the identification of only twenty putative venom components. The present work performed with more powerful and modern omic technologies demonstrates the capacity of accomplishing a deeper characterization of scorpion venom components and the identification of novel molecules with potential applications in biomedicine and the study of ion channel physiology.
Adaptation, ecology, and evolution of the halophilic stromatolite archaeon Halococcus hamelinensis inferred through genome analyses.

PubMed

Gudhka, Reema K; Neilan, Brett A; Burns, Brendan P

2015-01-01

Halococcus hamelinensis was the first archaeon isolated from stromatolites. These geomicrobial ecosystems are thought to be some of the earliest known on Earth, yet, despite their evolutionary significance, the role of Archaea in these systems is still not well understood. Detailed here is the genome sequencing and analysis of an archaeon isolated from stromatolites. The genome of H. hamelinensis consisted of 3,133,046 base pairs with an average G+C content of 60.08% and contained 3,150 predicted coding sequences or ORFs, 2,196 (68.67%) of which were protein-coding genes with functional assignments and 954 (29.83%) of which were of unknown function. Codon usage of the H. hamelinensis genome was consistent with a highly acidic proteome, a major adaptive mechanism towards high salinity. Amino acid transport and metabolism, inorganic ion transport and metabolism, energy production and conversion, ribosomal structure, and unknown function COG genes were overrepresented. The genome of H. hamelinensis also revealed characteristics reflecting its survival in its extreme environment, including putative genes/pathways involved in osmoprotection, oxidative stress response, and UV damage repair. Finally, genome analyses indicated the presence of putative transposases as well as positive matches of genes of H. hamelinensis against various genomes of Bacteria, Archaea, and viruses, suggesting the potential for horizontal gene transfer.
The putative multidrug resistance protein MRP-7 inhibits methylmercury-associated animal toxicity and dopaminergic neurodegeneration in Caenorhabditis elegans

PubMed Central

VanDuyn, Natalia; Nass, Richard

2013-01-01

Parkinson’s disease (PD) is the most prevalent neurodegenerative motor disorder worldwide, and results in the progressive loss of dopamine (DA) neurons in the substantia nigra pars compacta. Gene-environment interactions are believed to play a significant role in the vast majority of PD cases, yet the toxicants and the associated genes involved in the neuropathology are largely ill-defined. Recent epidemiological and biochemical evidence suggests that methylmercury (MeHg) may be an environmental toxicant that contributes to the development of PD. Here we report that a gene coding for the putative multidrug resistance protein MRP-7 in Caenorhabditis elegans (C. elegans) modulates whole animal and DA neuron sensitivity to MeHg. In this study we demonstrate that genetic knockdown of MRP-7 results in a 2-fold increase in Hg levels and a dramatic increase in stress response proteins associated with the endoplasmic reticulum, golgi apparatus, and mitochondria, as well as an increase in MeHg-associated animal death. Chronic exposure to low concentrations of MeHg induces MRP-7 gene expression, while exposures in MRP-7 genetic knockdown animals results in a loss of DA neuron integrity without affecting whole animal viability. Furthermore, transgenic animals expressing a fluorescent reporter behind the endogenous MRP-7 promoter indicate that the transporter is expressed in DA neurons. These studies show for the first time that a multidrug resistance protein is expressed in DA neurons, and its expression inhibits MeHg-associated DA neuron pathology. PMID:24266639
The putative multidrug resistance protein MRP-7 inhibits methylmercury-associated animal toxicity and dopaminergic neurodegeneration in Caenorhabditis elegans.

PubMed

VanDuyn, Natalia; Nass, Richard

2014-03-01

Parkinson's disease (PD) is the most prevalent neurodegenerative motor disorder worldwide, and results in the progressive loss of dopamine (DA) neurons in the substantia nigra pars compacta. Gene-environment interactions are believed to play a significant role in the vast majority of PD cases, yet the toxicants and the associated genes involved in the neuropathology are largely ill-defined. Recent epidemiological and biochemical evidence suggests that methylmercury (MeHg) may be an environmental toxicant that contributes to the development of PD. Here, we report that a gene coding for the putative multidrug resistance protein MRP-7 in Caenorhabditis elegans modulates whole animal and DA neuron sensitivity to MeHg. In this study, we demonstrate that genetic knockdown of MRP-7 results in a twofold increase in Hg levels and a dramatic increase in stress response proteins associated with the endoplasmic reticulum, golgi apparatus, and mitochondria, as well as an increase in MeHg-associated animal death. Chronic exposure to low concentrations of MeHg induces MRP-7 gene expression, while exposures in MRP-7 genetic knockdown animals results in a loss of DA neuron integrity without affecting whole animal viability. Furthermore, transgenic animals expressing a fluorescent reporter behind the endogenous MRP-7 promoter indicate that the transporter is expressed in DA neurons. These studies show for the first time that a multidrug resistance protein is expressed in DA neurons, and its expression inhibits MeHg-associated DA neuron pathology. © 2013 International Society for Neurochemistry.
Prevalence of transcription promoters within archaeal operons and coding sequences.

PubMed

Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S

2009-01-01

Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of approximately 64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein-DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3' ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes-events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements.
[Preliminary proteomics analysis of the total proteins of HL Type cytoplasmic male sterility rice anther].

PubMed

Wen, Li; Liu, Gai; Zhang, Zai-Jun; Tao, Jun; Wan, Cui-Xiang; Zhu, Ying-Guo

2006-03-01

The proteins of HL type cytoplasmic male sterility rice anther of YTA (CMS) and YTB (maintenance line) were separated by two-dimensional electrophoresis with immobilized ph (3-10 non-linear) gradients as the first dimension and SDS-PAGE as the second. The silver-stained proteins spots were analyzed using Image Master 2D software, there were about 1800 detectable spots on each 2D-gel, and about 85 spots were differential expressed. With direct MALDI-TOF mass spectrometry analysis and protein database searching, 9 protein spots out of 16 were identified. Among those proteins, there were Putative nucleic acid binding protein, glucose-1-phosphate adenylyltransferase (ADP-glucose pyrophosphorylase, AGPase) (EC: 2.7.7.27) large chain, UDP-glucuronic acid decarboxylase, putative calcium-binding protein annexin, putative acetyl-CoA synthetase and putative lipoamide dehydrogenase etc. They were closely associated with metabolism, protein biosynthesis, transcription, signal transduction and so on, all of which are cell activities that are essential to pollen development. Some of the identified proteins, i.e. AGPase, putative lipoamide dehydrogenase and putative acetyl-CoA synthetase were deeply discussed on the relationship to CMS. AGPase catalyzes a very important step in the biosynthesis of alpha 1,4-glucans (glycogen or starch) in bacteria and plants: synthesis of the activated glucosyl donor, ADP-glucose, from glucose-1-phosphate and ATP. The lack of the AGPase in male sterile line might directly result in the reduction of starch, and the synthesis of starch was the most important processes during the development of pollen. In present research, the descent or reduction of putative lipoamide dehydrogenase and putative acetyl-CoA synthetase seemed involved in pollen sterility in rice. The degeneration and formation of various tissues during pollen development may impose high demands for energy and key biosynthetic intermediates. Under such conditions, the TCA cycle needs to operate fully, because the TCA cycle is an important source for many intermediates required for biosynthetic pathways, in addition to performing an oxidative, energy-producing role. Thus, it seemed reasonable to infer that the decrease of putative lipoamide dehydrogenase and putative acetyl-CoA synthetase in anther might prevent the conversion of pyruvate into acetyl-CoA, and as a result, the TCA cycle could no longer operate at a sufficient rate to meet all requirements in anther cells, leading to pollen sterility. This study gave new insights into the mechanism of CMS in rice and demonstrated the power of the proteomic approach in plant biology studies.
The putative protein methyltransferase LAE1 controls cellulase gene expression in Trichoderma reesei

PubMed Central

Seiboth, Bernhard; Karimi, Razieh Aghcheh; Phatale, Pallavi A; Linke, Rita; Hartl, Lukas; Sauer, Dominik G; Smith, Kristina M; Baker, Scott E; Freitag, Michael; Kubicek, Christian P

2012-01-01

Summary Trichoderma reesei is an industrial producer of enzymes that degrade lignocellulosic polysaccharides to soluble monomers, which can be fermented to biofuels. Here we show that the expression of genes for lignocellulose degradation are controlled by the orthologous T. reesei protein methyltransferase LAE1. In a lae1 deletion mutant we observed a complete loss of expression of all seven cellulases, auxiliary factors for cellulose degradation, β-glucosidases and xylanases were no longer expressed. Conversely, enhanced expression of lae1 resulted in significantly increased cellulase gene transcription. Lae1-modulated cellulase gene expression was dependent on the function of the general cellulase regulator XYR1, but also xyr1 expression was LAE1-dependent. LAE1 was also essential for conidiation of T. reesei. Chromatin immunoprecipitation followed by high-throughput sequencing (‘ChIP-seq’) showed that lae1 expression was not obviously correlated with H3K4 di- or trimethylation (indicative of active transcription) or H3K9 trimethylation (typical for heterochromatin regions) in CAZyme coding regions, suggesting that LAE1 does not affect CAZyme gene expression by directly modulating H3K4 or H3K9 methylation. Our data demonstrate that the putative protein methyltransferase LAE1 is essential for cellulase gene expression in T. reesei through mechanisms that remain to be identified. PMID:22554051
Base Flipping in V(D)J Recombination: Insights into the Mechanism of Hairpin Formation, the 12/23 Rule, and the Coordination of Double-Strand Breaks▿ †

PubMed Central

Bischerour, Julien; Lu, Catherine; Roth, David B.; Chalmers, Ronald

2009-01-01

Tn5 transposase cleaves the transposon end using a hairpin intermediate on the transposon end. This involves a flipped base that is stacked against a tryptophan residue in the protein. However, many other members of the cut-and-paste transposase family, including the RAG1 protein, produce a hairpin on the flanking DNA. We have investigated the reversed polarity of the reaction for RAG recombination. Although the RAG proteins appear to employ a base-flipping mechanism using aromatic residues, the putatively flipped base is not at the expected location and does not appear to stack against any of the said aromatic residues. We propose an alternative model in which a flipped base is accommodated in a nonspecific pocket or cleft within the recombinase. This is consistent with the location of the flipped base at position −1 in the coding flank, which can be occupied by purine or pyrimidine bases that would be difficult to stabilize using a single, highly specific, interaction. Finally, during this work we noticed that the putative base-flipping events on either side of the 12/23 recombination signal sequence paired complex are coupled to the nicking steps and serve to coordinate the double-strand breaks on either side of the complex. PMID:19720743
Identification of the Operon for the Sorbitol (Glucitol) Phosphoenolpyruvate:Sugar Phosphotransferase System in Streptococcus mutans

PubMed Central

Boyd, David A.; Thevenot, Tracy; Gumbmann, Markus; Honeyman, Allen L.; Hamilton, Ian R.

2000-01-01

Transposon mutagenesis and marker rescue were used to isolate and identify an 8.5-kb contiguous region containing six open reading frames constituting the operon for the sorbitol P-enolpyruvate phosphotransferase transport system (PTS) of Streptococcus mutans LT11. The first gene, srlD, codes for sorbitol-6-phosphate dehydrogenase, followed downstream by srlR, coding for a transcriptional regulator; srlM, coding for a putative activator; and the srlA, srlE, and srlB genes, coding for the EIIC, EIIBC, and EIIA components of the sorbitol PTS, respectively. Among all sorbitol PTS operons characterized to date, the srlD gene is found after the genes coding for the EII components; thus, the location of the gene in S. mutans is unique. The SrlR protein is similar to several transcriptional regulators found in Bacillus spp. that contain PTS regulator domains (J. Stülke, M. Arnaud, G. Rapoport, and I. Martin-Verstraete, Mol. Microbiol. 28:865–874, 1998), and its gene overlaps the srlM gene by 1 bp. The arrangement of these two regulatory genes is unique, having not been reported for other bacteria. PMID:10639465
Insights from the genome of a high alkaline cellulase producing Aspergillus fumigatus strain obtained from Peruvian Amazon rainforest.

PubMed

Paul, Sujay; Zhang, Angel; Ludeña, Yvette; Villena, Gretty K; Yu, Fengan; Sherman, David H; Gutiérrez-Correa, Marcel

2017-06-10

Here, we report the complete genome sequence of a high alkaline cellulase producing Aspergillus fumigatus strain LMB-35Aa isolated from soil of Peruvian Amazon rainforest. The genome is ∼27.5mb in size, comprises of 228 scaffolds with an average GC content of 50%, and is predicted to contain a total of 8660 protein-coding genes. Of which, 6156 are with known function; it codes for 607 putative CAZymes families potentially involved in carbohydrate metabolism. Several important cellulose degrading genes, such as endoglucanase A, endoglucanase B, endoglucanase D and beta-glucosidase, are also identified. The genome of A. fumigatus strain LMB-35Aa represents the first whole sequenced genome of non-clinical, high cellulase producing A. fumigatus strain isolated from forest soil. Copyright © 2017 Elsevier B.V. All rights reserved.
An open reading frame in intron seven of the sea urchin DNA-methyltransferase gene codes for a functional AP1 endonuclease.

PubMed

Cioffi, Anna Valentina; Ferrara, Diana; Cubellis, Maria Vittoria; Aniello, Francesco; Corrado, Marcella; Liguori, Francesca; Amoroso, Alessandro; Fucci, Laura; Branno, Margherita

2002-08-01

Analysis of the genome structure of the Paracentrotus lividus (sea urchin) DNA methyltransferase (DNA MTase) gene showed the presence of an open reading frame, named METEX, in intron 7 of the gene. METEX expression is developmentally regulated, showing no correlation with DNA MTase expression. In fact, DNA MTase transcripts are present at high concentrations in the early developmental stages, while METEX is expressed at late stages of development. Two METEX cDNA clones (Met1 and Met2) that are different in the 3' end have been isolated in a cDNA library screening. The putative translated protein from Met2 cDNA clone showed similarity with Escherichia coli endonuclease III on the basis of sequence and predictive three-dimensional structure. The protein, overexpressed in E. coli and purified, had functional properties similar to the endonuclease specific for apurinic/apyrimidinic (AP) sites on the basis of the lyase activity. Therefore the open reading frame, present in intron 7 of the P. lividus DNA MTase gene, codes for a functional AP endonuclease designated SuAP1.
The draft genome sequence of Mangrovibacter sp. strain MP23, an endophyte isolated from the roots of Phragmites karka.

PubMed

Behera, Pratiksha; Vaishampayan, Parag; Singh, Nitin K; Mishra, Samir R; Raina, Vishakha; Suar, Mrutyunjay; Pattnaik, Ajit K; Rastogi, Gurdeep

2016-09-01

Till date, only one draft genome has been reported within the genus Mangrovibacter. Here, we report the second draft genome shotgun sequence of a Mangrovibacter sp. strain MP23 that was isolated from the roots of Phargmites karka (P. karka), an invasive weed growing in the Chilika Lagoon, Odisha, India. Strain MP23 is a facultative anaerobic, nitrogen-fixing endophytic bacteria that grows optimally at 37 °C, 7.0 pH, and 1% NaCl concentration. The draft genome sequence of strain MP23 contains 4,947,475 bp with an estimated G + C content of 49.9% and total 4392 protein coding genes. The genome sequence has provided information on putative genes that code for proteins involved in oxidative stress, uptake of nutrients, and nitrogen fixation that might offer niche specific ecological fitness and explain the invasive success of P. karka in Chilika Lagoon. The draft genome sequence and annotation have been deposited at DDBJ/EMBL/GenBank under the accession number LYRP00000000.
Non-coding RNAs—Novel targets in neurotoxicity

PubMed Central

Tal, Tamara L.; Tanguay, Robert L.

2012-01-01

Over the past ten years non-coding RNAs (ncRNAs) have emerged as pivotal players in fundamental physiological and cellular processes and have been increasingly implicated in cancer, immune disorders, and cardiovascular, neurodegenerative, and metabolic diseases. MicroRNAs (miRNAs) represent a class of ncRNA molecules that function as negative regulators of post-transcriptional gene expression. miRNAs are predicted to regulate 60% of all human protein-coding genes and as such, play key roles in cellular and developmental processes, human health, and disease. Relative to counterparts that lack bindings sites for miRNAs, genes encoding proteins that are post-transcriptionally regulated by miRNAs are twice as likely to be sensitive to environmental chemical exposure. Not surprisingly, miRNAs have been recognized as targets or effectors of nervous system, developmental, hepatic, and carcinogenic toxicants, and have been identified as putative regulators of phase I xenobiotic-metabolizing enzymes. In this review, we give an overview of the types of ncRNAs and highlight their roles in neurodevelopment, neurological disease, activity-dependent signaling, and drug metabolism. We then delve into specific examples that illustrate their importance as mediators, effectors, or adaptive agents of neurotoxicants or neuroactive pharmaceutical compounds. Finally, we identify a number of outstanding questions regarding ncRNAs and neurotoxicity. PMID:22394481
Transcriptome and gene expression profile of ovarian follicle tissue of the triatomine bug Rhodnius prolixus

PubMed Central

Medeiros, Marcelo N.; Logullo, Raquel; Ramos, Isabela B.; Sorgine, Marcos H. F.; Paiva-Silva, Gabriela O.; Mesquita, Rafael D.; Machado, Ednildo Alcantara; Coutinho, Maria Alice; Masuda, Hatisaburo; Capurro, Margareth L.; Ribeiro, José M.C.; Cardoso Braz, Glória Regina; Oliveira, Pedro L

2013-01-01

Insect oocytes grow in close association with the ovarian follicular epithelium (OFE), which escorts the oocyte during oogenesis and is responsible for synthesis and secretion of the eggshell. We describe a transcriptome of OFE of the triatomine bug Rhodnius prolixus, a vector of Chagas disease, to increase our knowledge of the role of FE in egg development. Random clones were sequenced from a cDNA library of different stages of follicle development. The transcriptome showed high commitment to transcription, protein synthesis, and secretion. The most abundant cDNA was a secreted (S) small, proline-rich protein with maximal expression in the vitellogenic follicle, suggesting a role in oocyte maturation. We also found Rp45, a chorion protein already described, and a putative chitin-associated cuticle protein that was an eggshell component candidate. Six transcripts coding for proteins related to the unfolded protein response (UPR) by were chosen and their expression analyzed. Surprisingly, transcripts related to UPR showed higher expression during early stages of development and downregulation during late stages, when transcripts coding for S proteins participating in chorion formation were highly expressed. Several transcripts with potential roles in oogenesis and embryo development are also discussed. We propose that intense protein synthesis at the FE results in reticulum stress (RS) and that lowering expression of a set of genes related to cell survival should lead to degeneration of follicular cells at oocyte maturation. This paradoxical suppression of UPR suggests that ovarian follicles may represent an interesting model for studying control of RS and cell survival in professional S cell types. PMID:21736942
A Sabin 3-Derived Poliovirus Recombinant Contained a Sequence Homologous with Indigenous Human Enterovirus Species C in the Viral Polymerase Coding Region†

PubMed Central

Arita, Minetaro; Zhu, Shuang-Li; Yoshida, Hiromu; Yoneyama, Tetsuo; Miyamura, Tatsuo; Shimizu, Hiroyuki

2005-01-01

Outbreaks of poliomyelitis caused by circulating vaccine-derived polioviruses (cVDPVs) have been reported in areas where indigenous wild polioviruses (PVs) were eliminated by vaccination. Most of these cVDPVs contained unidentified sequences in the nonstructural protein coding region which were considered to be derived from human enterovirus species C (HEV-C) by recombination. In this study, we report isolation of a Sabin 3-derived PV recombinant (Cambodia-02) from an acute flaccid paralysis (AFP) case in Cambodia in 2002. We attempted to identify the putative recombination counterpart of Cambodia-02 by sequence analysis of nonpolio enterovirus isolates from AFP cases in Cambodia from 1999 to 2003. Based on the previously estimated evolution rates of PVs, the recombination event resulting in Cambodia-02 was estimated to have occurred within 6 months after the administration of oral PV vaccine (99.3% nucleotide identity in VP1 region). The 2BC and the 3Dpol coding regions of Cambodia-02 were grouped into the genetic cluster of indigenous coxsackie A virus type 17 (CAV17) (the highest [87.1%] nucleotide identity) and the cluster of indigenous CAV13-CAV18 (the highest [94.9%] nucleotide identity) by the phylogenic analysis of the HEV-C isolates in 2002, respectively. CAV13-CAV18 and CAV17 were the dominant HEV-C serotypes in 2002 but not in 2001 and in 2003. We found a putative recombination between CAV13-CAV18 and CAV17 in the 3CDpro coding region of a CAV17 isolate. These results suggested that a part of the 3Dpol coding region of PV3(Cambodia-02) was derived from a HEV-C strain genetically related to indigenous CAV13-CAV18 strains in 2002 in Cambodia. PMID:16188967
Complete genome sequence of lymphocystis disease virus isolated from China.

PubMed

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-07-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1.
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China

PubMed Central

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-01-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1. PMID:15194775
The genome of obligately intracellular Ehrlichia canis revealsthemes of complex membrane structure and immune evasion strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mavromatis, K.; Kuyler Doyle, C.; Lykidis, A.

2005-09-01

Ehrlichia canis, a small obligately intracellular, tick-transmitted, gram-negative, a-proteobacterium is the primary etiologic agent of globally distributed canine monocytic ehrlichiosis. Complete genome sequencing revealed that the E. canis genome consists of a single circular chromosome of 1,315,030 bp predicted to encode 925 proteins, 40 stable RNA species, and 17 putative pseudogenes, and a substantial proportion of non-coding sequence (27 percent). Interesting genome features include a large set of proteins with transmembrane helices and/or signal sequences, and a unique serine-threonine bias associated with the potential for O-glycosylation that was prominent in proteins associated with pathogen-host interactions. Furthermore, two paralogous protein familiesmore » associated with immune evasion were identified, one of which contains poly G:C tracts, suggesting that they may play a role in phase variation and facilitation of persistent infections. Proteins associated with pathogen-host interactions were identified including a small group of proteins (12) with tandem repeats and another with eukaryotic-like ankyrin domains (7).« less

Whole-Genome Survey of the Putative ATP-Binding Cassette Transporter Family Genes in Vitis vinifera

PubMed Central

Çakır, Birsen; Kılıçkaya, Ozan

2013-01-01

The ATP-binding cassette (ABC) protein superfamily constitutes one of the largest protein families known in plants. In this report, we performed a complete inventory of ABC protein genes in Vitis vinifera, the whole genome of which has been sequenced. By comparison with ABC protein members of Arabidopsis thaliana, we identified 135 putative ABC proteins with 1 or 2 NBDs in V. vinifera. Of these, 120 encode intrinsic membrane proteins, and 15 encode proteins missing TMDs. V. vinifera ABC proteins can be divided into 13 subfamilies with 79 “full-size,” 41 “half-size,” and 15 “soluble” putative ABC proteins. The main feature of the Vitis ABC superfamily is the presence of 2 large subfamilies, ABCG (pleiotropic drug resistance and white-brown complex homolog) and ABCC (multidrug resistance-associated protein). We identified orthologs of V. vinifera putative ABC transporters in different species. This work represents the first complete inventory of ABC transporters in V. vinifera. The identification of Vitis ABC transporters and their comparative analysis with the Arabidopsis counterparts revealed a strong conservation between the 2 species. This inventory could help elucidate the biological and physiological functions of these transporters in V. vinifera. PMID:24244377
Infection of capilloviruses requires subgenomic RNAs whose transcription is controlled by promoter-like sequences conserved among flexiviruses.

PubMed

Komatsu, Ken; Hirata, Hisae; Fukagawa, Takako; Yamaji, Yasuyuki; Okano, Yukari; Ishikawa, Kazuya; Adachi, Tatsushi; Maejima, Kensaku; Hashimoto, Masayoshi; Namba, Shigetou

2012-07-01

The first open-reading frame (ORF) of apple stem grooving virus (ASGV), of the genus Capillovirus, encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP). However, our previous study revealed that ASGV mutants with distinct and discontinuous Rep- and CP-coding regions successfully infect plants, indicating that CP expressed via a subgenomic RNA (sgRNA) is sufficient for viability of the virus. Here we identified a transcription start site of the CP sgRNA and revealed that CP translated from the sgRNA is essential for ASGV infection. We mapped the transcription start sites of both the CP and the movement protein (MP) sgRNAs of ASGV and found a hexanucleotide motif, UUAGGU, conserved upstream from both sgRNA transcription start sites. Mutational analysis of the putative CP initiation codon and of the UUAGGU sequence upstream from the transcription start site of CP sgRNA demonstrated their importance for ASGV accumulation. Our results also demonstrated that potato virus T (PVT), an unassigned species closely related to ASGV, produces two sgRNAs putatively deployed for the CP and MP expression and that the same hexanucleotide motif as found in ASGV is located upstream from the transcription start sites of both sgRNAs. This motif, which constituted putative core elements of the sgRNA promoter, is broadly conserved among viruses in the families Alphaflexiviridae and Betaflexiviridae, suggesting that the gene expression strategy of the viruses in both families has been conserved throughout evolution. Copyright © 2012 Elsevier B.V. All rights reserved.
Identification of an Na(+)-dependent transporter associated with saxitoxin-producing strains of the cyanobacterium Anabaena circinalis.

PubMed

Pomati, Francesco; Burns, Brendan P; Neilan, Brett A

2004-08-01

Blooms of the freshwater cyanobacterium Anabaena circinalis are recognized as an important health risk worldwide due to the production of a range of toxins such as saxitoxin (STX) and its derivatives. In this study we used HIP1 octameric-palindrome repeated-sequence PCR to compare the genomic structure of phylogenetically similar Australian isolates of A. circinalis. STX-producing and nontoxic cyanobacterial strains showed different HIP1 (highly iterated octameric palindrome 1) DNA patterns, and characteristic interrepeat amplicons for each group were identified. Suppression subtractive hybridization (SSH) was performed using HIP1 PCR-generated libraries to further identify toxic-strain-specific genes. An STX-producing strain and a nontoxic strain of A. circinalis were chosen as testers in two distinct experiments. The two categories of SSH putative tester-specific sequences were characterized by different families of encoded proteins that may be representative of the differences in metabolism between STX-producing and nontoxic A. circinalis strains. DNA-microarray hybridization and genomic screening revealed a toxic-strain-specific HIP1 fragment coding for a putative Na(+)-dependent transporter. Analysis of this gene demonstrated analogy to the mrpF gene of Bacillus subtilis, whose encoded protein is involved in Na(+)-specific pH homeostasis. The application of this gene as a molecular probe in laboratory and environmental screening for STX-producing A. circinalis strains was demonstrated. The possible role of this putative Na(+)-dependent transporter in the toxic cyanobacterial phenotype is also discussed, in light of recent physiological studies of STX-producing cyanobacteria.
Complete genome sequence of Enterobacter sp. IIT-BT 08: A potential microbial strain for high rate hydrogen production.

PubMed

Khanna, Namita; Ghosh, Ananta Kumar; Huntemann, Marcel; Deshpande, Shweta; Han, James; Chen, Amy; Kyrpides, Nikos; Mavrommatis, Kostas; Szeto, Ernest; Markowitz, Victor; Ivanova, Natalia; Pagani, Ioanna; Pati, Amrita; Pitluck, Sam; Nolan, Matt; Woyke, Tanja; Teshima, Hazuki; Chertkov, Olga; Daligault, Hajnalka; Davenport, Karen; Gu, Wei; Munk, Christine; Zhang, Xiaojing; Bruce, David; Detter, Chris; Xu, Yan; Quintana, Beverly; Reitenga, Krista; Kunde, Yulia; Green, Lance; Erkkila, Tracy; Han, Cliff; Brambilla, Evelyne-Marie; Lang, Elke; Klenk, Hans-Peter; Goodwin, Lynne; Chain, Patrick; Das, Debabrata

2013-12-20

Enterobacter sp. IIT-BT 08 belongs to Phylum: Proteobacteria, Class: Gammaproteobacteria, Order: Enterobacteriales, Family: Enterobacteriaceae. The organism was isolated from the leaves of a local plant near the Kharagpur railway station, Kharagpur, West Bengal, India. It has been extensively studied for fermentative hydrogen production because of its high hydrogen yield. For further enhancement of hydrogen production by strain development, complete genome sequence analysis was carried out. Sequence analysis revealed that the genome was linear, 4.67 Mbp long and had a GC content of 56.01%. The genome properties encode 4,393 protein-coding and 179 RNA genes. Additionally, a putative pathway of hydrogen production was suggested based on the presence of formate hydrogen lyase complex and other related genes identified in the genome. Thus, in the present study we describe the specific properties of the organism and the generation, annotation and analysis of its genome sequence as well as discuss the putative pathway of hydrogen production by this organism.
Extensive in silico analysis of Mimivirus coded Rab GTPase homolog suggests a possible role in virion membrane biogenesis.

PubMed

Zade, Amrutraj; Sengupta, Malavi; Kondabagil, Kiran

2015-01-01

Rab GTPases are the key regulators of intracellular membrane trafficking in eukaryotes. Many viruses and intracellular bacterial pathogens have evolved to hijack the host Rab GTPase functions, mainly through activators and effector proteins, for their benefit. Acanthamoeba polyphaga mimivirus (APMV) is one of the largest viruses and belongs to the monophyletic clade of nucleo-cytoplasmic large DNA viruses (NCLDV). The inner membrane lining is integral to the APMV virion structure. APMV assembly involves extensive host membrane modifications, like vesicle budding and fusion, leading to the formation of a membrane sheet that is incorporated into the virion. Intriguingly, APMV and all group I members of the Mimiviridae family code for a putative Rab GTPase protein. APMV is the first reported virus to code for a Rab GTPase (encoded by R214 gene). Our thorough in silico analysis of the subfamily specific (SF) region of Mimiviridae Rab GTPase sequences suggests that they are related to Rab5, a member of the group II Rab GTPases, of lower eukaryotes. Because of their high divergence from the existing three isoforms, A, B, and C of the Rab5-family, we suggest that Mimiviridae Rabs constitute a new isoform, Rab5D. Phylogenetic analysis indicated probable horizontal acquisition from a lower eukaryotic ancestor followed by selection and divergence. Furthermore, interaction network analysis suggests that vps34 (a Class III PI3K homolog, coded by APMV L615), Atg-8 and dynamin (host proteins) are recruited by APMV Rab GTPase during capsid assembly. Based on these observations, we hypothesize that APMV Rab plays a role in the acquisition of inner membrane during virion assembly.
Characterization of the Aspergillus nidulans aspnd1 gene demonstrates that the ASPND1 antigen, which it encodes, and several Aspergillus fumigatus immunodominant antigens belong to the same family.

PubMed Central

Calera, J A; Ovejero, M C; López-Medrano, R; Segurado, M; Puente, P; Leal, F

1997-01-01

For the first time, an immunodominant Aspergillus nidulans antigen (ASPND1) consistently reactive with serum samples from aspergilloma patients has been purified and characterized, and its coding gene (aspnd1) has been cloned and sequenced. ASPND1 is a glycoprotein with four N-glycosidically-bound sugar chains (around 2.1 kDa each) which are not necessary for reactivity with immune human sera. The polypeptide part is synthesized as a 277-amino-acid precursor of 30.6 kDa that after cleavage of a putative signal peptide of 16 amino acids, affords a mature protein of 261 amino acids with a molecular mass of 29 kDa and a pI of 4.24 (as deduced from the sequence). The ASPND1 protein is 53.1% identical to the AspfII allergen from Aspergillus fumigatus and 48% identical to an unpublished Candida albicans antigen. All of the cysteine residues and most of the glycosylation sites are perfectly conserved in the three proteins, suggesting a similar but yet unknown function. Analysis of the primary structure of the ASPND1 coding gene (aspnd1) has allowed the establishment of a clear relationship between several previously reported A. fumigatus and A. nidulans immunodominant antigens. PMID:9119471
Redox changes accompanying storage protein mobilization in moist chilled and warm incubated walnut kernels prior to germination.

PubMed

Shahmoradi, Zeynab; Tamaskani, Fatemeh; Sadeghipour, Hamid Reza; Abdolzadeh, Ahmad

2013-01-01

Alterations in the redox state of storage proteins and the associated proteolytic processes were investigated in moist-chilled and warm-incubated walnut (Juglans regia L.) kernels prior to germination. The kernel total protein labeling with a thiol-specific fluorochrome i.e. monobromobimane (mBBr) revealed more reduction of 29-32 kDa putative glutelins, while in the soluble proteins, both putative glutelins and 41, 55 and 58 kDa globulins contained reduced disulfide bonds during mobilization. Thus, the in vivo more reduced disulfide bonds of storage proteins corresponds to greater solubility. After the in vitro reduction of walnut kernel proteins pre-treated by N-ethyl maleimide (NEM) with dithioerythrethiol (DTT) and bacterial thioredoxin, the 58 kDa putative globulin and a 6 kDa putative albumin were identified as disulfide proteins. Thioredoxin stimulated the reduction of the H(2)O(2)-oxidized 6 kDa polypeptide, but not the 58 kDa polypeptide by DTT. The solubility of 6 kDa putative albumin, 58 and 19-24 kDa putative globulins and glutelins, respectively, were increased by DTT. The in vitro specific mobilization of the 58 kDa polypeptide that occurred at pH 5.0 by the kernel endogenous protease was sensitive to the serine-protease inhibitor phenylmethylsulfonyl fluoride (PMSF) and stimulated by DTT. The specific degradation of the 58 kDa polypeptide might be achieved through thioredoxin-mediated activation of a serine protease and/or reductive unfolding of its 58 kDa polypeptide substrate. As redox changes in storage proteins occurred equally in both moist chilled and warm incubated walnut kernels, the regulatory functions of thioredoxins in promoting seed germination may be due to other germination related processes. Copyright © 2012 Elsevier GmbH. All rights reserved.
Comparison of the protein-coding genomes of three deep-sea, sulfur-oxidising bacteria: "Candidatus Ruthia magnifica", "Candidatus Vesicomyosocius okutanii" and Thiomicrospira crunogena.

PubMed

McGill, Susan E; Barker, Daniel

2017-07-20

" Candidatus Ruthia magnifica", "Candidatus Vesicomyosocius okutanii" and Thiomicrospira crunogena are all sulfur-oxidising bacteria found in deep-sea vent environments. Recent research suggests that the two symbiotic organisms, "Candidatus R. magnifica" and "Candidatus V. okutanii", may share common ancestry with the autonomously living species T. crunogena. We used comparative genomics to examine the genome-wide protein-coding content of all three species to explore their similarities. In particular, we used the OrthoMCL algorithm to sort proteins into groups of putative orthologs on the basis of sequence similarity. The OrthoMCL inflation parameter was tuned using biological criteria. Using the tuned value, OrthoMCL delimited 1070 protein groups. 63.5% of these groups contained one protein from each species. Two groups contained duplicate protein copies from all three species. 123 groups were unique to T. crunogena and ten groups included multiple copies of T. crunogena proteins but only single copies from the other species. "Candidatus R. magnifica" had one unique group, and had multiple copies in one group where the other species had a single copy. There were no groups unique to "Candidatus V. okutanii", and no groups in which there were multiple "Candidatus V. okutanii" proteins but only single proteins from the other species. Results align with previous suggestions that all three species share a common ancestor. However this is not definitive evidence to make taxonomic conclusions and the possibility of horizontal gene transfer was not investigated. Methodologically, the tuning of the OrthoMCL inflation parameter using biological criteria provides further methods to refine the OrthoMCL procedure.
Repression of YdaS Toxin Is Mediated by Transcriptional Repressor RacR in the Cryptic rac Prophage of Escherichia coli K-12.

PubMed

Krishnamurthi, Revathy; Ghosh, Swagatha; Khedkar, Supriya; Seshasayee, Aswin Sai Narain

2017-01-01

Horizontal gene transfer is a major driving force behind the genomic diversity seen in prokaryotes. The cryptic rac prophage in Escherichia coli K-12 carries the gene for a putative transcription factor RacR, whose deletion is lethal. We have shown that the essentiality of racR in E. coli K-12 is attributed to its role in transcriptionally repressing toxin gene(s) called ydaS and ydaT , which are adjacent to and coded divergently to racR . IMPORTANCE Transcription factors in the bacterium E. coli are rarely essential, and when they are essential, they are largely toxin-antitoxin systems. While studying transcription factors encoded in horizontally acquired regions in E. coli , we realized that the protein RacR, a putative transcription factor encoded by a gene on the rac prophage, is an essential protein. Here, using genetics, biochemistry, and bioinformatics, we show that its essentiality derives from its role as a transcriptional repressor of the ydaS and ydaT genes, whose products are toxic to the cell. Unlike type II toxin-antitoxin systems in which transcriptional regulation involves complexes of the toxin and antitoxin, repression by RacR is sufficient to keep ydaS transcriptionally silent.
Garvicin A, a Novel Class IId Bacteriocin from Lactococcus garvieae That Inhibits Septum Formation in L. garvieae Strains

PubMed Central

Cárdenas, Nivia; Martínez, Beatriz; Ruiz-Barba, José Luis; Fernández-Garayzábal, José F.; Rodríguez, Juan M.; Gibello, Alicia

2013-01-01

Lactococcus garvieae 21881, isolated in a human clinical case, produces a novel class IId bacteriocin, garvicin A (GarA), which is specifically active against other L. garvieae strains, including fish- and bovine-pathogenic isolates. Purification from active supernatants, sequence analyses, and plasmid-curing experiments identified pGL5, one of the five plasmids found in L. garvieae [M. Aguado-Urda et al., PLoS One 7(6):e40119, 2012], as the coding plasmid for the structural gene of GarA (lgnA), its putative immunity protein (lgnI), and the ABC transporter and its accessory protein (lgnC and lgnD). Interestingly, pGL5-cured strains were still resistant to GarA. Other putative bacteriocins encoded by the remaining plasmids were not detected during purification, pointing to GarA as the main inhibitor secreted by L. garvieae 21881. Mode-of-action studies revealed a potent bactericidal activity of GarA. Moreover, transmission microscopy showed that GarA seems to act by inhibiting septum formation in L. garvieae cells. This potent and species-specific inhibition by GarA holds promise for applications in the prevention or treatment of infections caused by pathogenic strains of L. garvieae in both veterinary and clinical settings. PMID:23666326
Regulation of the alpha-glucuronidase-encoding gene ( aguA) from Aspergillus niger.

PubMed

de Vries, R P; van de Vondervoort, P J I; Hendriks, L; van de Belt, M; Visser, J

2002-09-01

The alpha-glucuronidase gene aguA from Aspergillus niger was cloned and characterised. Analysis of the promoter region of aguA revealed the presence of four putative binding sites for the major carbon catabolite repressor protein CREA and one putative binding site for the transcriptional activator XLNR. In addition, a sequence motif was detected which differed only in the last nucleotide from the XLNR consensus site. A construct in which part of the aguA coding region was deleted still resulted in production of a stable mRNA upon transformation of A. niger. The putative XLNR binding sites and two of the putative CREA binding sites were mutated individually in this construct and the effects on expression were examined in A. niger transformants. Northern analysis of the transformants revealed that the consensus XLNR site is not actually functional in the aguA promoter, whereas the sequence that diverges from the consensus at a single position is functional. This indicates that XLNR is also able to bind to the sequence GGCTAG, and the XLNR binding site consensus should therefore be changed to GGCTAR. Both CREA sites are functional, indicating that CREA has a strong influence on aguA expression. A detailed expression analysis of aguA in four genetic backgrounds revealed a second regulatory system involved in activation of aguA gene expression. This system responds to the presence of glucuronic and galacturonic acids, and is not dependent on XLNR.
A New Set of ESTs from Chickpea (Cicer arietinum L.) Embryo Reveals Two Novel F-Box Genes, CarF-box_PP2 and CarF-box_LysM, with Potential Roles in Seed Development

PubMed Central

Gupta, Shefali; Garg, Vanika; Bhatia, Sabhyata

2015-01-01

Considering the economic importance of chickpea (C. arietinum L.) seeds, it is important to understand the mechanisms underlying seed development for which a cDNA library was constructed from 6 day old chickpea embryos. A total of 8,186 ESTs were obtained from which 4,048 high quality ESTs were assembled into 1,480 unigenes that majorly encoded genes involved in various metabolic and regulatory pathways. Of these, 95 ESTs were found to be involved in ubiquitination related protein degradation pathways and 12 ESTs coded specifically for putative F-box proteins. Differential transcript accumulation of these putative F-box genes was observed in chickpea tissues as evidenced by quantitative real-time PCR. Further, to explore the role of F-box proteins in chickpea seed development, two F-box genes were selected for molecular characterization. These were named as CarF-box_PP2 and CarF-box_LysM depending on their C-terminal domains, PP2 and LysM, respectively. Their highly conserved structures led us to predict their target substrates. Subcellular localization experiment revealed that CarF-box_PP2 was localized in the cytoplasm and CarF-box_LysM was localized in the nucleus. We demonstrated their physical interactions with SKP1 protein, which validated that they function as F-box proteins in the formation of SCF complexes. Sequence analysis of their promoter regions revealed certain seed specific cis-acting elements that may be regulating their preferential transcript accumulation in the seed. Overall, the study helped in expanding the EST database of chickpea, which was further used to identify two novel F-box genes having a potential role in seed development. PMID:25803812
Discrimination of Pathogenic vs. Nonpathogenic Francisella tularensis and Burkholderia pseudomallei Using Proteomics Mass Spectrometry

DTIC Science & Technology

2011-03-01

GroEL AhpC/TSA family protein hypothetical protein FTL0617 heat shock protein DnaK succinyl-CoA synthetase subunit beta hypothetical protein...lipoprotein chaperonin GroEL co-chaperonin GroES DNA-directed RNA polymerase subunit beta intracellular growth locus, subunit C 3.2 Differentiation...thailandensis E264 Unique Proteins Whole Cell Lysates OMPs putative lipoprotein glucan 1,4-a-glucosidase glycosy hydrolase family protein putative
A Deeper Examination of Thorellius atrox Scorpion Venom Components with Omic Techonologies

PubMed Central

Romero-Gutierrez, Teresa; Batista, Cesar V. F.

2017-01-01

This communication reports a further examination of venom gland transcripts and venom composition of the Mexican scorpion Thorellius atrox using RNA-seq and tandem mass spectrometry. The RNA-seq, which was performed with the Illumina protocol, yielded more than 20,000 assembled transcripts. Following a database search and annotation strategy, 160 transcripts were identified, potentially coding for venom components. A novel sequence was identified that potentially codes for a peptide with similarity to spider ω-agatoxins, which act on voltage-gated calcium channels, not known before to exist in scorpion venoms. Analogous transcripts were found in other scorpion species. They could represent members of a new scorpion toxin family, here named omegascorpins. The mass fingerprint by LC-MS identified 135 individual venom components, five of which matched with the theoretical masses of putative peptides translated from the transcriptome. The LC-MS/MS de novo sequencing allowed to reconstruct and identify 42 proteins encoded by assembled transcripts, thus validating the transcriptome analysis. Earlier studies conducted with this scorpion venom permitted the identification of only twenty putative venom components. The present work performed with more powerful and modern omic technologies demonstrates the capacity of accomplishing a deeper characterization of scorpion venom components and the identification of novel molecules with potential applications in biomedicine and the study of ion channel physiology. PMID:29231872
DOE Office of Scientific and Technical Information (OSTI.GOV)

Villard, L.; Lossi, A.M.; Fontes, M.

We have previously reported the isolation of a gene from Xq13 that codes for a putative regulator of transcription (XNP) and has now been shown to be the gene involved in the X-linked {alpha}-thalassemia with mental retardation (ATR-X) syndrome. The widespread expression and numerous domains present in the putative protein suggest that this gene could be involved in other phenotypes. The predominant expression of the gene in the developing brain, as well as its association with neuron differentiation, indicates that mutations of this gene might result in a mental retardation (MR) phenotype. In this paper we present a family withmore » a splice junction mutation in XNP that results in the skipping of an exon and in the introduction of a stop codon in the middle of the XNP-coding sequence. Only the abnormal transcript is expressed in two first cousins presenting the classic ATR-X phenotype (with {alpha}-thalassemia and HbH inclusions). In a distant cousin presenting a similar dysmorphic MR phenotype but not having thalassemia, {approximately}30% of the XNP transcripts are normal. These data demonstrate that the mode of action of the XNP gene product on globin expression is distinct from its mode of action in brain development and facial morphogenesis and suggest that other dysmorphic mental retardation phenotypes, such as Juberg-Marsidi or some sporadic cases of Coffin-Lowry, could be due to mutations in XNP. 20 refs., 5 figs., 2 tabs.« less
Complete plastid genome of Astragalus mongholicus var. nakaianus (Fabaceae).

PubMed

Choi, In-Su; Kim, Joo-Hwan; Choi, Byoung-Hee

2016-07-01

The first complete plastid genome (plastome) of the largest angiosperm genus, Astragalus, was sequenced for the Korean endangered endemic species A. mongholicus var. nakaianus. Its genome is relatively short (123,633 bp) because it lacks an Inverted Repeat (IR) region. It comprises 110 genes, including four unique rRNAs, 30 tRNAs, and 76 protein-coding genes. Similar to other closely related plastomes, rpl22 and rps16 are absent. The putative pseudogene with abnormal stop codons is atpE. This plastome has no additional inversions when compared with highly variable plastomes from IRLC tribes Fabeae and Trifolieae. Our phylogenetic analysis confirms the non-monophyly of Galegeae.
Localization of the putative precursor of Alzheimer's disease-specific amyloid at nuclear envelopes of adult human muscle.

PubMed Central

Zimmermann, K; Herget, T; Salbaum, J M; Schubert, W; Hilbich, C; Cramer, M; Masters, C L; Multhaup, G; Kang, J; Lemaire, H G

1988-01-01

Cloning and sequence analysis revealed the putative amyloid A4 precursor (pre-A4) of Alzheimer's disease to have characteristics of a membrane-spanning glycoprotein. In addition to brain, pre-A4 mRNA was found in adult human muscle and other tissues. We demonstrate by in situ hybridization that pre-A4 mRNA is present in adult human muscle, in cultured human myoblasts and myotubes. Immunofluorescence with antipeptide antibodies shows the putative pre-A4 protein to be expressed in adult human muscle and associated with some but not all nuclear envelopes. Despite high levels of a single 3.5-kb pre-A4 mRNA species in cultured myoblasts and myotubes, the presence of putative pre-A4 protein could not be detected by immunofluorescence. This suggests that putative pre-A4 protein is stabilized and therefore functioning in the innervated muscle tissue but not in developing, i.e. non-innervated cultured muscle cells. The selective localization of the protein on distinct nuclear envelopes could reflect an interaction with motor endplates. Images PMID:2896589
Identification of Putative Olfactory Genes from the Oriental Fruit Moth Grapholita molesta via an Antennal Transcriptome Analysis

PubMed Central

Li, Yiping; Wu, Junxiang

2015-01-01

Background The oriental fruit moth, Grapholita molesta, is an extremely important oligophagous pest species of stone and pome fruits throughout the world. As a host-switching species, adult moths, especially females, depend on olfactory cues to a large extent in locating host plants, finding mates, and selecting oviposition sites. The identification of olfactory genes can facilitate investigation on mechanisms for chemical communications. Methodology/Principal Finding We generated transcriptome of female antennae of G.molesta using the next-generation sequencing technique, and assembled transcripts from RNA-seq reads using Trinity, SOAPdenovo-trans and Abyss-trans assemblers. We identified 124 putative olfactory genes. Among the identified olfactory genes, 118 were novel to this species, including 28 transcripts encoding for odorant binding proteins, 17 chemosensory proteins, 48 odorant receptors, four gustatory receptors, 24 ionotropic receptors, two sensory neuron membrane proteins, and one odor degrading enzyme. The identified genes were further confirmed through semi-quantitative reverse transcription PCR for transcripts coding for 26 OBPs and 17 CSPs. OBP transcripts showed an obvious antenna bias, whereas CSP transcripts were detected in different tissues. Conclusion Antennal transcriptome data derived from the oriental fruit moth constituted an abundant molecular resource for the identification of genes potentially involved in the olfaction process of the species. This study provides a foundation for future research on the molecules involved in olfactory recognition of this insect pest, and in particular, the feasibility of using semiochemicals to control this pest. PMID:26540284
Sialotranscriptomics of Rhipicephalus zambeziensis reveals intricate expression profiles of secretory proteins and suggests tight temporal transcriptional regulation during blood-feeding.

PubMed

de Castro, Minique Hilda; de Klerk, Daniel; Pienaar, Ronel; Rees, D Jasper G; Mans, Ben J

2017-08-10

Ticks secrete a diverse mixture of secretory proteins into the host to evade its immune response and facilitate blood-feeding, making secretory proteins attractive targets for the production of recombinant anti-tick vaccines. The largely neglected tick species, Rhipicephalus zambeziensis, is an efficient vector of Theileria parva in southern Africa but its available sequence information is limited. Next generation sequencing has advanced sequence availability for ticks in recent years and has assisted the characterisation of secretory proteins. This study focused on the de novo assembly and annotation of the salivary gland transcriptome of R. zambeziensis and the temporal expression of secretory protein transcripts in female and male ticks, before the onset of feeding and during early and late feeding. The sialotranscriptome of R. zambeziensis yielded 23,631 transcripts from which 13,584 non-redundant proteins were predicted. Eighty-six percent of these contained a predicted start and stop codon and were estimated to be putatively full-length proteins. A fifth (2569) of the predicted proteins were annotated as putative secretory proteins and explained 52% of the expression in the transcriptome. Expression analyses revealed that 2832 transcripts were differentially expressed among feeding time points and 1209 between the tick sexes. The expression analyses further indicated that 57% of the annotated secretory protein transcripts were differentially expressed. Dynamic expression profiles of secretory protein transcripts were observed during feeding of female ticks. Whereby a number of transcripts were upregulated during early feeding, presumably for feeding site establishment and then during late feeding, 52% of these were downregulated, indicating that transcripts were required at specific feeding stages. This suggested that secretory proteins are under stringent transcriptional regulation that fine-tunes their expression in salivary glands during feeding. No open reading frames were predicted for 7947 transcripts. This class represented 17% of the differentially expressed transcripts, suggesting a potential transcriptional regulatory function of long non-coding RNA in tick blood-feeding. The assembled sialotranscriptome greatly expands the sequence availability of R. zambeziensis, assists in our understanding of the transcription of secretory proteins during blood-feeding and will be a valuable resource for future vaccine candidate selection.
Expression, regulation and functional assessment of the 80 amino acid Small Adipocyte Factor 1 (Smaf1) protein in adipocytes.

PubMed

Ren, Gang; Eskandari, Parisa; Wang, Siqian; Smas, Cynthia M

2016-01-15

The gene for Small Adipocyte Factor 1, Smaf1 (also known as adipogenin, ADIG), encodes a ∼600 base transcript that is highly upregulated during 3T3-L1 in vitro adipogenesis and markedly enriched in adipose tissues. Based on the lack of an obvious open reading frame in the Smaf1 transcript, it is not known if the Smaf1 gene is protein coding or non-coding RNA. Using a peptide from a putative open reading frame of Smaf1 as antigen, we generated antibodies for western analysis. Our studies prove that Smaf1 encodes an adipose-enriched protein which in western blot analysis migrates at ∼10 kDa. Rapid induction of Smaf1 protein occurs during in vitro adipogenesis and its expression in 3T3-L1 adipocytes is positively regulated by insulin and glucose. Moreover, siRNA studies reveal that expression of Smaf1 in adipocytes is wholly dependent on PPARγ. On the other hand, use of siRNA for Smaf1 to nearly abolish its protein expression in adipocytes revealed that Smaf1 does not have a major role in adipocyte triglyceride accumulation, lipolysis or insulin-stimulated pAkt induction. However, immunolocalization studies using HA-tagged Smaf1 reveal enrichment at adipocyte lipid droplets. Together our findings show that Smaf1 is a novel small protein endogenous to adipocytes and that Smaf1 expression is closely tied to PPARγ-mediated signals and the adipocyte phenotype. Copyright © 2015 Elsevier Inc. All rights reserved.

Phylogenetic and comparative gene expression analysis of barley (Hordeum vulgare)WRKY transcription factor family reveals putatively retained functions betweenmonocots and dicots

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mangelsen, Elke; Kilian, Joachim; Berendzen, Kenneth W.

2008-02-01

WRKY proteins belong to the WRKY-GCM1 superfamily of zinc finger transcription factors that have been subject to a large plant-specific diversification. For the cereal crop barley (Hordeum vulgare), three different WRKY proteins have been characterized so far, as regulators in sucrose signaling, in pathogen defense, and in response to cold and drought, respectively. However, their phylogenetic relationship remained unresolved. In this study, we used the available sequence information to identify a minimum number of 45 barley WRKY transcription factor (HvWRKY) genes. According to their structural features the HvWRKY factors were classified into the previously defined polyphyletic WRKY subgroups 1 tomore » 3. Furthermore, we could assign putative orthologs of the HvWRKY proteins in Arabidopsis and rice. While in most cases clades of orthologous proteins were formed within each group or subgroup, other clades were composed of paralogous proteins for the grasses and Arabidopsis only, which is indicative of specific gene radiation events. To gain insight into their putative functions, we examined expression profiles of WRKY genes from publicly available microarray data resources and found group specific expression patterns. While putative orthologs of the HvWRKY transcription factors have been inferred from phylogenetic sequence analysis, we performed a comparative expression analysis of WRKY genes in Arabidopsis and barley. Indeed, highly correlative expression profiles were found between some of the putative orthologs. HvWRKY genes have not only undergone radiation in monocot or dicot species, but exhibit evolutionary traits specific to grasses. HvWRKY proteins exhibited not only sequence similarities between orthologs with Arabidopsis, but also relatedness in their expression patterns. This correlative expression is indicative for a putative conserved function of related WRKY proteins in mono- and dicot species.« less
High quality draft genome sequence of Olivibacter sitiensis type strain (AW-6T), a diphenol degrader with genes involved in the catechol pathway

PubMed Central

Ntougias, Spyridon; Lapidus, Alla; Han, James; Mavromatis, Konstantinos; Pati, Amrita; Chen, Amy; Klenk, Hans-Peter; Woyke, Tanja; Fasseas, Constantinos; Kyrpides, Nikos C.; Zervakis, Georgios I.

2014-01-01

Olivibacter sitiensis Ntougias et al. 2007 is a member of the family Sphingobacteriaceae, phylum Bacteroidetes. Members of the genus Olivibacter are phylogenetically diverse and of significant interest. They occur in diverse habitats, such as rhizosphere and contaminated soils, viscous wastes, composts, biofilter clean-up facilities on contaminated sites and cave environments, and they are involved in the degradation of complex and toxic compounds. Here we describe the features of O. sitiensis AW-6T, together with the permanent-draft genome sequence and annotation. The organism was sequenced under the Genomic Encyclopedia for Bacteria and Archaea (GEBA) project at the DOE Joint Genome Institute and is the first genome sequence of a species within the genus Olivibacter. The genome is 5,053,571 bp long and is comprised of 110 scaffolds with an average GC content of 44.61%. Of the 4,565 genes predicted, 4,501 were protein-coding genes and 64 were RNA genes. Most protein-coding genes (68.52%) were assigned to a putative function. The identification of 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase-coding genes indicates involvement of this organism in the catechol catabolic pathway. In addition, genes encoding for β-1,4-xylanases and β-1,4-xylosidases reveal the xylanolytic action of O. sitiensis. PMID:25197463
Genome wide discovery of long intergenic non-coding RNAs in Diamondback moth (Plutella xylostella) and their expression in insecticide resistant strains

PubMed Central

Etebari, Kayvan; Furlong, Michael J.; Asgari, Sassan

2015-01-01

Long non-coding RNAs (lncRNAs) play important roles in genomic imprinting, cancer, differentiation and regulation of gene expression. Here, we identified 3844 long intergenic ncRNAs (lincRNA) in Plutella xylostella, which is a notorious pest of cruciferous plants that has developed field resistance to all classes of insecticides, including Bacillus thuringiensis (Bt) endotoxins. Further, we found that some of those lincRNAs may potentially serve as precursors for the production of small ncRNAs. We found 280 and 350 lincRNAs that are differentially expressed in Chlorpyrifos and Fipronil resistant larvae. A survey on P. xylostella midgut transcriptome data from Bt-resistant populations revealed 59 altered lincRNA in two resistant strains compared with the susceptible population. We validated the transcript levels of a number of putative lincRNAs in deltamethrin-resistant larvae that were exposed to deltamethrin, which indicated that this group of lincRNAs might be involved in the response to xenobiotics in this insect. To functionally characterize DBM lincRNAs, gene ontology (GO) enrichment of their associated protein-coding genes was extracted and showed over representation of protein, DNA and RNA binding GO terms. The data presented here will facilitate future studies to unravel the function of lincRNAs in insecticide resistance or the response to xenobiotics of eukaryotic cells. PMID:26411386
The Complete Mitochondrial Genome and Novel Gene Arrangement of the Unique-Headed Bug Stenopirates sp. (Hemiptera: Enicocephalidae)

PubMed Central

Li, Hu; Liu, Hui; Shi, Aimin; Štys, Pavel; Zhou, Xuguo; Cai, Wanzhi

2012-01-01

Many of true bugs are important insect pests to cultivated crops and some are important vectors of human diseases, but few cladistic analyses have addressed relationships among the seven infraorders of Heteroptera. The Enicocephalomorpha and Nepomorpha are consider the basal groups of Heteroptera, but the basal-most lineage remains unresolved. Here we report the mitochondrial genome of the unique-headed bug Stenopirates sp., the first mitochondrial genome sequenced from Enicocephalomorpha. The Stenopirates sp. mitochondrial genome is a typical circular DNA molecule of 15, 384 bp in length, and contains 37 genes and a large non-coding fragment. The gene order differs substantially from other known insect mitochondrial genomes, with rearrangements of both tRNA genes and protein-coding genes. The overall AT content (82.5%) of Stenopirates sp. is the highest among all the known heteropteran mitochondrial genomes. The strand bias is consistent with other true bugs with negative GC-skew and positive AT-skew for the J-strand. The heteropteran mitochondrial atp8 exhibits the highest evolutionary rate, whereas cox1 appears to have the lowest rate. Furthermore, a negative correlation was observed between the variation of nucleotide substitutions and the GC content of each protein-coding gene. A microsatellite was identified in the putative control region. Finally, phylogenetic reconstruction suggests that Enicocephalomorpha is the sister group to all the remaining Heteroptera. PMID:22235294
Horizontal gene acquisitions contributed to genome expansion in insect-symbiotic Spiroplasma clarkii.

PubMed

Tsai, Yi-Ming; Chang, An; Kuo, Chih-Horng

2018-06-01

Genome reduction is a recurring theme of symbiont evolution. The genus Spiroplasma contains species that are mostly facultative insect symbionts. The typical genome sizes of those species within the Apis clade were estimated to be ∼1.0-1.4 Mb. Intriguingly, Spiroplasma clarkii was found to have a genome size that is > 30% larger than the median of other species within the same clade. To investigate the molecular evolution events that led to the genome expansion of this bacterium, we determined its complete genome sequence and inferred the evolutionary origin of each protein-coding gene based on the phylogenetic distribution of homologs. Among the 1,346 annotated protein-coding genes, 641 were originated from within the Apis clade while 233 were putatively acquired from outside of the clade (including 91 high-confidence candidates). Additionally, 472 were specific to S. clarkii without homologs in the current database (i.e., the origins remained unknown). The acquisition of protein-coding genes, rather than mobile genetic elements, appeared to be a major contributing factor of genome expansion. Notably, >50% of the high-confidence acquired genes are related to carbohydrate transport and metabolism, suggesting that these acquired genes contributed to the expansion of both genome size and metabolic capability. The findings of this work provided an interesting case against the general evolutionary trend observed among symbiotic bacteria and further demonstrated the flexibility of Spiroplasma genomes. For future studies, investigation on the functional integration of these acquired genes, as well as the inference of their contribution to fitness could improve our knowledge of symbiont evolution.
Isolation and characterization of the promoter sequence of a cassava gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in storage roots.

PubMed

de Souza, C R; Aragão, F J; Moreira, E C O; Costa, C N M; Nascimento, S B; Carvalho, L J

2009-03-24

Cassava is one of the most important tropical food crops for more than 600 million people worldwide. Transgenic technologies can be useful for increasing its nutritional value and its resistance to viral diseases and insect pests. However, tissue-specific promoters that guarantee correct expression of transgenes would be necessary. We used inverse polymerase chain reaction to isolate a promoter sequence of the Mec1 gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in cassava storage roots. In silico analysis revealed putative cis-acting regulatory elements within this promoter sequence, including root-specific elements that may be required for its expression in vascular tissues. Transient expression experiments showed that the Mec1 promoter is functional, since this sequence was able to drive GUS expression in bean embryonic axes. Results from our computational analysis can serve as a guide for functional experiments to identify regions with tissue-specific Mec1 promoter activity. The DNA sequence that we identified is a new promoter that could be a candidate for genetic engineering of cassava roots.
Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism

PubMed Central

Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

2016-01-01

Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB-degradation, raised 4-mercaptobutyric acid (4MB). 4MB could act as building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2-genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not recognized in the proteome analyses. But, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. As response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced protein (RERY_02670). Accordingly, novel insights in the catabolic pathway of DTDB were gained. PMID:27977722
Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism.

PubMed

Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

2016-01-01

Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB-degradation, raised 4-mercaptobutyric acid (4MB). 4MB could act as building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2-genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not recognized in the proteome analyses. But, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. As response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced protein (RERY_02670). Accordingly, novel insights in the catabolic pathway of DTDB were gained.
Functional expression and characterization of recombinant NADPH-P450 reductase from Malassezia globosa.

PubMed

Lee, Hwayoun; Park, Hyoung-Goo; Lim, Young-Ran; Lee, Im-Soon; Kim, Beom Joon; Seong, Cheul-Hun; Chun, Young-Jin; Kim, Donghak

2012-01-01

Malassezia globosa is a common pathogenic fungus that causes skin diseases including dandruff and seborrheic dermatitis in humans. Analysis of its genome identified a gene (MGL_1677) coding for a putative NADPH-P450 reductase (NPR) to support the fungal cytochrome P450 enzymes. The heterologously expressed recombinant M. globosa NPR protein was purified, and its functional features were characterized. The purified protein generated a single band on SDS-PAGE at 80.74 kDa and had an absorption maximum at 452 nm, indicating its possible function as an oxidized flavin cofactor. It evidenced NADPH-dependent reducing activity for cytochrome c or nitroblue tetrazolium. Human P450 1A2 and 2A6 were able to successfully catalyze the O-deethylation of 7- ethoxyresorufin and the 7-hydroxylation of coumarin, respectively, with the support of the purified NPR. These results demonstrate that purified NPR is an orthologous reductase protein that supports cytochrome P450 enzymes in M. globosa.
A Case of Beta-propeller Protein-associated Neurodegeneration due to a Heterozygous Deletion of WDR45.

PubMed

Hermann, Andreas; Kitzler, Hagen H; Pollack, Tobias; Biskup, Saskia; Krüger, Stefanie; Funke, Claudia; Terrile, Caterina; Haack, Tobias B

2017-01-01

Static encephalopathy of childhood with neurodegeneration in adulthood is a phenotypically distinctive, X-linked dominant subtype of neurodegeneration with brain iron accumulation (NBIA). WDR45 mutations were recently identified as causal. WDR45 encodes a beta-propeller scaffold protein with a putative role in autophagy, and the disease has been renamed beta-propeller protein-associated neurodegeneration (BPAN). Here we describe a female patient suffering from a classical BPAN phenotype due to a novel heterozygous deletion of WDR45 . An initial gene panel and Sanger sequencing approach failed to uncover the molecular defect. Based on the typical clinical and neuroimaging phenotype, quantitative polymerase chain reaction of the WDR45 coding regions was undertaken, and this showed a reduction of the gene dosage by 50% compared with controls. An extended search for deletions should be performed in apparently WDR45- negative cases presenting with features of NBIA and should also be considered in young patients with predominant intellectual disabilities and hypertonia/parkinsonism/dystonia.
Comparative analyses of putative toxin gene homologs from an Old World viper, Daboia russelii

PubMed Central

Krishnan, Neeraja M.

2017-01-01

Availability of snake genome sequences has opened up exciting areas of research on comparative genomics and gene diversity. One of the challenges in studying snake genomes is the acquisition of biological material from live animals, especially from the venomous ones, making the process cumbersome and time-consuming. Here, we report comparative sequence analyses of putative toxin gene homologs from Russell’s viper (Daboia russelii) using whole-genome sequencing data obtained from shed skin. When compared with the major venom proteins in Russell’s viper studied previously, we found 45–100% sequence similarity between the venom proteins and their putative homologs in the skin. Additionally, comparative analyses of 20 putative toxin gene family homologs provided evidence of unique sequence motifs in nerve growth factor (NGF), platelet derived growth factor (PDGF), Kunitz/Bovine pancreatic trypsin inhibitor (Kunitz BPTI), cysteine-rich secretory proteins, antigen 5, andpathogenesis-related1 proteins (CAP) and cysteine-rich secretory protein (CRISP). In those derived proteins, we identified V11 and T35 in the NGF domain; F23 and A29 in the PDGF domain; N69, K2 and A5 in the CAP domain; and Q17 in the CRISP domain to be responsible for differences in the largest pockets across the protein domain structures in crotalines, viperines and elapids from the in silico structure-based analysis. Similarly, residues F10, Y11 and E20 appear to play an important role in the protein structures across the kunitz protein domain of viperids and elapids. Our study highlights the usefulness of shed skin in obtaining good quality high-molecular weight DNA for comparative genomic studies, and provides evidence towards the unique features and evolution of putative venom gene homologs in vipers. PMID:29230357
Bioinformatic Analysis Reveals Archaeal tRNATyr and tRNATrp Identities in Bacteria

PubMed Central

Mukai, Takahito; Reynolds, Noah M.; Crnković, Ana; Söll, Dieter

2017-01-01

The tRNA identity elements for some amino acids are distinct between the bacterial and archaeal domains. Searching in recent genomic and metagenomic sequence data, we found some candidate phyla radiation (CPR) bacteria with archaeal tRNA identity for Tyr-tRNA and Trp-tRNA synthesis. These bacteria possess genes for tyrosyl-tRNA synthetase (TyrRS) and tryptophanyl-tRNA synthetase (TrpRS) predicted to be derived from DPANN superphylum archaea, while the cognate tRNATyr and tRNATrp genes reveal bacterial or archaeal origins. We identified a trace of domain fusion and swapping in the archaeal-type TyrRS gene of a bacterial lineage, suggesting that CPR bacteria may have used this mechanism to create diverse proteins. Archaeal-type TrpRS of bacteria and a few TrpRS species of DPANN archaea represent a new phylogenetic clade (named TrpRS-A). The TrpRS-A open reading frames (ORFs) are always associated with another ORF (named ORF1) encoding an unknown protein without global sequence identity to any known protein. However, our protein structure prediction identified a putative HIGH-motif and KMSKS-motif as well as many α-helices that are characteristic of class I aminoacyl-tRNA synthetase (aaRS) homologs. These results provide another example of the diversity of molecular components that implement the genetic code and provide a clue to the early evolution of life and the genetic code. PMID:28230768
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production.

PubMed

Roth, Melissa S; Cokus, Shawn J; Gallaher, Sean D; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A; Merchant, Sabeeha S; Pellegrini, Matteo; Niyogi, Krishna K

2017-05-23

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis , because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase ( BKT ), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE PAGES

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; ...

2017-05-08

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production

PubMed Central

Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A.; Merchant, Sabeeha S.; Pellegrini, Matteo

2017-01-01

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production. PMID:28484037
Pseudo-polyprotein translated from the full-length ORF1 of capillovirus is important for pathogenicity, but a truncated ORF1 protein without variable and CP regions is sufficient for replication.

PubMed

Hirata, Hisae; Yamaji, Yasuyuki; Komatsu, Ken; Kagiwada, Satoshi; Oshima, Kenro; Okano, Yukari; Takahashi, Shuichiro; Ugaki, Masashi; Namba, Shigetou

2010-09-01

The first open-reading frame (ORF) of the genus Capillovirus encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP), while other viruses in the family Flexiviridae have separate ORFs encoding these proteins. To investigate the role of the full-length ORF1 polyprotein of capillovirus, we generated truncation mutants of ORF1 of apple stem grooving virus by inserting a termination codon into the variable region located between the putative Rep- and CP-coding regions. These mutants were capable of systemic infection, although their pathogenicity was attenuated. In vitro translation of ORF1 produced both the full-length polyprotein and the smaller Rep protein. The results of in vivo reporter assays suggested that the mechanism of this early termination is a ribosomal -1 frame-shift occurring downstream from the conserved Rep domains. The mechanism of capillovirus gene expression and the very close evolutionary relationship between the genera Capillovirus and Trichovirus are discussed. Copyright (c) 2010. Published by Elsevier B.V.
'Candidatus Phytoplasma phoenicium' associated with almond witches'-broom disease: from draft genome to genetic diversity among strain populations.

PubMed

Quaglino, Fabio; Kube, Michael; Jawhari, Maan; Abou-Jawdah, Yusuf; Siewert, Christin; Choueiri, Elia; Sobh, Hana; Casati, Paola; Tedeschi, Rosemarie; Lova, Marina Molino; Alma, Alberto; Bianco, Piero Attilio

2015-07-30

Almond witches'-broom (AlmWB), a devastating disease of almond, peach and nectarine in Lebanon, is associated with 'Candidatus Phytoplasma phoenicium'. In the present study, we generated a draft genome sequence of 'Ca. P. phoenicium' strain SA213, representative of phytoplasma strain populations from different host plants, and determined the genetic diversity among phytoplasma strain populations by phylogenetic analyses of 16S rRNA, groEL, tufB and inmp gene sequences. Sequence-based typing and phylogenetic analysis of the gene inmp, coding an integral membrane protein, distinguished AlmWB-associated phytoplasma strains originating from diverse host plants, whereas their 16S rRNA, tufB and groEL genes shared 100 % sequence identity. Moreover, dN/dS analysis indicated positive selection acting on inmp gene. Additionally, the analysis of 'Ca. P. phoenicium' draft genome revealed the presence of integral membrane proteins and effector-like proteins and potential candidates for interaction with hosts. One of the integral membrane proteins was predicted as BI-1, an inhibitor of apoptosis-promoting Bax factor. Bioinformatics analyses revealed the presence of putative BI-1 in draft and complete genomes of other 'Ca. Phytoplasma' species. The genetic diversity within 'Ca. P. phoenicium' strain populations in Lebanon suggested that AlmWB disease could be associated with phytoplasma strains derived from the adaptation of an original strain to diverse hosts. Moreover, the identification of a putative inhibitor of apoptosis-promoting Bax factor (BI-1) in 'Ca. P. phoenicium' draft genome and within genomes of other 'Ca. Phytoplasma' species suggested its potential role as a phytoplasma fitness-increasing factor by modification of the host-defense response.
Blunt Snout Bream (Megalobrama amblycephala) MyD88 and TRAF6: Characterisation, Comparative Homology Modelling and Expression

PubMed Central

Tran, Ngoc Tuan; Liu, Han; Jakovlić, Ivan; Wang, Wei-Min

2015-01-01

MyD88 and TRAF6 play an essential role in the innate immune response in most animals. This study reports the full-length MaMyD88 and MaTRAF6 genes identified from the blunt snout bream (Megalobrama amblycephala) transcriptome profile. MaMyD88 is 2501 base pairs (bp) long, encoding a putative protein of 284 amino acids (aa), including the N-terminal DEATH domain of 78 aa and the C-terminal TIR domain of 138 aa. MaTRAF6 is 2252 bp long, encoding a putative protein of 542 aa, including the N-terminal low-complexity region, RING domain (40 aa), a coiled-coil region (64 aa) and C-terminal MATH domain (147 aa). Coding regions of MaMyD88 and MaTRAF6 genomic sequences consisted of five and six exons, respectively. Physicochemical and functional characteristics of the proteins were analysed. Alpha helices were dominant in the secondary structure of the proteins. Homology models of the MaMyD88 and MaTRAF6 domains were constructed applying the comparative modelling method. RT-qPCR was used to analyse the expression of MaMyD88 and MaTRAF6 mRNA transcripts in response to Aeromonas hydrophila challenge. Both genes were highly upregulated in the liver, spleen and kidney during the first 24 h after the challenge. While MyD88 and TRAF6 have been reported in various aquatic species, this is the first report and characterisation of these genes in blunt snout bream. This research also provides evidence of the important roles of these two genes in the blunt snout bream innate immune system. PMID:25830478
Fluconazole Resistance Associated with Drug Efflux and Increased Transcription of a Drug Transporter Gene, PDH1, in Candida glabrata

PubMed Central

Miyazaki, Haruko; Miyazaki, Yoshitsugu; Geber, Antonia; Parkinson, Tanya; Hitchcock, Christopher; Falconer, Derek J.; Ward, Douglas J.; Marsden, Katherine; Bennett, John E.

1998-01-01

Sequential Candida glabrata isolates were obtained from the mouth of a patient infected with human immunodeficiency virus type 1 who was receiving high doses of fluconazole for oropharyngeal thrush. Fluconazole-susceptible colonies were replaced by resistant colonies that exhibited both increased fluconazole efflux and increased transcripts of a gene which codes for a protein with 72.5% identity to Pdr5p, an ABC multidrug transporter in Saccharomyces cerevisiae. The deduced protein had a molecular mass of 175 kDa and was composed of two homologous halves, each with six putative transmembrane domains and highly conserved sequences of ATP-binding domains. When the earliest and most azole-susceptible isolate of C. glabrata from this patient was exposed to fluconazole, increased transcripts of the PDR5 homolog appeared, linking azole exposure to regulation of this gene. PMID:9661006

A comparison of complete mitochondrial genomes of silver carp hypophthalmichthys molitrix and bighead carp hypophthalmichthys nobilis: Implications for their taxonomic relationship and phylogeny

USGS Publications Warehouse

Li, S.-F.; Xu, J.-W.; Yang, Q.-L.; Wang, C.H.; Chen, Q.; Chapman, D.C.; Lu, G.

2009-01-01

Based upon morphological characters, Silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis (or Aristichthys nobilis) have been classified into either the same genus or two distinct genera. Consequently, the taxonomic relationship of the two species at the generic level remains equivocal. This issue is addressed by sequencing complete mitochondrial genomes of H. molitrix and H. nobilis, comparing their mitogenome organization, structure and sequence similarity, and conducting a comprehensive phylogenetic analysis of cyprinid species. As with other cyprinid fishes, the mitogenomes of the two species were structurally conserved, containing 37 genes including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA (tRNAs) genes and a putative control region (D-loop). Sequence similarity between the two mitogenomes varied in different genes or regions, being highest in the tRNA genes (98??8%), lowest in the control region (89??4%) and intermediate in the protein-coding genes (94??2%). Analyses of the sequence comparison and phylogeny using concatenated protein sequences support the view that the two species belong to the genus Hypophthalmichthys. Further studies using nuclear markers and involving more closely related species, and the systematic combination of traditional biology and molecular biology are needed in order to confirm this conclusion. ?? 2009 The Fisheries Society of the British Isles.
Altered Gene Expression in Three Plant Species in Response to Treatment with Nep1, a Fungal Protein That Causes Necrosis

PubMed Central

Keates, Sarah E.; Kostman, Todd A.; Anderson, James D.; Bailey, Bryan A.

2003-01-01

Nep1 is an extracellular fungal protein that causes necrosis when applied to many dicotyledonous plants, including invasive weed species. Using transmission electron microscopy, it was determined that application of Nep1 (1.0 μg mL–1, 0.1% [v/v] Silwet-L77) to Arabidopsis and two invasive weed species, spotted knapweed (Centaurea maculosa) and dandelion (Taraxacum officinale), caused a reduction in the thickness of the cuticle and a breakdown of chloroplasts 1 to 4 h after treatment. Membrane breakdown was most severe in cells closest to the surface of application. Differential display was used to isolate cDNA clones from the three species showing differential expression in response to Nep1 treatment. Differential gene expression was observed for a putative serpin (CmSER-1) and a calmodulin-like (CmCAL-1) protein from spotted knapweed, and a putative protein phosphatase 2C (ToPP2C-1) and cytochrome P-450 (ToCYP-1) protein from dandelion. In addition, differential expression was observed for genes coding for a putative protein kinase (AtPK-1), a homolog (AtWI-12) of wound-induced WI12, a homolog (AtLEA-1) of late embryogenesis abundant LEA-5, a WRKY-18 DNA-binding protein (AtWRKY-18), and a phospholipase D (AtPLD-1) from Arabidopsis. Genes showing elevated mRNA levels in Nep1-treated (5 μg mL–1, 0.1% [v/v] Silwet-L77) leaves 15 min after Nep1 treatment included CmSER-1 and CmCAL-1 for spotted knapweed, ToCYP-1 and CmCAL-1 for dandelion, and AtPK-1, AtWRKY-18, AtWI-12, and AtLEA-1 for Arabidopsis. Levels of mRNA for AtPLD-1 (Arabidopsis) and ToPP2C-1 (dandelion) decreased rapidly in Silwet-l77-treated plants between 15 min and 4 h of treatment, but were maintained or decreased more slowly over time in Nep1-treated (5 μg mL–1, 0.1% [v/v] Silwet-L77) leaves. In general, increases in mRNA band intensities were in the range of two to five times, with only ToCYP-1 in dandelion exceeding an increase of 10 times. The identified genes have been shown to be involved or are related to gene families that are involved in plant stress responses, including wounding, drought, senescence, and disease resistance. PMID:12857840
Characterization of an AtCCX5 gene from Arabidopsis thaliana that involves in high-affinity K{sup +} uptake and Na{sup +} transport in yeast

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Xinxin; Zhang, Min; Takano, Tetsuo

Highlights: {yields} The AtCCX5 protein coding a putative cation calcium exchanger was characterized. {yields} AtCCX5 expressed in yeast was localized in the plasma membrane and nuclear periphery. {yields} AtCCX5 protein did not show the same transport properties as the CAXs. {yields} AtCCX5 protein involves in mediating high-affinity K{sup +} uptake in yeast. {yields} AtCCX5 protein also involves in Na{sup +} transport in yeast. -- Abstract: The gene for a putative cation calcium exchanger (CCX) from Arabidopsis thaliana, AtCCX5, was cloned and its function was analyzed in yeast. Green fluorescent protein-tagged AtCCX5 expressed in yeast was localized in the plasma membranemore » and nuclear periphery. The yeast transformants expressing AtCCX5 were created and their growth in the presence of various cations (K{sup +}, Na{sup +}, Ca{sup 2+}, Mg{sup 2+}, Fe{sup 2+}, Cu{sup 2+}, Co{sup 2+}, Cd{sup 2+}, Mn{sup 2+}, Ba{sup 2+}, Ni{sup 2+}, Zn{sup 2+}, and Li{sup +}) were analyzed. AtCCX5 expression was found to affect the response to K{sup +} and Na{sup +} in yeast. The AtCCX5 transformant also showed a little better growth to Zn{sup 2+}. The yeast mutant 9.3 expressing AtCCX5 restored growth of the mutant on medium with low K{sup +} (0.5 mM), and also suppressed its Na{sup +} sensitivity. Ion uptake experiments showed that AtCCX5 mediated relatively high-affinity K{sup +} uptake and was also involved in Na{sup +} transport in yeast. Taken together, these findings suggest that the AtCCX5 is a novel transport protein involves in mediating high-affinity K{sup +} uptake and Na{sup +} transport in yeast.« less
Genome-scale metabolic network of Cordyceps militaris useful for comparative analysis of entomopathogenic fungi.

PubMed

Vongsangnak, Wanwipa; Raethong, Nachon; Mujchariyakul, Warasinee; Nguyen, Nam Ninh; Leong, Hon Wai; Laoteng, Kobkul

2017-08-30

The first genome-scale metabolic network of Cordyceps militaris (iWV1170) was constructed representing its whole metabolisms, which consisted of 894 metabolites and 1,267 metabolic reactions across five compartments, including the plasma membrane, cytoplasm, mitochondria, peroxisome and extracellular space. The iWV1170 could be exploited to explain its phenotypes of growth ability, cordycepin and other metabolites production on various substrates. A high number of genes encoding extracellular enzymes for degradation of complex carbohydrates, lipids and proteins were existed in C. militaris genome. By comparative genome-scale analysis, the adenine metabolic pathway towards putative cordycepin biosynthesis was reconstructed, indicating their evolutionary relationships across eleven species of entomopathogenic fungi. The overall metabolic routes involved in the putative cordycepin biosynthesis were also identified in C. militaris, including central carbon metabolism, amino acid metabolism (glycine, l-glutamine and l-aspartate) and nucleotide metabolism (adenosine and adenine). Interestingly, a lack of the sequence coding for ribonucleotide reductase inhibitor was observed in C. militaris that might contribute to its over-production of cordycepin. Copyright © 2017. Published by Elsevier B.V.
Molecular and biochemical characterization of two tungsten- and selenium-containing formate dehydrogenases from Eubacterium acidaminophilum that are associated with components of an iron-only hydrogenase.

PubMed

Graentzdoerffer, Andrea; Rauh, David; Pich, Andreas; Andreesen, Jan R

2003-01-01

Two gene clusters encoding similar formate dehydrogenases (FDH) were identified in Eubacterium acidaminophilum. Each cluster is composed of one gene coding for a catalytic subunit ( fdhA-I, fdhA-II) and one for an electron-transferring subunit ( fdhB-I, fdhB-II). Both fdhA genes contain a TGA codon for selenocysteine incorporation and the encoded proteins harbor five putative iron-sulfur clusters in their N-terminal region. Both FdhB subunits resemble the N-terminal region of FdhA on the amino acid level and contain five putative iron-sulfur clusters. Four genes thought to encode the subunits of an iron-only hydrogenase are located upstream of the FDH gene cluster I. By sequence comparison, HymA and HymB are predicted to contain one and four iron-sulfur clusters, respectively, the latter protein also binding sites for FMN and NAD(P). Thus, HymA and HymB seem to represent electron-transferring subunits, and HymC the putative catalytic subunit containing motifs for four iron-sulfur clusters and one H-cluster specific for Fe-only hydrogenases. HymD has six predicted transmembrane helices and might be an integral membrane protein. Viologen-dependent FDH activity was purified from serine-grown cells of E. acidaminophilum and the purified protein complex contained four subunits, FdhA and FdhB, encoded by FDH gene cluster II, and HymA and HymB, identified after determination of their N-terminal sequences. Thus, this complex might represent the most simple type of a formate hydrogen lyase. The purified formate dehydrogenase fraction contained iron, tungsten, a pterin cofactor, and zinc, but no molybdenum. FDH-II had a two-fold higher K(m) for formate (0.37 mM) than FDH-I and also catalyzed CO(2) reduction to formate. Reverse transcription (RT)-PCR pointed to increased expression of FDH-II in serine-grown cells, supporting the isolation of this FDH isoform. The fdhA-I gene was expressed as inactive protein in Escherichia coli. The in-frame UGA codon for selenocysteine incorporation was read in the heterologous system only as stop codon, although its potential SECIS element exhibited a quite high similarity to that of E. coli FDH.
Molecular characterization of two serine proteases expressed in gut tissue of the African trypanosome vector, Glossina morsitans morsitans.

PubMed

Yan, J; Cheng, Q; Li, C B; Aksoy, S

2001-02-01

Serine proteases are major insect gut enzymes involved in digestion of dietary proteins, and in addition they have been implicated in the process of pathogen establishment in several vector insects. The medically important vector, tsetse fly (Diptera:Glossinidiae), is involved in the transmission of African trypanosomes, which cause devastating diseases in animals and humans. Both the male and female tsetse can transmit trypanosomes and both are strict bloodfeeders throughout all stages of their development. Here, we describe the characterization of two putative serine protease-encoding genes, Glossina serine protease-1 (Gsp1) and Glossina serine protease-2 (Gsp2) from gut tissue. Both putative cDNA products represent prepro peptides with hydrophobic signal peptide sequences associated with their 5'-end terminus. The Gsp1 cDNA encodes a putative mature protein of 245 amino acids with a molecular mass of 26 428 Da, while the predicted size of the 228 amino acid mature peptide encoded by Gsp2 cDNA is 24 573 Da. Both deduced peptides contain the Asp/His/Ser catalytic triad and the conserved residues surrounding it which are characteristic of serine proteases. In addition, both proteins have the six-conserved cysteine residues to form the three-cysteine bonds typically present in invertebrate serine proteases. Based on the presence of substrate specific residues, the Gsp1 gene encodes a chymotrypsin-like protease while Gsp2 gene encodes for a protein with trypsin-like activity. Both proteins are encoded by few loci in tsetse genome, being present in one or two copies only. The mRNA expression levels for the genes do not vary extensively throughout the digestive cycle, and high levels of mRNAs can be readily detected in the gut tissue of newly emerged flies. The levels of trypsin and chymotrypsin activities in the gut lumen increase following blood feeding and change significantly in the gut cells throughout the digestion cycle. Hence, the regulation of expression for trypsin and chymotrypsin occurs at the post-transcriptional level in tsetse. Both the coding sequences and patterns of expression of Gsp1 and Gsp2 genes are similar to the serine proteases that have been reported from the bloodfeeding insect Stomoxys calcitrans.
Draft genome sequence of Trametes villosa (Sw.) Kreisel CCMB561, a tropical white-rot Basidiomycota from the semiarid region of Brazil.

PubMed

Ferreira, Dalila Souza Santos; Kato, Rodrigo Bentes; Miranda, Fábio Malcher; da Costa Pinheiro, Kenny; Fonseca, Paula Luize Camargos; Tomé, Luiz Marcelo Ribeiro; Vaz, Aline Bruna Martins; Badotti, Fernanda; Ramos, Rommel Thiago Jucá; Brenig, Bertram; Azevedo, Vasco Ariston de Carvalho; Benevides, Raquel Guimarães; Góes-Neto, Aristóteles

2018-06-01

Herein, we present the draft genome of Trametes villosa isolate CCMB561, a wood-decaying Basidiomycota commonly found in tropical semiarid climate. The genome assembly was 57.98 Mb in size with an L50 of 691. A total of 16,711 putative protein-encoding genes was predicted, including 590 genes coding for carbohydrate-active enzymes (CAZy), directly involved in the decomposition of lignocellulosic materials. This is the first genome of this species of high interest in bioenergy research. The draft genome of Trametes villosa isolate CCMB561 will provide an important resource for future investigations in biofuel production, bioremediation and other green technologies.
Structure and chromosomal localization of the human PD-1 gene (PDCD1)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shinohara, T.; Ishida, Y.; Kawaichi, M.

1994-10-01

A cDNA encoding mouse PD-1, a member of the immunoglobulin superfamily, was previously isolated from apoptosis-induced cells by subtractive hybridization. To determine the structure and chromosomal location of the human PD-1 gene, we screened a human T cell cDNA library by mouse PD-1 probe and isolated a cDNA coding for the human PD-1 protein. The deduced amino acid sequence of human PD-1 was 60% identical to the mouse counterpart, and a putative tyrosine kinase-association motif was well conserved. The human PD-1 gene was mapped to 2q37.3 by chromosomal in situ hybridization. 7 refs., 3 figs.
Expression and characterization of a new esterase with GCSAG motif from a permafrost metagenomic library.

PubMed

Petrovskaya, Lada E; Novototskaya-Vlasova, Ksenia A; Spirina, Elena V; Durdenko, Ekaterina V; Lomakina, Galina Yu; Zavialova, Maria G; Nikolaev, Evgeny N; Rivkina, Elizaveta M

2016-05-01

As a result of construction and screening of a metagenomic library prepared from a permafrost-derived microcosm, we have isolated a novel gene coding for a putative lipolytic enzyme that belongs to the hormone-sensitive lipase family. It encodes a polypeptide of 343 amino acid residues whose amino acid sequence displays maximum likelihood with uncharacterized proteins from Sphingomonas species. A putative catalytic serine residue of PMGL2 resides in a new variant of a recently discovered GTSAG sequence in which a Thr residue is replaced by a Cys residue (GCSAG). The recombinant PMGL2 was produced in Escherichia coli cells and purified by Ni-affinity chromatography. The resulting protein preferably utilizes short-chain p-nitrophenyl esters (C4 and C8) and therefore is an esterase. It possesses maximum activity at 45°C in slightly alkaline conditions and has limited thermostability at higher temperatures. Activity of PMGL2 is stimulated in the presence of 0.25-1.5 M NaCl indicating the good salt tolerance of the new enzyme. Mass spectrometric analysis demonstrated that N-terminal methionine in PMGL2 is processed and cysteine residues do not form a disulfide bond. The results of the study demonstrate the significance of the permafrost environment as a unique genetic reservoir and its potential for metagenomic exploration. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Structure-based activity prediction of CYP21A2 stability variants: A survey of available gene variations.

PubMed

Bruque, Carlos D; Delea, Marisol; Fernández, Cecilia S; Orza, Juan V; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D; Dain, Liliana

2016-12-14

Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90-95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant's expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient's phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort.
Structure-based activity prediction of CYP21A2 stability variants: A survey of available gene variations

PubMed Central

Bruque, Carlos D.; Delea, Marisol; Fernández, Cecilia S.; Orza, Juan V.; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D.; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D.; Dain, Liliana

2016-01-01

Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90–95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant’s expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient’s phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort. PMID:27966633
Cloning and characterization of an inulinase gene from the marine yeast Candida membranifaciens subsp. flavinogenie W14-3 and its expression in Saccharomyces sp. W0 for ethanol production.

PubMed

Zhang, Lin-Lin; Tan, Mei-Juan; Liu, Guang-Lei; Chi, Zhe; Wang, Guang-Yuan; Chi, Zhen-Ming

2015-04-01

The INU1 gene encoding an exo-inulinase from the marine-derived yeast Candida membranifaciens subsp. flavinogenie W14-3 was cloned and characterized. It had an open reading frame of 1,536 bp long encoding an inulinase. The coding region of it was not interrupted by any intron. The cloned gene encoded 512 amino acid residues of a protein with a putative signal peptide of 23 amino acids and a calculated molecular mass of 57.8 kDa. The protein sequence deduced from the inulinase gene contained the inulinase consensus sequences (WMNDPNGL), (RDP), ECP FS and Q. The protein also had six conserved putative N-glycosylation sites. The deduced inulinase from the yeast strain W14-3 was found to be closely related to that from Candida kutaonensis sp. nov. KRF1, Kluyveromyces marxianus, and Cryptococcus aureus G7a. The inulinase gene with its signal peptide encoding sequence was subcloned into the pMIRSC11 expression vector and expressed in Saccharomyces sp. W0. The recombinant yeast strain W14-3-INU-112 obtained could produce 16.8 U/ml of inulinase activity and 12.5 % (v/v) ethanol from 250 g/l of inulin within 168 h. The monosaccharides were detected after the hydrolysis of inulin with the crude inulinase (the yeast culture). All the results indicated that the cloned gene and the recombinant yeast strain W14-3-INU-112 had potential applications in biotechnology.
Microtubule actin cross-linking factor (MACF): a hybrid of dystonin and dystrophin that can interact with the actin and microtubule cytoskeletons.

PubMed

Leung, C L; Sun, D; Zheng, M; Knowles, D R; Liem, R K

1999-12-13

We cloned and characterized a full-length cDNA of mouse actin cross-linking family 7 (mACF7) by sequential rapid amplification of cDNA ends-PCR. The completed mACF7 cDNA is 17 kb and codes for a 608-kD protein. The closest relative of mACF7 is the Drosophila protein Kakapo, which shares similar architecture with mACF7. mACF7 contains a putative actin-binding domain and a plakin-like domain that are highly homologous to dystonin (BPAG1-n) at its NH(2) terminus. However, unlike dystonin, mACF7 does not contain a coiled-coil rod domain; instead, the rod domain of mACF7 is made up of 23 dystrophin-like spectrin repeats. At its COOH terminus, mACF7 contains two putative EF-hand calcium-binding motifs and a segment homologous to the growth arrest-specific protein, Gas2. In this paper, we demonstrate that the NH(2)-terminal actin-binding domain of mACF7 is functional both in vivo and in vitro. More importantly, we found that the COOH-terminal domain of mACF7 interacts with and stabilizes microtubules. In transfected cells full-length mACF7 can associate not only with actin but also with microtubules. Hence, we suggest a modified name: MACF (microtubule actin cross-linking factor). The properties of MACF are consistent with the observation that mutations in kakapo cause disorganization of microtubules in epidermal muscle attachment cells and some sensory neurons.
Purification and characterization pecan (Carya Illinoinensis) vicilin, a putative food allergen (abstract)

USDA-ARS?s Scientific Manuscript database

The pecan seed storage protein vicilin, a putative food allergen, was recombinantly expressed for and purified by a combination of metal affinity and gel filtration chromatography. The protein was crystallized and studied by crystallography. The obtained crystals belonged to space group P212121 with...
Identification of positive selection in disease response genes within members of the Poaceae.

PubMed

Rech, Gabriel E; Vargas, Walter A; Sukno, Serenella A; Thon, Michael R

2012-12-01

Millions of years of coevolution between plants and pathogens can leave footprints on their genomes and genes involved on this interaction are expected to show patterns of positive selection in which novel, beneficial alleles are rapidly fixed within the population. Using information about upregulated genes in maize during Colletotrichum graminicola infection and resources available in the Phytozome database, we looked for evidence of positive selection in the Poaceae lineage, acting on protein coding sequences related with plant defense. We found six genes with evidence of positive selection and another eight with sites showing episodic selection. Some of them have already been described as evolving under positive selection, but others are reported here for the first time including genes encoding isocitrate lyase, dehydrogenases, a multidrug transporter, a protein containing a putative leucine-rich repeat and other proteins with unknown functions. Mapping positively selected residues onto the predicted 3-D structure of proteins showed that most of them are located on the surface, where proteins are in contact with other molecules. We present here a set of Poaceae genes that are likely to be involved in plant defense mechanisms and have evidence of positive selection. These genes are excellent candidates for future functional validation.
Quantitative protein expression analysis of CLL B cells from mutated and unmutated IgV(H) subgroups using acid-cleavable isotope-coded affinity tag reagents.

PubMed

Barnidge, David R; Jelinek, Diane F; Muddiman, David C; Kay, Neil E

2005-01-01

Relative protein expression levels were compared in leukemic B cells from two patients with chronic lymphocytic leukemia (CLL) having either mutated (M-CLL) or unmutated (UM-CLL) immunoglobulin variable heavy chain genes (IgV(H)). Cells were separated into cytosol and membrane protein fractions then labeled with acid-cleavable ICAT reagents (cICAT). Labeled proteins were digested with trypsin then subjected to SCX and affinity chromatography followed by LC-ESI-MS/MS analysis on a linear ion trap mass spectrometer. A total of 9 proteins from the cytosol fraction and 4 from the membrane fraction showed a 3-fold or greater difference between M-CLL and UM-CLL and a subset of these were examined by Western blot where results concurred with cICAT abundance ratios. The abundance of one of the proteins in particular, the mitochondrial membrane protein cytochrome c oxidase subunit COX G was examined in 6 M-CLL and 6 UM-CLL patients using western blot and results showed significantly greater levels (P < 0.001) in M-CLL patients vs UM-CLL patients. These results demonstrate that stable isotope labeling and mass spectrometry can complement 2D gel electrophoresis and gene microarray technologies for identifying putative and perhaps unique prognostic markers in CLL.
Complex Interplay among DNA Modification, Noncoding RNA Expression and Protein-Coding RNA Expression in Salvia miltiorrhiza Chloroplast Genome

PubMed Central

Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

2014-01-01

Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggest that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box–like motif (CPGDMM1, “TATANNNATNA”), and an unknown motif (CPGDMM2 “WNYANTGAW”). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1motif (p<0.01). Taking together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in chloroplast genome. PMID:24914614
Complex interplay among DNA modification, noncoding RNA expression and protein-coding RNA expression in Salvia miltiorrhiza chloroplast genome.

PubMed

Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

2014-01-01

Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggest that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box-like motif (CPGDMM1, "TATANNNATNA"), and an unknown motif (CPGDMM2 "WNYANTGAW"). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1motif (p<0.01). Taking together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in chloroplast genome.
A long natural-antisense RNA is accumulated in the conidia of Aspergillus oryzae.

PubMed

Tsujii, Masaru; Okuda, Satoshi; Ishi, Kazutomo; Madokoro, Kana; Takeuchi, Michio; Yamagata, Youhei

2016-01-01

Analysis of expressed sequence tag libraries from various culture conditions revealed the existence of conidia-specific transcripts assembled to putative conidiation-specific reductase gene (csrA) in Aspergillus oryzae. However, the all transcripts were transcribed with opposite direction to the gene csrA. The sequence analysis of the transcript revealed that the RNA overlapped mRNA of csrA with 3'-end, and did not code protein longer than 60 amino acid residues. We designated the transcript Conidia Specific Long Natural-antisense RNA (CSLNR). The real-time PCR analysis demonstrated that the CSLNR is conidia-specific transcript, which cannot be transcribed in the absence of brlA, and the amount of CSLNR was much more than that of the transcript from csrA in conidia. Furthermore, the csrA deletion, also lacking coding region of CSLNR in A. oryzae reduced the number of conidia. Overexpression of CsrA demonstrated the inhibition of growth and conidiation, while CSLNR did not affect conidiation.
A Potato cDNA Encoding a Homologue of Mammalian Multidrug Resistant P-Glycoprotein

NASA Technical Reports Server (NTRS)

Wang, W.; Takezawa, D.; Poovaiah, B. W.

1996-01-01

A homologue of the multidrug resistance (MDR) gene was obtained while screening a potato stolon tip cDNA expression library with S-15-labeled calmodulin. The mammalian MDR gene codes for a membrane-bound P-glycoprotein (170-180 kDa) which imparts multidrug resistance to cancerous cells. The potato cDNA (PMDR1) codes for a polypeptide of 1313 amino acid residues (ca. 144 kDa) and its structural features are very similar to the MDR P-glycoprotein. The N-terminal half of the PMDR1-encoded protein shares striking homology with its C-terminal half, and each half contains a conserved ATP-binding site and six putative transmembrane domains. Southern blot analysis indicated that potato has one or two MDR-like genes. PMDR1 mRNA is constitutively expressed in all organs studied with higher expression in the stem and stolon tip. The PMDR1 expression was highest during tuber initiation and decreased during tuber development.

Molecular cloning, structural analysis, and expression in Escherichia coli of a chitinase gene from Enterobacter agglomerans.

PubMed Central

Chernin, L S; De la Fuente, L; Sobolev, V; Haran, S; Vorgias, C E; Oppenheim, A B; Chet, I

1997-01-01

The gene chiA, which codes for endochitinase, was cloned from a soilborne Enterobacter agglomerans. Its complete sequence was determined, and the deduced amino acid sequence of the enzyme designated Chia_Entag yielded an open reading frame coding for 562 amino acids of a 61-kDa precursor protein with a putative leader peptide at its N terminus. The nucleotide and polypeptide sequences of Chia_Entag showed 86.8 and 87.7% identity with the corresponding gene and enzyme, Chia_Serma, of Serratia marcescens, respectively. Homology modeling of Chia_Entag's three-dimensional structure demonstrated that most amino acid substitutions are at solvent-accessible sites. Escherichia coli JM109 carrying the E. agglomerans chiA gene produced and secreted Chia_Entag. The antifungal activity of the secreted endochitinase was demonstrated in vitro by inhibition of Fusarium oxysporum spore germination. The transformed strain inhibited Rhizoctonia solani growth on plates and the root rot disease caused by this fungus in cotton seedlings under greenhouse conditions. PMID:9055404
Mining novel effector proteins from the esophageal gland cells of Meloidogyne incognita

PubMed Central

Rutter, William B.; Hewezi, Tarek; Abubucker, Sahar; Maier, Tom R.; Huang, Guozhong; Mitreva, Makedonka; Hussey, Richard S.; Baum, Thomas J.

2014-01-01

Meloidogyne incognita is one of the most economically damaging plant pathogens in agriculture and horticulture. Identifying and characterizing the effector proteins, which M. incognita secretes into its host plants during infection, is an important step towards finding new ways to manage this pest. In this study we have identified the cDNAs for 18 putative effectors, i.e., proteins that have the potential to facilitate M. incognita parasitism of host plants. These putative effectors are secretory proteins that do not contain transmembrane domains and whose genes are specifically expressed in the secretory gland cells of the nematode, indicating that they are likely secreted from the nematode through its stylet. We have determined that in the plant cells, these putative effectors are likely to localize to the cytoplasm. Furthermore, the transcripts of many of these novel effectors are specifically up-regulated during different stages of the nematode’s life cycle, indicating that they function at specific stages during M. incognita parasitism. The predicted proteins showed little to no homology to known proteins from free-living nematode species, suggesting that they evolved recently to support the parasitic lifestyle. On the other hand, several of the effectors are part of gene families within the M. incognita genome as well as that of Meloidogyne hapla, which points to an important role that these putative effectors are playing in both parasites. With the discovery of these putative effectors we have increased our knowledge of the effector repertoire utilized by root-knot nematodes to infect, feed, and reproduce on their host plants. Future studies investigating the roles these proteins play in planta will help mitigate the effects of this damaging pest. PMID:24875667
Mining novel effector proteins from the esophageal gland cells of Meloidogyne incognita.

PubMed

Rutter, William B; Hewezi, Tarek; Abubucker, Sahar; Maier, Tom R; Huang, Guozhong; Mitreva, Makedonka; Hussey, Richard S; Baum, Thomas J

2014-09-01

Meloidogyne incognita is one of the most economically damaging plant pathogens in agriculture and horticulture. Identifying and characterizing the effector proteins which M. incognita secretes into its host plants during infection is an important step toward finding new ways to manage this pest. In this study, we have identified the cDNAs for 18 putative effectors (i.e., proteins that have the potential to facilitate M. incognita parasitism of host plants). These putative effectors are secretory proteins that do not contain transmembrane domains and whose genes are specifically expressed in the secretory gland cells of the nematode, indicating that they are likely secreted from the nematode through its stylet. We have determined that, in the plant cells, these putative effectors are likely to localize to the cytoplasm. Furthermore, the transcripts of many of these novel effectors are specifically upregulated during different stages of the nematode's life cycle, indicating that they function at specific stages during M. incognita parasitism. The predicted proteins showed little to no homology to known proteins from free-living nematode species, suggesting that they evolved recently to support the parasitic lifestyle. On the other hand, several of the effectors are part of gene families within the M. incognita genome as well as that of M. hapla, which points to an important role that these putative effectors are playing in both parasites. With the discovery of these putative effectors, we have increased our knowledge of the effector repertoire utilized by root-knot nematodes to infect, feed on, and reproduce on their host plants. Future studies investigating the roles that these proteins play in planta will help mitigate the effects of this damaging pest.
Phenome-genome association studies of pancreatic cancer: new targets for therapy and diagnosis.

PubMed

Narayanan, Ramaswamy

2015-01-01

Pancreatic cancer, has a very high mortality rate and requires novel molecular targets for diagnosis and therapy. Genetic association studies over databases offer an attractive starting point for gene discovery. The National Center for Biotechnology Information (NCBI) Phenome Genome Integrator (PheGenI) tool was enriched for pancreatic cancer-associated traits. The genes associated with the trait were characterized using diverse bioinformatics tools for Genome-Wide Association (GWA), transcriptome and proteome profile and protein classes for motif and domain. Two hundred twenty-six genes were identified that had a genetic association with pancreatic cancer in the human genome. This included 25 uncharacterized open reading frames (ORFs). Bioinformatics analysis of these ORFs identified putative druggable proteins and biomarkers including enzymes, transporters and G-protein-coupled receptor signaling proteins. Secreted proteins including a neuroendocrine factor and a chemokine were identified. Five out of these ORFs encompassed non coding RNAs. The ORF protein expression was detected in numerous body fluids, such as ascites, bile, pancreatic juice, milk, plasma, serum and saliva. Transcriptome and proteome analyses showed a correlation of mRNA and protein expression for nine ORFs. Analysis of the Catalogue of Somatic Mutations in Cancer (COSMIC) database revealed a strong correlation across copy number variations and mRNA over-expression for four ORFs. Mining of the International Cancer Gene Consortium (ICGC) database identified somatic mutations in a significant number of pancreatic patients' tumors for most of these ORFs. The pancreatic cancer-associated ORFs were also found to be genetically associated with other neoplasms, including leukemia, malignant melanoma, neuroblastoma and prostate carcinomas, as well as other unrelated diseases and disorders, such as Alzheimer's disease, Crohn's disease, coronary diseases, attention deficit disorder and addiction. Based on Genome-Wide Association Studies (GWAS), copy number variations, somatic mutational status and correlation of gene expression in pancreatic tumors at the mRNA and protein level, expression specificity in normal tissues and detection in body fluids, six ORFs emerged as putative leads for pancreatic cancer. These six targets provide a basis for accelerated drug discovery and diagnostic marker development for pancreatic cancer. Copyright© 2015, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.
An in silico pipeline to filter the Toxoplasma gondii proteome for proteins that could traffic to the host cell nucleus and influence host cell epigenetic regulation.

PubMed

Syn, Genevieve; Blackwell, Jenefer M; Jamieson, Sarra E; Francis, Richard W

2018-01-01

Toxoplasma gondii uses epigenetic mechanisms to regulate both endogenous and host cell gene expression. To identify genes with putative epigenetic functions, we developed an in silico pipeline to interrogate the T. gondii proteome of 8313 proteins. Step 1 employs PredictNLS and NucPred to identify genes predicted to target eukaryotic nuclei. Step 2 uses GOLink to identify proteins of epigenetic function based on Gene Ontology terms. This resulted in 611 putative nuclear localised proteins with predicted epigenetic functions. Step 3 filtered for secretory proteins using SignalP, SecretomeP, and experimental data. This identified 57 of the 611 putative epigenetic proteins as likely to be secreted. The pipeline is freely available online, uses open access tools and software with user-friendly Perl scripts to automate and manage the results, and is readily adaptable to undertake any such in silico search for genes contributing to particular functions.
The 'permeome' of the malaria parasite: an overview of the membrane transport proteins of Plasmodium falciparum

PubMed Central

Martin, Rowena E; Henry, Roselani I; Abbey, Janice L; Clements, John D; Kirk, Kiaran

2005-01-01

Background The uptake of nutrients, expulsion of metabolic wastes and maintenance of ion homeostasis by the intraerythrocytic malaria parasite is mediated by membrane transport proteins. Proteins of this type are also implicated in the phenomenon of antimalarial drug resistance. However, the initial annotation of the genome of the human malaria parasite Plasmodium falciparum identified only a limited number of transporters, and no channels. In this study we have used a combination of bioinformatic approaches to identify and attribute putative functions to transporters and channels encoded by the malaria parasite, as well as comparing expression patterns for a subset of these. Results A computer program that searches a genome database on the basis of the hydropathy plots of the corresponding proteins was used to identify more than 100 transport proteins encoded by P. falciparum. These include all the transporters previously annotated as such, as well as a similar number of candidate transport proteins that had escaped detection. Detailed sequence analysis enabled the assignment of putative substrate specificities and/or transport mechanisms to all those putative transport proteins previously without. The newly-identified transport proteins include candidate transporters for a range of organic and inorganic nutrients (including sugars, amino acids, nucleosides and vitamins), and several putative ion channels. The stage-dependent expression of RNAs for 34 candidate transport proteins of particular interest are compared. Conclusion The malaria parasite possesses substantially more membrane transport proteins than was originally thought, and the analyses presented here provide a range of novel insights into the physiology of this important human pathogen. PMID:15774027
Discovery-2: an interactive resource for the rational selection and comparison of putative drug target proteins in malaria

PubMed Central

2013-01-01

Background Drug resistance to anti-malarial compounds remains a serious problem, with resistance to newer pharmaceuticals developing at an alarming rate. The development of new anti-malarials remains a priority, and the rational selection of putative targets is a key element of this process. Discovery-2 is an update of the original Discovery in silico resource for the rational selection of putative drug target proteins, enabling researchers to obtain information for a protein which may be useful for the selection of putative drug targets, and to perform advanced filtering of proteins encoded by the malaria genome based on a series of molecular properties. Methods An updated in silico resource has been developed where researchers are able to mine information on malaria proteins and predicted ligands, as well as perform comparisons to the human and mosquito host characteristics. Protein properties used include: domains, motifs, EC numbers, GO terms, orthologs, protein-protein interactions, protein-ligand interactions. Newly added features include drugability measures from ChEMBL, automated literature relations and links to clinical trial information. Searching by chemical structure is also available. Results The updated functionality of the Discovery-2 resource is presented, together with a detailed case study of the Plasmodium falciparum S-adenosyl-L-homocysteine hydrolase (PfSAHH) protein. A short example of a chemical search with pyrimethamine is also illustrated. Conclusion The updated Discovery-2 resource allows researchers to obtain detailed properties of proteins from the malaria genome, which may be of interest in the target selection process, and to perform advanced filtering and selection of proteins based on a relevant range of molecular characteristics. PMID:23537208
Complete genomic characterisation of two novel poxviruses (WKPV and EKPV) from western and eastern grey kangaroos.

PubMed

Bennett, Mark; Tu, Shin-Lin; Upton, Chris; McArtor, Cassie; Gillett, Amber; Laird, Tanya; O'Dea, Mark

2017-10-15

Poxviruses have previously been detected in macropods with cutaneous papillomatous lesions, however to date, no comprehensive analysis of a poxvirus from kangaroos has been performed. Here we report the genome sequences of a western grey kangaroo poxvirus (WKPV) and an eastern grey kangaroo poxvirus (EKPV), named for the host species from which they were isolated, western grey (Macropus fuliginosus) and eastern grey (Macropus giganteus) kangaroos. Poxvirus DNA from WKPV and EKPV was isolated and entire coding genome regions determined through Roche GS Junior and Illumina Miseq sequencing, respectively. Viral genomes were assembled using MIRA and SPAdes, and annotations performed using tools available from the Viral Bioinformatics Resource Centre. Histopathology and transmission electron microscopy analysis was also performed on WKPV and its associated lesions. The WKPV and EKPV genomes show 96% identity (nucleotide) to each other and phylogenetic analysis places them on a distinct branch between the established Molluscipoxvirus and Avipoxvirus genera. WKPV and EKPV are 170 kbp and 167 kbp long, containing 165 and 162 putative genes, respectively. Together, their genomes encode up to 47 novel unique hypothetical proteins, and possess virulence proteins including a major histocompatibility complex class II inhibitor, a semaphorin-like protein, a serpin, a 3-β-hydroxysteroid dehydrogenase/δ 5→4 isomerase, and a CD200-like protein. These viruses also encode a large putative protein (WKPV-WA-039 and EKPV-SC-038) with a C-terminal domain that is structurally similar to the C-terminal domain of a cullin, suggestive of a role in the control of host ubiquitination. The relationship of these viruses to members of the Molluscipoxvirus and Avipoxvirus genera is discussed in terms of sequence similarity, gene content and nucleotide composition. A novel genus within subfamily Chordopoxvirinae is proposed to accommodate these two poxvirus species from kangaroos; we suggest the name, Thylacopoxvirus (thylaco-: [Gr.] thylakos meaning sac or pouch). Copyright © 2017 Elsevier B.V. All rights reserved.
The Long Noncoding RNA Transcriptome of Dictyostelium discoideum Development.

PubMed

Rosengarten, Rafael D; Santhanam, Balaji; Kokosar, Janez; Shaulsky, Gad

2017-02-09

Dictyostelium discoideum live in the soil as single cells, engulfing bacteria and growing vegetatively. Upon starvation, tens of thousands of amoebae enter a developmental program that includes aggregation, multicellular differentiation, and sporulation. Major shifts across the protein-coding transcriptome accompany these developmental changes. However, no study has presented a global survey of long noncoding RNAs (ncRNAs) in D. discoideum To characterize the antisense and long intergenic noncoding RNA (lncRNA) transcriptome, we analyzed previously published developmental time course samples using an RNA-sequencing (RNA-seq) library preparation method that selectively depletes ribosomal RNAs (rRNAs). We detected the accumulation of transcripts for 9833 protein-coding messenger RNAs (mRNAs), 621 lncRNAs, and 162 putative antisense RNAs (asRNAs). The noncoding RNAs were interspersed throughout the genome, and were distinct in expression level, length, and nucleotide composition. The noncoding transcriptome displayed a temporal profile similar to the coding transcriptome, with stages of gradual change interspersed with larger leaps. The transcription profiles of some noncoding RNAs were strongly correlated with known differentially expressed coding RNAs, hinting at a functional role for these molecules during development. Examining the mitochondrial transcriptome, we modeled two novel antisense transcripts. We applied yet another ribosomal depletion method to a subset of the samples to better retain transfer RNA (tRNA) transcripts. We observed polymorphisms in tRNA anticodons that suggested a post-transcriptional means by which D. discoideum compensates for codons missing in the genomic complement of tRNAs. We concluded that the prevalence and characteristics of long ncRNAs indicate that these molecules are relevant to the progression of molecular and cellular phenotypes during development. Copyright © 2017 Rosengarten et al.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-02-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed Central

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-01-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
Nucleotide sequence of the 3' terminal region of lettuce mosaic potyvirus RNA shows a Gln/Val dipeptide at the cleavage site between the polymerase and the coat protein.

PubMed

Dinant, S; Lot, H; Albouy, J; Kuziak, C; Meyer, M; Astier-Manifacier, S

1991-01-01

DNA complementary to the 3' terminal 1651 nucleotides of the genome of the common strain of lettuce mosaic virus (LMV-O) has been cloned and sequenced. Microsequencing of the N-terminus enabled localization of the coat protein gene in this sequence. It showed also that the LMV coat protein coding region is at the 3' end of the genome, and that the coat protein is processed from a larger protein by cleavage at an unusual Q/V dipeptide between the polymerase and the coat protein. This is the first report of such a site for cleavage of a potyvirus polyprotein, where only Q/A, Q/S, and Q/G cleavage sites have been reported. The LMV coat protein gene encodes a 278 amino acid polypeptide with a calculated Mr of 31,171 and is flanked by a region which has a high degree of homology with the putative polymerase and a 3' untranslated region of 211 nucleotides in length. Percentage of homology with the coat protein of other potyviruses confirms that LMV is a distinct member of this group. Moreover, amino acid homologies noticed with the coat protein of potexvirus, bymovirus, and carlavirus elongated plant viruses suggest a functional significance for the conserved domains.
Functional similarity and molecular divergence of a novel reproductive transcriptome in two male-pregnant Syngnathus pipefish species

PubMed Central

Small, Clayton M; Harlin-Cognato, April D; Jones, Adam G

2013-01-01

Evolutionary studies have revealed that reproductive proteins in animals and plants often evolve more rapidly than the genome-wide average. The causes of this pattern, which may include relaxed purifying selection, sexual selection, sexual conflict, pathogen resistance, reinforcement, or gene duplication, remain elusive. Investigative expansions to additional taxa and reproductive tissues have the potential to shed new light on this unresolved problem. Here, we embark on such an expansion, in a comparison of the brood-pouch transcriptome between two male-pregnant species of the pipefish genus Syngnathus. Male brooding tissues in syngnathid fishes represent a novel, nonurogenital reproductive trait, heretofore mostly uncharacterized from a molecular perspective. We leveraged next-generation sequencing (Roche 454 pyrosequencing) to compare transcript abundance in the male brooding tissues of pregnant with nonpregnant samples from Gulf (S. scovelli) and dusky (S. floridae) pipefish. A core set of protein-coding genes, including multiple members of astacin metalloprotease and c-type lectin gene families, is consistent between species in both the direction and magnitude of expression bias. As predicted, coding DNA sequence analysis of these putative “male pregnancy proteins” suggests rapid evolution relative to nondifferentially expressed genes and reflects signatures of adaptation similar in magnitude to those reported from Drosophila male accessory gland proteins. Although the precise drivers of male pregnancy protein divergence remain unknown, we argue that the male pregnancy transcriptome in syngnathid fishes, a clade diverse with respect to brooding morphology and mating system, represents a unique and promising object of study for understanding the perplexing evolutionary nature of reproductive molecules. PMID:24324861
Gene-enriched draft genome of the cattle tick Rhipicephalus microplus: assembly by the hybrid Pacific Biosciences/Illumina approach enabled analysis of the highly repetitive genome.

PubMed

Barrero, Roberto A; Guerrero, Felix D; Black, Michael; McCooke, John; Chapman, Brett; Schilkey, Faye; Pérez de León, Adalberto A; Miller, Robert J; Bruns, Sara; Dobry, Jason; Mikhaylenko, Galina; Stormo, Keith; Bell, Callum; Tao, Quanzhou; Bogden, Robert; Moolhuijzen, Paula M; Hunter, Adam; Bellgard, Matthew I

2017-08-01

The genome of the cattle tick Rhipicephalus microplus, an ectoparasite with global distribution, is estimated to be 7.1Gbp in length and consists of approximately 70% repetitive DNA. We report the draft assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genome. Our hybrid approach produced an assembly consisting of 2.0Gbp represented in 195,170 scaffolds with a N50 of 60,284bp. The Rmi v2.0 assembly is 51.46% repetitive with a large fraction of unclassified repeats, short interspersed elements, long interspersed elements and long terminal repeats. We identified 38,827 putative R. microplus gene loci, of which 24,758 were protein coding genes (≥100 amino acids). OrthoMCL comparative analysis against 11 selected species including insects and vertebrates identified 10,835 and 3,423 protein coding gene loci that are unique to R. microplus or common to both R. microplus and Ixodes scapularis ticks, respectively. We identified 191 microRNA loci, of which 168 have similarity to known miRNAs and 23 represent novel miRNA families. We identified the genomic loci of several highly divergent R. microplus esterases with sequence similarity to acetylcholinesterase. Additionally we report the finding of a novel cytochrome P450 CYP41 homolog that shows similar protein folding structures to known CYP41 proteins known to be involved in acaricide resistance. Copyright © 2017 Australian Society for Parasitology. Published by Elsevier Ltd. All rights reserved.
The Caenorhabditis elegans gene unc-89, required fpr muscle M-line assembly, encodes a giant modular protein composed of Ig and signal transduction domains

PubMed Central

1996-01-01

Mutations in the Caenorhabditis elegans gene unc-89 result in nematodes having disorganized muscle structure in which thick filaments are not organized into A-bands, and there are no M-lines. Beginning with a partial cDNA from the C. elegans sequencing project, we have cloned and sequenced the unc-89 gene. An unc-89 allele, st515, was found to contain an 84-bp deletion and a 10-bp duplication, resulting in an in- frame stop codon within predicted unc-89 coding sequence. Analysis of the complete coding sequence for unc-89 predicts a novel 6,632 amino acid polypeptide consisting of sequence motifs which have been implicated in protein-protein interactions. UNC-89 begins with 67 residues of unique sequences, SH3, dbl/CDC24, and PH domains, 7 immunoglobulins (Ig) domains, a putative KSP-containing multiphosphorylation domain, and ends with 46 Ig domains. A polyclonal antiserum raised to a portion of unc-89 encoded sequence reacts to a twitchin-sized polypeptide from wild type, but truncated polypeptides from st515 and from the amber allele e2338. By immunofluorescent microscopy, this antiserum localizes to the middle of A-bands, consistent with UNC-89 being a structural component of the M-line. Previous studies indicate that myofilament lattice assembly begins with positional cues laid down in the basement membrane and muscle cell membrane. We propose that the intracellular protein UNC-89 responds to these signals, localizes, and then participates in assembling an M-line. PMID:8603916
Prunus necrotic ringspot ilarvirus: nucleotide sequence of RNA3 and the relationship to other ilarviruses based on coat protein comparison.

PubMed

Guo, D; Maiss, E; Adam, G; Casper, R

1995-05-01

The RNA3 of prunus necrotic ringspot ilarvirus (PNRSV) has been cloned and its entire sequence determined. The RNA3 consists of 1943 nucleotides (nt) and possesses two large open reading frames (ORFs) separated by an intergenic region of 74 nt. The 5' proximal ORF is 855 nt in length and codes for a protein of molecular mass 31.4 kDa which has homologies with the putative movement protein of other members of the Bromoviridae. The 3' proximal ORF of 675 nt is the cistron for the coat protein (CP) and has a predicted molecular mass of 24.9 kDa. The sequence of the 3' non-coding region (NCR) of PNRSV RNA3 showed a high degree of similarity with those of tobacco streak virus (TSV), prune dwarf virus (PDV), apple mosaic virus (ApMV) and also alfalfa mosaic virus (AIMV). In addition it contained potential stem-loop structures with interspersed AUGC motifs characteristic for ilar- and alfamoviruses. This conserved primary and secondary structure in all 3' NCRs may be responsible for the interaction with homologous and heterologous CPs and subsequent activation of genome replication. The CP gene of an ApMV isolate (ApMV-G) of 657 nt has also been cloned and sequenced. Although ApMV and PNRSV have a distant serological relationship, the deduced amino acid sequences of their CPs have an identity of only 51.8%. The N termini of PNRSV and ApMV CPs have in common a zinc-finger motif and the potential to form an amphipathic helix.
Structure and evolution of the mitochondrial genome of Exorista sorbillans: the Tachinidae (Diptera: Calyptratae) perspective.

PubMed

Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing

2012-12-01

The first complete mitochondrial genome (mitogenome) of Tachinidae Exorista sorbillans (Diptera) is sequenced by PCR-based approach. The circular mitogenome is 14,960 bp long and has the representative mitochondrial gene (mt gene) organization and order of Diptera. All protein-coding sequences are initiated with ATN codon; however, the only exception is Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA), but the rest are seated on the H strand with incomplete codons. The mitogenome of E. sorbillans is biased toward A+T content at 78.4 %, and the strand-specific bias is in reflection of the third codon positions of mt genes, and their T/C ratios as strand indictor are higher on the H strand more than those on the L strand pointing at any strain of seven Diptera flies. The length of the A+T-rich region of E. sorbillans is 106 bp, including a tandem triple copies of a13-bp fragment. Compared to Haematobia irritans, E. sorbillans holds distant relationship with Drosophila. Phylogenetic topologies based on the amino acid sequences, supporting that E. sorbillans (Tachinidae) is clustered with strains of Calliphoridae and Oestridae, and superfamily Oestroidea are polyphyletic groups with Muscidae in a clade.
Complete mitochondrial genome of the Asian paddle crab Charybdis japonica (Crustacea: Decapoda: Portunidae): gene rearrangement of the marine brachyurans and phylogenetic considerations of the decapods.

PubMed

Liu, Yuan; Cui, Zhaoxia

2010-06-01

Given the commercial and ecological importance of the Asian paddle crab, Charybdis japonica, there is a clearly need for genetic and molecular research on this species. Here, we present the complete mitochondrial genome sequence of C. japonica, determined by the long-polymerase chain reaction and primer walking sequencing method. The entire genome is 15,738 bp in length, encoding a standard set of 13 protein-coding genes, two ribosomal RNA genes, and 22 transfer RNA genes, plus the putative control region, which is typical for metazoans. The total A+T content of the genome is 69.2%, lower than the other brachyuran crabs except for Callinectes sapidus. The gene order is identical to the published marine brachyurans and differs from the ancestral pancrustacean order by only the position of the tRNA ( His ) gene. Phylogenetic analyses using the concatenated nucleotide and amino acid sequences of 13 protein-coding genes strongly support the monophyly of Dendrobranchiata and Pleocyemata, which is consistent with the previous taxonomic classification. However, the systematic status of Charybdis within subfamily Thalamitinae of family Portunidae is not supported. C. japonica, as the first species of Charybdis with complete mitochondrial genome available, will provide important information on both genomics and molecular ecology of the group.
The nagA gene of Penicillium chrysogenum encoding beta-N-acetylglucosaminidase.

PubMed

Díez, Bruno; Rodríguez-Sáiz, Marta; de la Fuente, Juan Luis; Moreno, Miguel Angel; Barredo, José Luis

2005-01-15

We purified the beta-N-acetylglucosaminidase from the filamentous fungus Penicillium chrysogenum and its N-terminal sequence was determined, showing the presence of a mixture of two proteins (P1 and P2). A genomic DNA fragment was cloned by using degenerated oligonucleotides from the Nt sequences. The nucleotide sequence showed the presence of an ORF (nagA gene) lacking introns, with a length of 1791 bp, and coding for a protein of 66.5 kDa showing similarity to acetylglucosaminidases. The NagA deduced protein includes P1 and P2 as incomplete forms of the mature protein, and contains putative features for protein maturation: an 18-amino acid signal peptide, a KEX2 processing site, and four glycosylation motifs. The sequence just after the signal peptide corresponds to P2 and that after the KEX2 site to P1. The nagA transcript has a size of about 2.1 kb and is present until the end of the fermentation process for penicillin production. NagA is one of the most largely represented proteins in P. chrysogenum, increasing along the fermentation process. The suitability of the nagA promoter (PnagA) for gene expression in fungi was demonstrated by expressing the bleomycin resistance gene (ble(R)) from Streptoalloteichus hindustanus in P. chrysogenum.
Diversification and Expression of the PIN, AUX/LAX, and ABCB Families of Putative Auxin Transporters in Populus

PubMed Central

Carraro, Nicola; Tisdale-Orr, Tracy Eizabeth; Clouse, Ronald Matthew; Knöller, Anne Sophie; Spicer, Rachel

2012-01-01

Intercellular transport of the plant hormone auxin is mediated by three families of membrane-bound protein carriers, with the PIN and ABCB families coding primarily for efflux proteins and the AUX/LAX family coding for influx proteins. In the last decade our understanding of gene and protein function for these transporters in Arabidopsis has expanded rapidly but very little is known about their role in woody plant development. Here we present a comprehensive account of all three families in the model woody species Populus, including chromosome distribution, protein structure, quantitative gene expression, and evolutionary relationships. The PIN and AUX/LAX gene families in Populus comprise 16 and 8 members respectively and show evidence for the retention of paralogs following a relatively recent whole genome duplication. There is also differential expression across tissues within many gene pairs. The ABCB family is previously undescribed in Populus and includes 20 members, showing a much deeper evolutionary history, including both tandem and whole genome duplication as well as probable gene loss. A striking number of these transporters are expressed in developing Populus stems and we suggest that evolutionary and structural relationships with known auxin transporters in Arabidopsis can point toward candidate genes for further study in Populus. This is especially important for the ABCBs, which is a large family and includes members in Arabidopsis that are able to transport other substrates in addition to auxin. Protein modeling, sequence alignment and expression data all point to ABCB1.1 as a likely auxin transport protein in Populus. Given that basipetal auxin flow through the cambial zone shapes the development of woody stems, it is important that we identify the full complement of genes involved in this process. This work should lay the foundation for studies targeting specific proteins for functional characterization and in situ localization. PMID:22645571

Comparative Proteomics Reveals a Significant Bias Toward Alternative Protein Isoforms with Conserved Structure and Function

PubMed Central

Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L.

2012-01-01

Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of “novel” and “putative” protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and strongly suggests that the translation of alternative transcripts may be subject to selective constraints. PMID:22446687
[Genetic hypophosphatemia: recent advances in physiopathogenic concept].

PubMed

Beraud, G; Perimenis, P; Velayoudom, Fr-L; Wemeau, J-L; Vantyghem, M-Chr

2005-04-01

Renal proximal tubular reabsorption of phosphate and intestinal absorption both regulate phosphate homeostasis. Brush-border membrane Npt2a cotransporter is the key element in proximal tubular P (i) reabsorption. Inactivating mutations of Npt2a cause bone demineralisation and urolithiasis. An excess of a phosphaturic factor, called "Phosphatonin", could modulate phosphate reabsorption by inhibition on Npt2a. Inactivating mutation of PHEX, an endopeptidase-membrane coding gene, is responsible for X-linked Hypophosphatemia (XLH), because of an impaired degradation of phosphatonine by PHEX product. Autosomic Dominant Hypophosphatemic Rickets (ADHR) is explained by a mutation preventing FGF23 (one of the best identified phosphatonines) from cleavage. According recent data, FGF23, MEPE (Matrix Extracellular Phosphoglycoprotein) et FRP4 (frizzled related protein-4) are 3 putative "phosphatonines".
Trypanosome RNA polymerases and transcription factors: sensible trypanocidal drug targets?

PubMed

Vanhamme, Luc

2008-11-01

Trypanosomes and Leishmaniae are the agents of several important parasitic diseases threatening hundreds of million human beings worldwide. As they diverged early in evolution, they display original molecular characteristics. These peculiarities are each defining putative specific targets for anti-parasitic drugs. Transcription displays its lot of unique characteristics in trypanosomes and will be taken as an example to uncover these targets. Unique features of transcription in trypanosomes include constitutive and poly-cistronic transcription by RNA polymerase II as well as transcription of protein-coding genes by RNA polymerase I. It is becoming clear that these unique mechanisms are performed by dedicated molecular players. The first of them have been recently characterized. They are reviewed and their suitability as drug targets is commented.
Complete mitochondrial genome of the giant African snail, Achatina fulica (Mollusca: Achatinidae): a novel location of putative control regions (CR) in the mitogenome within Pulmonate species.

PubMed

He, Zhang-Ping; Dai, Xia-Bin; Zhang, Shuai; Zhi, Ting-Ting; Lun, Zhao-Rong; Wu, Zhong-Dao; Yang, Ting-Bao

2016-01-01

The whole sequence (15,057 bp) of the mitochondrial DNA (mtDNA) of the terrestrial snail Achatina fulica (order Stylommatophora) was determined. The mitogenome, as the typical metazoan mtDNA, contains 13 protein-coding genes (PCG), 2 ribosomal RNA genes (rRNA) and 22 transfer RNA genes (tRNA). The tRNA genes include two trnS without standard secondary structure. Interestingly, among the known mitogenomes of Pulmonata species, we firstly characterized an unassigned lengthy sequence (551 bp) between the cox1 and the trnV which may be the CR for the sake of its AT bases usage bias (65.70%) and potential hairpin structure.
The complete DNA sequence of lymphocystis disease virus.

PubMed

Tidona, C A; Darai, G

1997-04-14

Lymphocystis disease virus (LCDV) is the causative agent of lymphocystis disease, which has been reported to occur in over 100 different fish species worldwide. LCDV is a member of the family Iridoviridae and the type species of the genus Lymphocystivirus. The virions contain a single linear double-stranded DNA molecule, which is circularly permuted, terminally redundant, and heavily methylated at cytosines in CpG sequences. The complete nucleotide sequence of LCDV-1 (flounder isolate) was determined by automated cycle sequencing and primer walking. The genome of LCDV-1 is 102.653 bp in length and contains 195 open reading frames with coding capacities ranging from 40 to 1199 amino acids. Computer-assisted analyses of the deduced amino acid sequences led to the identification of several putative gene products with significant homologies to entries in protein data banks, such as the two major subunits of the viral DNA-dependent RNA polymerase, DNA polymerase, several protein kinases, two subunits of the ribonucleoside diphosphate reductase, DNA methyltransferase, the viral major capsid protein, insulin-like growth factor, and tumor necrosis factor receptor homolog.
Quantitative proteome-based systematic identification of SIRT7 substrates.

PubMed

Zhang, Chaohua; Zhai, Zichao; Tang, Ming; Cheng, Zhongyi; Li, Tingting; Wang, Haiying; Zhu, Wei-Guo

2017-07-01

SIRT7 is a class III histone deacetylase that is involved in numerous cellular processes. Only six substrates of SIRT7 have been reported thus far, so we aimed to systematically identify SIRT7 substrates using stable-isotope labeling with amino acids in cell culture (SILAC) coupled with quantitative mass spectrometry (MS). Using SIRT7 +/+ and SIRT7 -/- mouse embryonic fibroblasts as our model system, we identified and quantified 1493 acetylation sites in 789 proteins, of which 261 acetylation sites in 176 proteins showed ≥2-fold change in acetylation state between SIRT7 -/- and SIRT7 +/+ cells. These proteins were considered putative SIRT7 substrates and were carried forward for further analysis. We then validated the predictive efficiency of the SILAC-MS experiment by assessing substrate acetylation status in vitro in six predicted proteins. We also performed a bioinformatic analysis of the MS data, which indicated that many of the putative protein substrates were involved in metabolic processes. Finally, we expanded our list of candidate substrates by performing a bioinformatics-based prediction analysis of putative SIRT7 substrates, using our list of putative substrates as a positive training set, and again validated a subset of the proteins in vitro. In summary, we have generated a comprehensive list of SIRT7 candidate substrates. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A Glycine Riboswitch in Streptococcus pyogenes Controls Expression of a Sodium:Alanine Symporter Family Protein Gene.

PubMed

Khani, Afsaneh; Popp, Nicole; Kreikemeyer, Bernd; Patenge, Nadja

2018-01-01

Regulatory RNAs play important roles in the control of bacterial gene expression. In this study, we investigated gene expression regulation by a putative glycine riboswitch located in the 5'-untranslated region of a sodium:alanine symporter family (SAF) protein gene in the group A Streptococcus pyogenes serotype M49 strain 591. Glycine-dependent gene expression mediated by riboswitch activity was studied using a luciferase reporter gene system. Maximal reporter gene expression was observed in the absence of glycine and in the presence of low glycine concentrations. Differences in glycine-dependent gene expression were not based on differential promoter activity. Expression of the SAF protein gene and the downstream putative cation efflux protein gene was investigated in wild-type bacteria by RT-qPCR transcript analyses. During growth in the presence of glycine (≥1 mM), expression of the genes were downregulated. Northern blot analyses revealed premature transcription termination in the presence of high glycine concentrations. Growth in the presence of 0.1 mM glycine led to the production of a full-length transcript. Furthermore, stability of the SAF protein gene transcript was drastically reduced in the presence of glycine. We conclude that the putative glycine riboswitch in S. pyogenes serotype M49 strain 591 represses expression of the SAF protein gene and the downstream putative cation efflux protein gene in the presence of high glycine concentrations. Sequence and secondary structure comparisons indicated that the streptococcal riboswitch belongs to the class of tandem aptamer glycine riboswitches.
Crystal structure of a two-subunit TrkA octameric gating ring assembly

DOE PAGES

Deller, Marc C.; Johnson, Hope A.; Miller, Mitchell D.; ...

2015-03-31

The TM1088 locus of T. maritima codes for two proteins designated TM1088A and TM1088B, which combine to form the cytosolic portion of a putative Trk K⁺ transporter. We report the crystal structure of this assembly to a resolution of 3.45 Å. The high resolution crystal structures of the components of the assembly, TM1088A and TM1088B, were also determined independently to 1.50 Å and 1.55 Å, respectively. The TM1088 proteins are structurally homologous to each other and to other K⁺ transporter proteins, such as TrkA. These proteins form a cytosolic gating ring assembly that controls the flow of K⁺ ions acrossmore » the membrane. TM1088 represents the first structure of a two-subunit Trk assembly. Despite the atypical genetics and chain organization of the TM1088 assembly, it shares significant structural homology and an overall quaternary organization with other single-subunit K⁺ gating ring assemblies. This structure provides the first structural insights into what may be an evolutionary ancestor of more modern single-subunit K⁺ gating ring assemblies.« less
The potato virus X TGBp2 protein association with the endoplasmic reticulum plays a role in but is not sufficient for viral cell-to-cell movement

NASA Technical Reports Server (NTRS)

Mitra, Ruchira; Krishnamurthy, Konduru; Blancaflor, Elison; Payton, Mark; Nelson, Richard S.; Verchot-Lubicz, Jeanmarie

2003-01-01

Potato virus X (PVX) TGBp1, TGBp2, TGBp3, and coat protein are required for virus cell-to-cell movement. Plasmids expressing GFP fused to TGBp2 were bombarded to leaf epidermal cells and GFP:TGBp2 moved cell to cell in Nicotiana benthamiana leaves but not in Nicotiana tabacum leaves. GFP:TGBp2 movement was observed in TGBp1-transgenic N. tabacum, indicating that TGBp2 requires TGBp1 to promote its movement in N. tabacum. In this study, GFP:TGBp2 was detected in a polygonal pattern that resembles the endoplasmic reticulum (ER) network. Amino acid sequence analysis revealed TGBp2 has two putative transmembrane domains. Two mutations separately introduced into the coding sequences encompassing the putative transmembrane domains within the GFP:TGBp2 plasmids and PVX genome, disrupted membrane binding of GFP:TGBp2, inhibited GFP:TGBp2 movement in N. benthamiana and TGBp1-expressing N. tabacum, and inhibited PVX movement. A third mutation, lying outside the transmembrane domains, had no effect on GFP:TGBp2 ER association or movement in N. benthamiana but inhibited GFP:TGBp2 movement in TGBp1-expressing N. tabacum and PVX movement in either Nicotiana species. Thus, ER association of TGBp2 may be required but not be sufficient for virus movement. TGBp2 likely provides an activity for PVX movement beyond ER association.
Rise of Microbial Culturomics: Noncontiguous Finished Genome Sequence and Description of Beduini massiliensis gen. nov., sp. nov.

PubMed Central

Mourembou, Gaël; Yasir, Muhammad; Azhar, Esam Ibraheem; Lagier, Jean Christophe; Bibi, Fehmida; Jiman-Fatani, Asif Ahmad; Helmy, Nayel; Robert, Catherine; Rathored, Jaishriram; Fournier, Pierre-Edouard; Raoult, Didier

2015-01-01

Abstract Microbial culturomics is a new field of omics sciences that examines the bacterial diversity of human gut coupled with a taxono-genomic strategy. Using microbial culturomics, we report here for the first time a novel Gram negative, catalase- and oxidase-negative, strict anaerobic bacilli named Beduini massiliensis gen. nov., sp nov. strain GM1 (= CSUR P1440 = DSM 100188), isolated from the stools of a female nomadic Bedouin from Saudi Arabia. With a length of 2,850,586 bp, the Beduini massiliensis genome exhibits a G + C content of 35.9%, and contains 2819 genes (2744 protein-coding and 75 RNA genes including 57 tRNA and 18 rRNA genes). It is composed of 6 scaffolds (composed of 6 contigs). A total of 1859 genes (67.75%) were assigned a putative function (by COGs or by NR blast). At least 1457 (53%) orthologous proteins were not shared with the closest phylogenetic species. 274 genes (10.0%) were identified as ORFans. These results show that microbial culturomics can dramatically improve the characterization of the human microbiota repertoire, deciphering new bacterial species and new genes. Further studies will clarify the geographic specificity and the putative role of these new microbes and their related functional genetic content in health and disease. Microbial culturomics is an emerging frontier of omics systems sciences and integrative biology and thus, warrants further consideration as part of the postgenomics methodology toolbox. PMID:26669711
Rise of Microbial Culturomics: Noncontiguous Finished Genome Sequence and Description of Beduini massiliensis gen. nov., sp. nov.

PubMed

Mourembou, Gaël; Yasir, Muhammad; Azhar, Esam Ibraheem; Lagier, Jean Christophe; Bibi, Fehmida; Jiman-Fatani, Asif Ahmad; Helmy, Nayel; Robert, Catherine; Rathored, Jaishriram; Fournier, Pierre-Edouard; Raoult, Didier; Million, Matthieu

2015-12-01

Microbial culturomics is a new field of omics sciences that examines the bacterial diversity of human gut coupled with a taxono-genomic strategy. Using microbial culturomics, we report here for the first time a novel Gram negative, catalase- and oxidase-negative, strict anaerobic bacilli named Beduini massiliensis gen. nov., sp nov. strain GM1 (= CSUR P1440 = DSM 100188), isolated from the stools of a female nomadic Bedouin from Saudi Arabia. With a length of 2,850,586 bp, the Beduini massiliensis genome exhibits a G + C content of 35.9%, and contains 2819 genes (2744 protein-coding and 75 RNA genes including 57 tRNA and 18 rRNA genes). It is composed of 6 scaffolds (composed of 6 contigs). A total of 1859 genes (67.75%) were assigned a putative function (by COGs or by NR blast). At least 1457 (53%) orthologous proteins were not shared with the closest phylogenetic species. 274 genes (10.0%) were identified as ORFans. These results show that microbial culturomics can dramatically improve the characterization of the human microbiota repertoire, deciphering new bacterial species and new genes. Further studies will clarify the geographic specificity and the putative role of these new microbes and their related functional genetic content in health and disease. Microbial culturomics is an emerging frontier of omics systems sciences and integrative biology and thus, warrants further consideration as part of the postgenomics methodology toolbox.
A 2,4-dichlorophenoxyacetic acid degradation plasmid pM7012 discloses distribution of an unclassified megaplasmid group across bacterial species.

PubMed

Sakai, Yoriko; Ogawa, Naoto; Shimomura, Yumi; Fujii, Takeshi

2014-03-01

Analysis of the complete nucleotide sequence of plasmid pM7012 from 2,4-dichlorophenoxyacetic-acid (2,4-D)-degrading bacterium Burkholderia sp. M701 revealed that the plasmid had 582 142 bp, with 541 putative protein-coding sequences and 39 putative tRNA genes for the transport of the standard 20 aa. pM7012 contains sequences homologous to the regions involved in conjugal transfer and plasmid maintenance found in plasmids byi_2p from Burkholderia sp. YI23 and pBVIE01 from Burkholderia sp. G4. No relaxase gene was found in any of these plasmids, although genes for a type IV secretion system and type IV coupling proteins were identified. Plasmids with no relaxase gene have been classified as non-mobile plasmids. However, nucleotide sequences with a high level of similarity to the genes for plasmid transfer, plasmid maintenance, 2,4-D degradation and arsenic resistance contained on pM7012 were also detected in eight other megaplasmids (~600 or 900 kb) found in seven Burkholderia strains and a strain of Cupriavidus, which were isolated as 2,4-D-degrading bacteria in Japan and the United States. These results suggested that the 2,4-D degradation megaplasmids related to pM7012 are mobile and distributed across various bacterial species worldwide, and that the plasmid group could be distinguished from known mobile plasmid groups.
Fine mapping of RYMV3: a new resistance gene to Rice yellow mottle virus from Oryza glaberrima.

PubMed

Pidon, Hélène; Ghesquière, Alain; Chéron, Sophie; Issaka, Souley; Hébrard, Eugénie; Sabot, François; Kolade, Olufisayo; Silué, Drissa; Albar, Laurence

2017-04-01

A new resistance gene against Rice yellow mottle virus was identified and mapped in a 15-kb interval. The best candidate is a CC-NBS-LRR gene. Rice yellow mottle virus (RYMV) disease is a serious constraint to the cultivation of rice in Africa and selection for resistance is considered to be the most effective management strategy. The aim of this study was to characterize the resistance of Tog5307, a highly resistant accession belonging to the African cultivated rice species (Oryza glaberrima), that has none of the previously identified resistance genes to RYMV. The specificity of Tog5307 resistance was analyzed using 18 RYMV isolates. While three of them were able to infect Tog5307 very rapidly, resistance against the others was effective despite infection events attributed to resistance-breakdown or incomplete penetrance of the resistance. Segregation of resistance in an interspecific backcross population derived from a cross between Tog5307 and the susceptible Oryza sativa variety IR64 showed that resistance is dominant and is controlled by a single gene, named RYMV3. RYMV3 was mapped in an approximately 15-kb interval in which two candidate genes, coding for a putative transmembrane protein and a CC-NBS-LRR domain-containing protein, were annotated. Sequencing revealed non-synonymous polymorphisms between Tog5307 and the O. glaberrima susceptible accession CG14 in both candidate genes. An additional resistant O. glaberrima accession, Tog5672, was found to have the Tog5307 genotype for the CC-NBS-LRR gene but not for the putative transmembrane protein gene. Analysis of the cosegregation of Tog5672 resistance with the RYMV3 locus suggests that RYMV3 is also involved in Tog5672 resistance, thereby supporting the CC-NBS-LRR gene as the best candidate for RYMV3.
Characterization of constitutive and putative differentially expressed mRNAs by means of expressed sequence tags, differential display reverse transcriptase-PCR and randomly amplified polymorphic DNA-PCR from the sand fly vector Lutzomyia longipalpis.

PubMed

Ramalho-Ortigão, J M; Temporal, P; de Oliveira , S M; Barbosa, A F; Vilela, M L; Rangel, E F; Brazil, R P; Traub-Cseko, Y M

2001-01-01

Molecular studies of insect disease vectors are of paramount importance for understanding parasite-vector relationship. Advances in this area have led to important findings regarding changes in vectors' physiology upon blood feeding and parasite infection. Mechanisms for interfering with the vectorial capacity of insects responsible for the transmission of diseases such as malaria, Chagas disease and dengue fever are being devised with the ultimate goal of developing transgenic insects. A primary necessity for this goal is information on gene expression and control in the target insect. Our group is investigating molecular aspects of the interaction between Leishmania parasites and Lutzomyia sand flies. As an initial step in our studies we have used random sequencing of cDNA clones from two expression libraries made from head/thorax and abdomen of sugar fed L. longipalpis for the identification of expressed sequence tags (EST). We applied differential display reverse transcriptase-PCR and randomly amplified polymorphic DNA-PCR to characterize differentially expressed mRNA from sugar and blood fed insects, and, in one case, from a L. (V.) braziliensis-infected L. longipalpis. We identified 37 cDNAs that have shown homology to known sequences from GeneBank. Of these, 32 cDNAs code for constitutive proteins such as zinc finger protein, glutamine synthetase, G binding protein, ubiquitin conjugating enzyme. Three are putative differentially expressed cDNAs from blood fed and Leishmania-infected midgut, a chitinase, a V-ATPase and a MAP kinase. Finally, two sequences are homologous to Drosophila melanogaster gene products recently discovered through the Drosophila genome initiative.
Examination of Campylobacter jejuni putative adhesins leads to the identification of a new protein, designated FlpA, required for chicken colonization

USDA-ARS?s Scientific Manuscript database

Campylobacter jejuni colonization of chickens is dependent upon surface exposed proteins termed adhesins. Putative C. jejuni adhesins include CadF, CapA, JlpA, MOMP, PEB1, Cj1279c, and Cj1349c. We examined the genetic relatedness of ninety-seven C. jejuni isolates recovered from human, poultry, bo...
Genetic basis for mycophenolic acid production and strain-dependent production variability in Penicillium roqueforti.

PubMed

Gillot, Guillaume; Jany, Jean-Luc; Dominguez-Santos, Rebeca; Poirier, Elisabeth; Debaets, Stella; Hidalgo, Pedro I; Ullán, Ricardo V; Coton, Emmanuel; Coton, Monika

2017-04-01

Mycophenolic acid (MPA) is a secondary metabolite produced by various Penicillium species including Penicillium roqueforti. The MPA biosynthetic pathway was recently described in Penicillium brevicompactum. In this study, an in silico analysis of the P. roqueforti FM164 genome sequence localized a 23.5-kb putative MPA gene cluster. The cluster contains seven genes putatively coding seven proteins (MpaA, MpaB, MpaC, MpaDE, MpaF, MpaG, MpaH) and is highly similar (i.e. gene synteny, sequence homology) to the P. brevicompactum cluster. To confirm the involvement of this gene cluster in MPA biosynthesis, gene silencing using RNA interference targeting mpaC, encoding a putative polyketide synthase, was performed in a high MPA-producing P. roqueforti strain (F43-1). In the obtained transformants, decreased MPA production (measured by LC-Q-TOF/MS) was correlated to reduced mpaC gene expression by Q-RT-PCR. In parallel, mycotoxin quantification on multiple P. roqueforti strains suggested strain-dependent MPA-production. Thus, the entire MPA cluster was sequenced for P. roqueforti strains with contrasted MPA production and a 174bp deletion in mpaC was observed in low MPA-producers. PCRs directed towards the deleted region among 55 strains showed an excellent correlation with MPA quantification. Our results indicated the clear involvement of mpaC gene as well as surrounding cluster in P. roqueforti MPA biosynthesis. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Functional Characterization of PaLAX1, a Putative Auxin Permease, in Heterologous Plant Systems1[W][OA

PubMed Central

Hoyerová, Klára; Perry, Lucie; Hand, Paul; Laňková, Martina; Kocábek, Tomáš; May, Sean; Kottová, Jana; Pačes, Jan; Napier, Richard; Zažímalová, Eva

2008-01-01

We have isolated the cDNA of the gene PaLAX1 from a wild cherry tree (Prunus avium). The gene and its product are highly similar in sequences to both the cDNAs and the corresponding protein products of AUX/LAX-type genes, coding for putative auxin influx carriers. We have prepared and characterized transformed Nicotiana tabacum and Arabidopsis thaliana plants carrying the gene PaLAX1. We have proved that constitutive overexpression of PaLAX1 is accompanied by changes in the content and distribution of free indole-3-acetic acid, the major endogenous auxin. The increase in free indole-3-acetic acid content in transgenic plants resulted in various phenotype changes, typical for the auxin-overproducing plants. The uptake of synthetic auxin, 2,4-dichlorophenoxyacetic acid, was 3 times higher in transgenic lines compared to the wild-type lines and the treatment with the auxin uptake inhibitor 1-naphthoxyacetic acid reverted the changes caused by the expression of PaLAX1. Moreover, the agravitropic response could be restored by expression of PaLAX1 in the mutant aux1 plants, which are deficient in auxin influx carrier activity. Based on our data, we have concluded that the product of the gene PaLAX1 promotes the uptake of auxin into cells, and, as a putative auxin influx carrier, it affects the content and distribution of free endogenous auxin in transgenic plants. PMID:18184737
Long Non-Coding RNAs Responsive to Salt and Boron Stress in the Hyper-Arid Lluteño Maize from Atacama Desert.

PubMed

Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius

2018-03-20

Long non-coding RNAs (lncRNAs) have been defined as transcripts longer than 200 nucleotides, which lack significant protein coding potential and possess critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress-response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs in response to combined stress induced by salinity and excess of boron in the Lluteño maize, a tolerant maize landrace from Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved with other maize (B73, Mo17 or Palomero), with the remaining 41.9% belonging to potentially Lluteño exclusive lncRNA transcripts. According to B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs presents an unusual overall higher expression compared to protein coding genes under exposure to stressed conditions. In total, we identified 1710 putatively responsive to the combined stressed conditions of salt and boron exposure. We also identified a set of 848 stress responsive potential trans natural antisense transcripts ( trans -NAT) lncRNAs, which seems to be regulating genes associated with regulation of transcription, response to stress, response to abiotic stimulus and participating of the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed in a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress, being the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as the Atacama Desert. The information generated is a starting point to understand the genomic adaptabilities suffered by this maize to surpass this extremely stressed environment.
Long Non-Coding RNAs Responsive to Salt and Boron Stress in the Hyper-Arid Lluteño Maize from Atacama Desert

PubMed Central

Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius

2018-01-01

Long non-coding RNAs (lncRNAs) have been defined as transcripts longer than 200 nucleotides, which lack significant protein coding potential and possess critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress–response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs in response to combined stress induced by salinity and excess of boron in the Lluteño maize, a tolerant maize landrace from Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved with other maize (B73, Mo17 or Palomero), with the remaining 41.9% belonging to potentially Lluteño exclusive lncRNA transcripts. According to B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs presents an unusual overall higher expression compared to protein coding genes under exposure to stressed conditions. In total, we identified 1710 putatively responsive to the combined stressed conditions of salt and boron exposure. We also identified a set of 848 stress responsive potential trans natural antisense transcripts (trans-NAT) lncRNAs, which seems to be regulating genes associated with regulation of transcription, response to stress, response to abiotic stimulus and participating of the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed in a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress, being the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as the Atacama Desert. The information generated is a starting point to understand the genomic adaptabilities suffered by this maize to surpass this extremely stressed environment. PMID:29558449
ProClaT, a new bioinformatics tool for in silico protein reclassification: case study of DraB, a protein coded from the draTGB operon in Azospirillum brasilense.

PubMed

Rubel, Elisa Terumi; Raittz, Roberto Tadeu; Coimbra, Nilson Antonio da Rocha; Gehlen, Michelly Alves Coutinho; Pedrosa, Fábio de Oliveira

2016-12-15

Azopirillum brasilense is a plant-growth promoting nitrogen-fixing bacteria that is used as bio-fertilizer in agriculture. Since nitrogen fixation has a high-energy demand, the reduction of N 2 to NH 4 + by nitrogenase occurs only under limiting conditions of NH 4 + and O 2 . Moreover, the synthesis and activity of nitrogenase is highly regulated to prevent energy waste. In A. brasilense nitrogenase activity is regulated by the products of draG and draT. The product of the draB gene, located downstream in the draTGB operon, may be involved in the regulation of nitrogenase activity by an, as yet, unknown mechanism. A deep in silico analysis of the product of draB was undertaken aiming at suggesting its possible function and involvement with DraT and DraG in the regulation of nitrogenase activity in A. brasilense. In this work, we present a new artificial intelligence strategy for protein classification, named ProClaT. The features used by the pattern recognition model were derived from the primary structure of the DraB homologous proteins, calculated by a ProClaT internal algorithm. ProClaT was applied to this case study and the results revealed that the A. brasilense draB gene codes for a protein highly similar to the nitrogenase associated NifO protein of Azotobacter vinelandii. This tool allowed the reclassification of DraB/NifO homologous proteins, hypothetical, conserved hypothetical and those annotated as putative arsenate reductase, ArsC, as NifO-like. An analysis of co-occurrence of draB, draT, draG and of other nif genes was performed, suggesting the involvement of draB (nifO) in nitrogen fixation, however, without the definition of a specific function.

Molecular cloning and characterization of a gene encoding glutaminase from Aspergillus oryzae.

PubMed

Koibuchi, K; Nagasaki, H; Yuasa, A; Kataoka, J; Kitamoto, K

2000-07-01

A glutaminase from Aspergillus oryzae was purified and its molecular weight was determined to be 82,091 by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Purified glutaminase catalysed the hydrolysis not only of L-glutamine but also of D-glutamine. Both the molecular weight and the substrate specificity of this glutaminase were different from those reported previously [Yano et al. (1998) J Ferment Technol 66: 137-143]. On the basis of its internal amino acid sequences, we have isolated and characterized the glutaminase gene (gtaA) from A. oryzae. The gtaA gene had an open reading frame coding for 690 amino acid residues, including a signal peptide of 20 amino acid residues and a mature protein of 670 amino acid residues. In the 5'-flanking region of the gene, there were three putative CreAp binding sequences and one putative AreAp binding sequence. The gtaA structural gene was introduced into A. oryzae NS4 and a marked increase in activity was detected in comparison with the control strain. The gtaA gene was also isolated from Aspergillus nidulans on the basis of the determined nucleotide sequence of the gtaA gene from A. oryzae.
Transcriptome Sequencing, and Rapid Development and Application of SNP Markers for the Legume Pod Borer Maruca vitrata (Lepidoptera: Crambidae)

PubMed Central

Margam, Venu M.; Coates, Brad S.; Bayles, Darrell O.; Hellmich, Richard L.; Agunbiade, Tolulope; Seufferheld, Manfredo J.; Sun, Weilin; Kroemer, Jeremy A.; Ba, Malick N.; Binso-Dabire, Clementine L.; Baoua, Ibrahim; Ishiyaku, Mohammad F.; Covas, Fernando G.; Srinivasan, Ramasamy; Armstrong, Joel; Murdock, Larry L.; Pittendrigh, Barry R.

2011-01-01

The legume pod borer, Maruca vitrata (Lepidoptera: Crambidae), is an insect pest species of crops grown by subsistence farmers in tropical regions of Africa. We present the de novo assembly of 3729 contigs from 454- and Sanger-derived sequencing reads for midgut, salivary, and whole adult tissues of this non-model species. Functional annotation predicted that 1320 M. vitrata protein coding genes are present, of which 631 have orthologs within the Bombyx mori gene model. A homology-based analysis assigned M. vitrata genes into a group of paralogs, but these were subsequently partitioned into putative orthologs following phylogenetic analyses. Following sequence quality filtering, a total of 1542 putative single nucleotide polymorphisms (SNPs) were predicted within M. vitrata contig assemblies. Seventy one of 1078 designed molecular genetic markers were used to screen M. vitrata samples from five collection sites in West Africa. Population substructure may be present with significant implications in the insect resistance management recommendations pertaining to the release of biological control agents or transgenic cowpea that express Bacillus thuringiensis crystal toxins. Mutation data derived from transcriptome sequencing is an expeditious and economical source for genetic markers that allow evaluation of ecological differentiation. PMID:21754987
Genome sequence of an enhancin gene-rich nucleopolyhedrovirus (NPV) from Agrotis segetum: collinearity with Spodoptera exigua multiple NPV.

PubMed

Jakubowska, Agata K; Peters, Sander A; Ziemnicka, Jadwiga; Vlak, Just M; van Oers, Monique M

2006-03-01

The genome sequence of a Polish isolate of Agrotis segetum nucleopolyhedrovirus (AgseNPV-A) was determined and analysed. The circular genome is composed of 147,544 bp and has a G+C content of 45.7 mol%. It contains 153 putative, non-overlapping open reading frames (ORFs) encoding predicted proteins of more than 50 aa, together making up 89.8 % of the genome. The remaining 10.2 % of the DNA constitutes non-coding regions and homologous-repeat regions. One hundred and forty-three AgseNPV-A ORFs are homologues of previously reported baculovirus gene sequences. There are ten unique ORFs and they account for 3 % of the genome in total. All 62 lepidopteran baculovirus genes, including the 29 core baculovirus genes, were found in the AgseNPV-A genome. The gene content and gene order of AgseNPV-A are most similar to those of Spodoptera exigua (Se) multiple NPV and their shared homologous genes are 100 % collinear. Three putative enhancin genes were identified in the AgseNPV-A genome. In phylogenetic analysis, the AgseNPV-A enhancins form a cluster separated from enhancins of the Mamestra species NPVs.
RADH, a gene of Saccharomyces cerevisiae encoding a putative DNA helicase involved in DNA repair. Characteristics of radH mutants and sequence of the gene.

PubMed

Aboussekhra, A; Chanet, R; Zgaga, Z; Cassier-Chauvat, C; Heude, M; Fabre, F

1989-09-25

A new type of radiation-sensitive mutant of S. cerevisiae is described. The recessive radH mutation sensitizes to the lethal effect of UV radiations haploids in the G1 but not in the G2 mitotic phase. Homozygous diploids are as sensitive as G1 haploids. The UV-induced mutagenesis is depressed, while the induction of gene conversion is increased. The mutation is believed to channel the repair of lesions engaged in the mutagenic pathway into a recombination process, successful if the events involve sister-chromatids but lethal if they involve homologous chromosomes. The sequence of the RADH gene reveals that it may code for a DNA helicase, with a Mr of 134 kDa. All the consensus domains of known DNA helicases are present. Besides these consensus regions, strong homologies with the Rep and UvrD helicases of E. coli were found. The RadH putative helicase appears to belong to the set of proteins involved in the error-prone repair mechanism, at least for UV-induced lesions, and could act in coordination with the Rev3 error-prone DNA polymerase.
Novel insights into the response of Atlantic salmon (Salmo salar) to Piscirickettsia salmonis: Interplay of coding genes and lncRNAs during bacterial infection.

PubMed

Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian

2016-12-01

Despite the high prevalence and impact to Chilean salmon aquaculture of the intracellular bacterium Piscirickettsia salmonis, the molecular underpinnings of host-pathogen interactions remain unclear. Herein, the interplay of coding and non-coding transcripts has been proposed as a key mechanism involved in immune response. Therefore, the aim of this study was to evidence how coding and non-coding transcripts are modulated during the infection process of Atlantic salmon with P. salmonis. For this, RNA-seq was conducted in brain, spleen, and head kidney samples, revealing different transcriptional profiles according to bacterial load. Additionally, while most of the regulated genes annotated for diverse biological processes during infection, a common response associated with clathrin-mediated endocytosis and iron homeostasis was present in all tissues. Interestingly, while endocytosis-promoting factors and clathrin inductions were upregulated, endocytic receptors were mainly downregulated. Furthermore, the regulation of genes related to iron homeostasis suggested an intracellular accumulation of iron, a process in which heme biosynthesis/degradation pathways might play an important role. Regarding the non-coding response, 918 putative long non-coding RNAs were identified, where 425 were newly characterized for S. salar. Finally, co-localization and co-expression analyses revealed a strong correlation between the modulations of long non-coding RNAs and genes associated with endocytosis and iron homeostasis. These results represent the first comprehensive study of putative interplaying mechanisms of coding and non-coding RNAs during bacterial infection in salmonids. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Ultratight crystal packing of a 10 kDa protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Trillo-Muyo, Sergio; Jasilionis, Andrius; Domagalski, Marcin J.

2013-03-01

The crystal structure of the C-terminal domain of a putative U32 peptidase from G. thermoleovorans is reported; it is one of the most tightly packed protein structures reported to date. While small organic molecules generally crystallize forming tightly packed lattices with little solvent content, proteins form air-sensitive high-solvent-content crystals. Here, the crystallization and full structure analysis of a novel recombinant 10 kDa protein corresponding to the C-terminal domain of a putative U32 peptidase are reported. The orthorhombic crystal contained only 24.5% solvent and is therefore among the most tightly packed protein lattices ever reported.
Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout.

PubMed

Al-Tobasei, Rafet; Ali, Ali; Leeds, Timothy D; Liu, Sixin; Palti, Yniv; Kenney, Brett; Salem, Mohamed

2017-08-07

Coding/functional SNPs change the biological function of a gene and, therefore, could serve as "large-effect" genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7-93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.
Comparative analysis of long non-coding RNAs in Atlantic and Coho salmon reveals divergent transcriptome responses associated with immunity and tissue repair during sea lice infestation.

PubMed

Valenzuela-Muñoz, Valentina; Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian

2018-05-24

The increasing capacity of transcriptomic analysis by high throughput sequencing has highlighted the presence of a large proportion of transcripts that do not encode proteins. In particular, long non-coding RNAs (lncRNAs) are sequences with low coding potential and conservation among species. Moreover, cumulative evidence has revealed important roles in post-transcriptional gene modulation in several taxa. In fish, the role of lncRNAs has been scarcely studied and even less so during the immune response against sea lice. In the present study we mined for lncRNAs in Atlantic salmon (Salmo salar) and Coho salmon (Oncorhynkus kisutch), which are affected by the sea louse Caligus rogercresseyi, evaluating the degree of sequence conservation between these two fish species and their putative roles during the infection process. Herein, Atlantic and Coho salmon were infected with 35 lice/fish and evaluated after 7 and 14 days post-infestation (dpi). For RNA sequencing, samples from skin and head kidney were collected. A total of 5658/4140 and 3678/2123 lncRNAs were identified in uninfected/infected Atlantic and Coho salmon transcriptomes, respectively. Species-specific transcription patterns were observed in exclusive lncRNAs according to the tissue analyzed. Furthermore, neighbor gene GO enrichment analysis of the top 100 highly regulated lncRNAs in Atlantic salmon showed that lncRNAs were localized near genes related to the immune response. On the other hand, in Coho salmon the highly regulated lncRNAs were localized near genes involved in tissue repair processes. This study revealed high regulation of lncRNAs closely localized to immune and tissue repair-related genes in Atlantic and Coho salmon, respectively, suggesting putative roles for lncRNAs in salmon against sea lice infestation. Copyright © 2018 Elsevier Ltd. All rights reserved.
Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci

PubMed Central

Brorsson, Caroline A.; Pociot, Flemming

2014-01-01

Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

PubMed

Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

2017-12-03

A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first evidence for a significant enrichment of X motifs in the genes of an extant organism. They raise two hypotheses: the X motifs may be evolutionary relics of the primitive codes used for translation, or they may continue to play a functional role in the complex processes of genome decoding and protein synthesis.
A novel splice variant of the protein tyrosine phosphatase PTPRJ that encodes for a soluble protein involved in angiogenesis.

PubMed

Bilotta, Anna; Dattilo, Vincenzo; D'Agostino, Sabrina; Belviso, Stefania; Scalise, Stefania; Bilotta, Mariaconcetta; Gaudio, Eugenio; Paduano, Francesco; Perrotti, Nicola; Florio, Tullio; Fusco, Alfredo; Iuliano, Rodolfo; Trapasso, Francesco

2017-02-07

PTPRJ is a receptor protein tyrosine phosphatase with tumor suppressor activity. Very little is known about the role of PTPRJ ectodomain, although recently both physiological and synthetic PTPRJ ligands have been identified. A putative shorter spliced variant, coding for a 539 aa protein corresponding to the extracellular N-terminus of PTPRJ, is reported in several databases but, currently, no further information is available.Here, we confirmed that the PTPRJ short isoform (named sPTPRJ) is a soluble protein secreted into the supernatant of both endothelial and tumor cells. Like PTPRJ, also sPTPRJ undergoes post-translational modifications such as glycosylation, as assessed by sPTPRJ immunoprecipitation. To characterize its functional activity, we performed an endothelial cell tube formation assay and a wound healing assay on HUVEC cells overexpressing sPTPRJ and we found that sPTPRJ has a proangiogenic activity. We also showed that sPTPRJ expression down-regulates endothelial adhesion molecules, that is a hallmark of proangiogenic activity. Moreover, sPTPRJ mRNA levels in human high-grade glioma, one of the most angiogenic tumors, are higher in tumor samples compared to controls. Further studies will be helpful not only to clarify the way sPTPRJ works but also to supply clues to circumvent its activity in cancer therapy.
A novel splice variant of the protein tyrosine phosphatase PTPRJ that encodes for a soluble protein involved in angiogenesis

PubMed Central

Bilotta, Anna; Dattilo, Vincenzo; D'Agostino, Sabrina; Belviso, Stefania; Scalise, Stefania; Bilotta, Mariaconcetta; Gaudio, Eugenio; Paduano, Francesco; Perrotti, Nicola; Florio, Tullio; Fusco, Alfredo; Iuliano, Rodolfo; Trapasso, Francesco

2017-01-01

PTPRJ is a receptor protein tyrosine phosphatase with tumor suppressor activity. Very little is known about the role of PTPRJ ectodomain, although recently both physiological and synthetic PTPRJ ligands have been identified. A putative shorter spliced variant, coding for a 539 aa protein corresponding to the extracellular N-terminus of PTPRJ, is reported in several databases but, currently, no further information is available. Here, we confirmed that the PTPRJ short isoform (named sPTPRJ) is a soluble protein secreted into the supernatant of both endothelial and tumor cells. Like PTPRJ, also sPTPRJ undergoes post-translational modifications such as glycosylation, as assessed by sPTPRJ immunoprecipitation. To characterize its functional activity, we performed an endothelial cell tube formation assay and a wound healing assay on HUVEC cells overexpressing sPTPRJ and we found that sPTPRJ has a proangiogenic activity. We also showed that sPTPRJ expression down-regulates endothelial adhesion molecules, that is a hallmark of proangiogenic activity. Moreover, sPTPRJ mRNA levels in human high-grade glioma, one of the most angiogenic tumors, are higher in tumor samples compared to controls. Further studies will be helpful not only to clarify the way sPTPRJ works but also to supply clues to circumvent its activity in cancer therapy. PMID:28052032
Age-related macular degeneration-associated silent polymorphisms in HtrA1 impair its ability to antagonize insulin-like growth factor 1.

PubMed

Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius

2013-05-01

Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
Complete genomic sequence and taxonomic position of eel virus European X (EVEX), a rhabdovirus of European eel.

PubMed

Galinier, Richard; van Beurden, Steven; Amilhat, Elsa; Castric, Jeannette; Schoehn, Guy; Verneau, Olivier; Fazio, Géraldine; Allienne, Jean-François; Engelsma, Marc; Sasal, Pierre; Faliex, Elisabeth

2012-06-01

Eel virus European X (EVEX) was first isolated from diseased European eel Anguilla anguilla in Japan at the end of seventies. The virus was tentatively classified into the Rhabdoviridae family on the basis of morphology and serological cross reactivity. This family of viruses is organized into six genera and currently comprises approximately 200 members, many of which are still unassigned because of the lack of molecular data. This work presents the morphological, biochemical and genetic characterizations of EVEX, and proposes a taxonomic classification for this virus. We provide its complete genome sequence, plus a comprehensive sequence comparison between isolates from different geographical origins. The genome encodes the five classical structural proteins plus an overlapping open reading frame in the phosphoprotein gene, coding for a putative C protein. Phylogenic relationship with other rhabdoviruses indicates that EVEX is most closely related to the Vesiculovirus genus and shares the highest identity with trout rhabdovirus 903/87. Copyright © 2012 Elsevier B.V. All rights reserved.
Investigating the Role of RIO Protein Kinases in Caenorhabditis elegans

PubMed Central

Raymant, Greta; Bertram, Sonja E.; Esmaillie, Reza; Nadarajan, Saravanapriah; Breugelmans, Bert; Hofmann, Andreas; Gasser, Robin B.; Colaiácovo, Monica P.; Boag, Peter R.

2015-01-01

RIO protein kinases (RIOKs) are a relatively conserved family of enzymes implicated in cell cycle control and ribosomal RNA processing. Despite their functional importance, they remain a poorly understood group of kinases in multicellular organisms. Here, we show that the C. elegans genome contains one member of each of the three RIOK sub-families and that each of the genes coding for them has a unique tissue expression pattern. Our analysis showed that the gene encoding RIOK-1 (riok-1) was broadly and strongly expressed. Interestingly, the intestinal expression of riok-1 was dependent upon two putative binding sites for the oxidative and xenobiotic stress response transcription factor SKN-1. RNA interference (RNAi)-mediated knock down of riok-1 resulted in germline defects, including defects in germ line stem cell proliferation, oocyte maturation and the production of endomitotic oocytes. Taken together, our findings indicate new functions for RIOK-1 in post mitotic tissues and in reproduction. PMID:25688864
Identification of a novel aminergic-like G protein-coupled receptor in the cnidarian Renilla koellikeri.

PubMed

Bouchard, Christelle; Ribeiro, Paula; Dubé, François; Demers, Christian; Anctil, Michel

2004-10-27

Biogenic amines exert various physiological effects in cnidarians, but the receptors involved in these responses are not known. We have cloned a novel G protein-coupled receptor cDNA from an anthozoan, the sea pansy Renilla koellikeri, that shows homology to mammalian catecholamine receptors and, to a lesser extent, to peptidergic receptors. This putative receptor, named Ren2, has a DRC pattern that replaces the well-conserved DRY motif on the cytoplasmic side of the transmembrane III and lacks the cysteine residues usually found in the second extracellular loop and C-terminus tail. Both the second extracellular loop and the N-terminal tail were seen to be short (six and three amino acids, respectively). Northern blot analysis suggests that the receptor gene codes for two transcripts. Localization of these transcripts by in situ hybridization demonstrated abundant expression in the epithelium of the pharyngeal wall, the oral disk and tentacles as well as in the endodermal epithelium lining the gastrovascular cavities.
A third genotype of the human parvovirus PARV4 in sub-Saharan Africa.

PubMed

Simmonds, Peter; Douglas, Jill; Bestetti, Giovanna; Longhi, Erika; Antinori, Spinello; Parravicini, Carlo; Corbellino, Mario

2008-09-01

PARV4 is a recently discovered human parvovirus widely distributed in injecting drug users in the USA and Europe, particularly in those co-infected with human immunodeficiency virus (HIV). Like parvovirus B19, PARV4 persists in previously exposed individuals. In bone marrow and lymphoid tissue, PARV4 sequences were detected in two sub-Saharan African study subjects with AIDS but without a reported history of parenteral exposure and who were uninfected with hepatitis C virus. PARV4 variants infecting these subjects were phylogenetically distinct from genotypes 1 and 2 (formerly PARV5) that were reported previously. Analysis of near-complete genome sequences demonstrated that they should be classified as a third (equidistant) PARV4 genotype. The availability of a further near-complete genome sequence of this novel genotype facilitated identification of conserved novel open reading frames embedded in the ORF2 coding sequence; one encoded a putative protein with identifiable homology to SAT proteins of members of the genus Parvovirus.
Mutational analysis in a patient with a variant form of Gaucher disease caused by SAP-2 deficiency

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rafi, M.A.; Gala, G. de; Xunling Zhang

1993-01-01

It is now clear that the lysosomal hydrolysis of sphingolipids requires both lysosomal enzymes and so-called sphingolipid activator proteins (SAPs). One gene, called prosaposin, codes for a precursor protein that is proteolytically cut into four putative SAPs. These four SAPs, of about 80 amino acids, share some structural features but differ somewhat in their specificity. Domain 3 of prosaposin mRNA contains the coding region for SAP-2, an activator of glucocerebrosidase. While most patients with Gaucher disease store glucosylceramide due to defects in glucocerebrosidase, a few patients store this lipid in the presence of normal enzyme levels. In this paper themore » authors describe the identification of a point mutation in domain 3 of a patient who died with this variant form of Gaucher disease. Polymerase chain reaction amplification was performed in the small amount of genomic DNA available using primers generated from the intronic sequence surrounding domain 3. The patient was found to have a T-to-G substitution at position 1144 (counting from the A of ATG initiation codon) in half of the M13 recombinant clones. This changes the codon for cysteine[sub 382] to glycine. His father and unaffected brother also had this mutation, but his mother did not. She was found to have half of the normal amount of mRNA for prosaposin in her cultured skin fibroblasts. Therefore, this child inherited a point mutation in domain 3 from his father and a deficiency of all four SAPs coded for by prosaposin from his mother. 29 refs., 3 figs., 1 tab.« less
Phylogenetic distribution and expression of a penicillin-binding protein homologue, Ear and its significance in virulence of Staphylococcus aureus.

PubMed

Singh, Vineet K; Ring, Robert P; Aswani, Vijay; Stemper, Mary E; Kislow, Jennifer; Ye, Zhan; Shukla, Sanjay K

2017-12-01

Staphylococcus aureus is an opportunistic human pathogen that can cause serious infections in humans. A plethora of known and putative virulence factors are produced by staphylococci that collectively orchestrate pathogenesis. Ear protein (Escherichia coli ampicillin resistance) in S. aureus is an exoprotein in COL strain, predicted to be a superantigen, and speculated to play roles in antibiotic resistance and virulence. The goal of this study was to determine if expression of ear is modulated by single nucleotide polymorphisms in its promoter and coding sequences and whether this gene plays roles in antibiotic resistance and virulence. Promoter, coding sequences and expression of the ear gene in clinical and carriage S. aureus strains with distinct genetic backgrounds were analysed. The JE2 strain and its isogenic ear mutant were used in a systemic infection mouse model to determine the competiveness of the ear mutant.Results/Key findings. The ear gene showed a variable expression, with USA300FPR3757 showing a high-level expression compared to many of the other strains tested including some showing negligible expression. Higher expression was associated with agr type 1 but not correlated with phylogenetic relatedness of the ear gene based upon single nucleotide polymorphisms in the promoter or coding regions suggesting a complex regulation. An isogenic JE2 (USA300 background) ear mutant showed no significant difference in its growth, antibiotic susceptibility or virulence in a mouse model. Our data suggests that despite being highly expressed in a USA300 genetic background, Ear is not a significant contributor to virulence in that strain.
Putative Porin of Bradyrhizobium sp. (Lupinus) Bacteroids Induced by Glyphosate▿

PubMed Central

de María, Nuria; Guevara, Ángeles; Serra, M. Teresa; García-Luque, Isabel; González-Sama, Alfonso; de Lacoba, Mario García; de Felipe, M. Rosario; Fernández-Pascual, Mercedes

2007-01-01

Application of glyphosate (N-[phosphonomethyl] glycine) to Bradyrhizobium sp. (Lupinus)-nodulated lupin plants caused modifications in the protein pattern of bacteroids. The most significant change was the presence of a 44-kDa polypeptide in bacteroids from plants treated with the higher doses of glyphosate employed (5 and 10 mM). The polypeptide has been characterized by the amino acid sequencing of its N terminus and the isolation and nucleic acid sequencing of its encoding gene. It is putatively encoded by a single gene, and the protein has been identified as a putative porin. Protein modeling revealed the existence of several domains sharing similarity to different porins, such as a transmembrane beta-barrel. The protein has been designated BLpp, for Bradyrhizobium sp. (Lupinus) putative porin, and would be the first porin described in Bradyrhizobium sp. (Lupinus). In addition, a putative conserved domain of porins has been identified which consists of 87 amino acids, located in the BLpp sequence 30 amino acids downstream of the N-terminal region. In bacteroids, mRNA of the BLpp gene shows a basal constitutive expression that increases under glyphosate treatment, and the expression of the gene is seemingly regulated at the transcriptional level. By contrast, in free-living bacteria glyphosate treatment leads to an inhibition of BLpp mRNA accumulation, indicating a different effect of glyphosate on BLpp gene expression in bacteroids and free-living bacteria. The possible role of BLpp in a metabolite interchange between Bradyrhizobium and lupin is discussed. PMID:17557843

PUTATIVE CREATINE KINASE M-ISOFORM IN HUMAN SPERM IS IDENTIFIED AS THE 70-KILODALTON HEAT SHOCK PROTEIN HSPA2

EPA Science Inventory

THE PUTATIVE CREATINE KINASE M-ISOFORM IN HUMAN SPERM
IS IDENTIFIED AS THE 70 kDa HEAT SHOCK PROTEIN HSPA2

* Gabor Huszar1, Kathryn Stone2, David Dix3 and Lynne Vigue1
1The Sperm Physiology Laboratory, Department of Obstetrics and Gynecology, 2 W.M. Keck Foundatio...
Cloning, Expression, and Nucleotide Sequence of the Pseudomonas aeruginosa 142 ohb Genes Coding for Oxygenolytic ortho Dehalogenation of Halobenzoates

PubMed Central

Tsoi, Tamara V.; Plotnikova, Elena G.; Cole, James R.; Guerin, William F.; Bagdasarian, Michael; Tiedje, James M.

1999-01-01

We have cloned and characterized novel oxygenolytic ortho-dehalogenation (ohb) genes from 2-chlorobenzoate (2-CBA)- and 2,4-dichlorobenzoate (2,4-dCBA)-degrading Pseudomonas aeruginosa 142. Among 3,700 Escherichia coli recombinants, two clones, DH5αF′(pOD22) and DH5αF′(pOD33), converted 2-CBA to catechol and 2,4-dCBA and 2,5-dCBA to 4-chlorocatechol. A subclone of pOD33, plasmid pE43, containing the 3,687-bp minimized ohb DNA region conferred to P. putida PB2440 the ability to grow on 2-CBA as a sole carbon source. Strain PB2440(pE43) also oxidized but did not grow on 2,4-dCBA, 2,5-dCBA, or 2,6-dCBA. Terminal oxidoreductase ISPOHB structural genes ohbA and ohbB, which encode polypeptides with molecular masses of 20,253 Da (β-ISP) and 48,243 Da (α-ISP), respectively, were identified; these proteins are in accord with the 22- and 48-kDa (as determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis) polypeptides synthesized in E. coli and P. aeruginosa parental strain 142. The ortho-halobenzoate 1,2-dioxygenase activity was manifested in the absence of ferredoxin and reductase genes, suggesting that the ISPOHB utilized electron transfer components provided by the heterologous hosts. ISPOHB formed a new phylogenetic cluster that includes aromatic oxygenases featuring atypical structural-functional organization and is distant from the other members of the family of primary aromatic oxygenases. A putative IclR-type regulatory gene (ohbR) was located upstream of the ohbAB genes. An open reading frame (ohbC) of unknown function that overlaps lengthwise with ohbB but is transcribed in the opposite direction was found. The ohbC gene codes for a 48,969-Da polypeptide, in accord with the 49-kDa protein detected in E. coli. The ohb genes are flanked by an IS1396-like sequence containing a putative gene for a 39,715-Da transposase A (tnpA) at positions 4731 to 5747 and a putative gene for a 45,247-Da DNA topoisomerase I/III (top) at positions 346 to 1563. The ohb DNA region is bordered by 14-bp imperfect inverted repeats at positions 56 to 69 and 5984 to 5997. PMID:10224014
The sigma factor SigD of Mycobacterium tuberculosis putatively enhances gene expression of the septum site determining protein under stressful environments.

PubMed

Ares, Miguel A; Rios-Sarabia, Nora; De la Cruz, Miguel A; Rivera-Gutiérrez, Sandra; García-Morales, Lázaro; León-Solís, Lizbel; Espitia, Clara; Pacheco, Sabino; Cerna-Cortés, Jorge F; Helguera-Repetto, Cecilia A; García, María Jesús; González-Y-Merchand, Jorge A

2017-07-01

This work examined the expression of the septum site determining gene (ssd) of Mycobacterium tuberculosis CDC1551 and its ∆sigD mutant under different growing conditions. The results showed an up-regulation of ssd during stationary phase and starvation conditions, but not during in vitro dormancy, suggesting a putative role for SigD in the control of ssd expression mainly under lack-of-nutrients environments. Furthermore, we elucidated a putative link between ssd expression and cell elongation of bacilli at stationary phase. In addition, a -35 sigD consensus sequence was found for the ssd promoter region, reinforcing the putative regulation of ssd by SigD, and in turn, supporting this protein role during the adaptation of M. tuberculosis to some stressful environments.
In Silico Assigned Resistance Genes Confer Bifidobacterium with Partial Resistance to Aminoglycosides but Not to Β-Lactams

PubMed Central

Fouhy, Fiona; O’Connell Motherway, Mary; Fitzgerald, Gerald F.; Ross, R. Paul; Stanton, Catherine; van Sinderen, Douwe; Cotter, Paul D.

2013-01-01

Bifidobacteria have received significant attention due to their contribution to human gut health and the use of specific strains as probiotics. It is thus not surprising that there has also been significant interest with respect to their antibiotic resistance profile. Numerous culture-based studies have demonstrated that bifidobacteria are resistant to the majority of aminoglycosides, but are sensitive to β-lactams. However, limited research exists with respect to the genetic basis for the resistance of bifidobacteria to aminoglycosides. Here we performed an in-depth in silico analysis of putative Bifidobacterium-encoded aminoglycoside resistance proteins and β-lactamases and assess the contribution of these proteins to antibiotic resistance. The in silico-based screen detected putative aminoglycoside and β-lactam resistance proteins across the Bifidobacterium genus. Laboratory-based investigations of a number of representative bifidobacteria strains confirmed that despite containing putative β-lactamases, these strains were sensitive to β-lactams. In contrast, all strains were resistant to the aminoglycosides tested. To assess the contribution of genes encoding putative aminoglycoside resistance proteins in Bifidobacterium sp. two genes, namely Bbr_0651 and Bbr_1586, were targeted for insertional inactivation in B. breve UCC2003. As compared to the wild-type, the UCC2003 insertion mutant strains exhibited decreased resistance to gentamycin, kanamycin and streptomycin. This study highlights the associated risks of relying on the in silico assignment of gene function. Although several putative β-lactam resistance proteins are located in bifidobacteria, their presence does not coincide with resistance to these antibiotics. In contrast however, this approach has resulted in the identification of two loci that contribute to the aminoglycoside resistance of B. breve UCC2003 and, potentially, many other bifidobacteria. PMID:24324818
Haplotype analysis of the germacrene A synthase gene and association with cynaropicrin content and biological activities in Cynara cardunculus.

PubMed

Ferro, Ana Margarida; Ramos, Patrícia; Guerra, Ângela; Parreira, Paula; Brás, Teresa; Guerreiro, Olinda; Jerónimo, Eliana; Capel, Carmen; Capel, Juan; Yuste-Lisbona, Fernando J; Duarte, Maria F; Lozano, Rafael; Oliveira, M Margarida; Gonçalves, Sónia

2018-04-01

Cynara cardunculus: L. represents a natural source of terpenic compounds, with the predominant molecule being cynaropicrin. Cynaropicrin is gaining interest since it has been correlated to anti-hyperlipidaemia, antispasmodic and cytotoxicity activity against leukocyte cancer cells. The objective of this work was to screen a collection of C. cardunculus, from different origins, for new allelic variants in germacrene A synthase (GAS) gene involved in the cynaropicrin biosynthesis and correlate them with improved cynaropicrin content and biological activities. Using high-resolution melting, nine haplotypes were identified. The putative impact of the identified allelic variants in GAS protein was evaluated by bioinformatic tools and polymorphisms that putatively lead to protein conformational changes were described. Additionally, cynaropicrin and main pentacyclic triterpenes contents, and antithrombin, antimicrobial and antiproliferative activities were also determined in C. cardunculus leaf lipophilic-derived extracts. In this work we identified allelic variants with putative impact on GAS protein, which are significantly associated with cynaropicrin content and antiproliferative activity. The results obtained suggest that the identified polymorphisms should be explored as putative genetic markers correlated with biological properties in Cynara cardunculus.
Immunogenicity and protective efficacy of recombinant Haemophilus parasuis SH0165 putative outer membrane proteins.

PubMed

Fu, Shulin; Zhang, Minmin; Xu, Juan; Ou, Jiwen; Wang, Yan; Liu, Huazhen; Liu, Jinlin; Chen, Huanchun; Bei, Weicheng

2013-01-02

Haemophilus parasuis (H. parasuis), the causative agent of swine polyserositis, polyarthritis, and meningitis, is one of the most important bacterial diseases of pigs worldwide. Little vaccines currently exist that have a significant effect on infections with all pathogenic serovars of H. parasuis. H. parasuis putative outer membrane proteins (OMPs) are potentially essential components of more effective vaccines. Recently, the genomic sequence of H. parasuis serovar 5 strain SH0165 was completed in our laboratory, which allow us to target OMPs for the development of recombinant vaccines. In this study, we focused on 10 putative OMPs and all the putative OMPs were cloned, expressed and purified as HIS fusion proteins. Primary screening for immunoprotective potential was performed in mice challenged with an LD50 challenge. Out of these 10 OMPs three fusion proteins rGAPDH, rOapA, and rHPS-0675 were found to be protective in a mouse model of H. parasuis infection. We further evaluated the immune responses and protective efficacy of rGAPDH, rOapA, and rHPS-0675 in pig models. All three proteins elicited humoral antibody responses and conferred different levels of protection against challenge with a lethal dose of H. parasuis SH0165 in pig models. In addition, the antisera against the three individual proteins and the synergistic protein efficiently inhibited bacterial growth in a whole blood assay. The data demonstrated that the three proteins showed high value individually and the combination of rGAPDH, rOapA, and rHPS-0675 offered the best protection. Our results indicate that rGAPDH, rOapA, and rHPS-0675 induced protection against H. parasuis SH0165 infection, which may facilitate the development of a multi-component vaccine. Copyright © 2012 Elsevier Ltd. All rights reserved.
A catalog for the transcripts from the venomous structures of the caterpillar Lonomia obliqua: identification of the proteins potentially involved in the coagulation disorder and hemorrhagic syndrome

PubMed Central

Veiga, Ana B. G.; Ribeiro, José M. C.; Guimarães, Jorge A.; Francischetti, Ivo M.B.

2010-01-01

Accidents with the caterpillar Lonomia obliqua are often associated with a coagulation disorder and hemorrhagic syndrome in humans. In the present study, we have constructed cDNA libraries from two venomous structures of the caterpillar, namely the tegument and the bristle. High-throughput sequencing and bioinformatics analyses were performed in parallel. Over one thousand cDNAs were obtained and clustered to produce a database of 538 contigs and singletons (clusters) for the tegument library and 368 for the bristle library. We have thus identified dozens of full-length cDNAs coding for proteins with sequence homology to snake venom prothrombin activator, trypsin-like enzymes, blood coagulation factors and prophenoloxidase cascade activators. We also report cDNA coding for cysteine proteases, Group III phospholipase A2, C-type lectins, lipocalins, in addition to protease inhibitors including serpins, Kazal-type inhibitors, cystatins and trypsin inhibitor-like molecules. Antibacterial proteins and housekeeping genes are also described. A significant number of sequences were devoid of database matches, suggesting that their biologic function remains to be defined. We also report the N-terminus of the most abundant proteins present in the bristle, tegument, hemolymph, and "cryosecretion". Thus, we have created a catalog that contains the predicted molecular weight, isoelectric point, accession number, and putative function for each selected molecule from the venomous structures of L. obliqua. The role of these molecules in the coagulation disorder and hemorrhagic syndrome caused by envenomation with this caterpillar is discussed. All sequence information and the Supplemental Data, including Figures and Tables with hyperlinks to FASTA-formatted files for each contig and the best match to the Databases, are available at http://www.ncbi.nih.gov/projects/omes. PMID:16023793
Missing genes, multiple ORFs, and C-to-U type RNA editing in Acrasis kona (Heterolobosea, Excavata) mitochondrial DNA.

PubMed

Fu, Cheng-Jie; Sheikh, Sanea; Miao, Wei; Andersson, Siv G E; Baldauf, Sandra L

2014-08-21

Discoba (Excavata) is an ancient group of eukaryotes with great morphological and ecological diversity. Unlike the other major divisions of Discoba (Jakobida and Euglenozoa), little is known about the mitochondrial DNAs (mtDNAs) of Heterolobosea. We have assembled a complete mtDNA genome from the aggregating heterolobosean amoeba, Acrasis kona, which consists of a single circular highly AT-rich (83.3%) molecule of 51.5 kb. Unexpectedly, A. kona mtDNA is missing roughly 40% of the protein-coding genes and nearly half of the transfer RNAs found in the only other sequenced heterolobosean mtDNAs, those of Naegleria spp. Instead, over a quarter of A. kona mtDNA consists of novel open reading frames. Eleven of the 16 protein-coding genes missing from A. kona mtDNA were identified in its nuclear DNA and polyA RNA, and phylogenetic analyses indicate that at least 10 of these 11 putative nuclear-encoded mitochondrial (NcMt) proteins arose by direct transfer from the mitochondrion. Acrasis kona mtDNA also employs C-to-U type RNA editing, and 12 homologs of DYW-type pentatricopeptide repeat (PPR) proteins implicated in plant organellar RNA editing are found in A. kona nuclear DNA. A mapping of mitochondrial gene content onto a consensus phylogeny reveals a sporadic pattern of relative stasis and rampant gene loss in Discoba. Rampant loss occurred independently in the unique common lineage leading to Heterolobosea + Tsukubamonadida and later in the unique lineage leading to Acrasis. Meanwhile, mtDNA gene content appears to be remarkably stable in the Acrasis sister lineage leading to Naegleria and in their distant relatives Jakobida. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Donkey Orchid Symptomless Virus: A Viral ‘Platypus’ from Australian Terrestrial Orchids

PubMed Central

Wylie, Stephen J.; Li, Hua; Jones, Michael G. K.

2013-01-01

Complete and partial genome sequences of two isolates of an unusual new plant virus, designated Donkey orchid symptomless virus (DOSV) were identified using a high-throughput sequencing approach. The virus was identified from asymptomatic plants of Australian terrestrial orchid Diuris longifolia (Common donkey orchid) growing in a remnant forest patch near Perth, western Australia. DOSV was identified from two D. longifolia plants of 264 tested, and from at least one plant of 129 Caladenia latifolia (pink fairy orchid) plants tested. Phylogenetic analysis of the genome revealed open reading frames (ORF) encoding seven putative proteins of apparently disparate origins. A 69-kDa protein (ORF1) that overlapped the replicase shared low identity with MPs of plant tymoviruses (Tymoviridae). A 157-kDa replicase (ORF2) and 22-kDa coat protein (ORF4) shared 32% and 40% amino acid identity, respectively, with homologous proteins encoded by members of the plant virus family Alphaflexiviridae. A 44-kDa protein (ORF3) shared low identity with myosin and an autophagy protein from Squirrelpox virus. A 27-kDa protein (ORF5) shared no identity with described proteins. A 14-kDa protein (ORF6) shared limited sequence identity (26%) over a limited region of the envelope glycoprotein precursor of mammal-infecting Crimea-Congo hemorrhagic fever virus (Bunyaviridae). The putative 25-kDa movement protein (MP) (ORF7) shared limited (27%) identity with 3A-like MPs of members of the plant-infecting Tombusviridae and Virgaviridae. Transmissibility was shown when DOSV systemically infected Nicotiana benthamiana plants. Structure and organization of the domains within the putative replicase of DOSV suggests a common evolutionary origin with ‘potexvirus-like’ replicases of viruses within the Alphaflexiviridae and Tymoviridae, and the CP appears to be ancestral to CPs of allexiviruses (Alphaflexiviridae). The MP shares an evolutionary history with MPs of dianthoviruses, but the other putative proteins are distant from plant viruses. DOSV is not readily classified in current lower order virus taxa. PMID:24223974
Phloem proteomics reveals new lipid-binding proteins with a putative role in lipid-mediated signaling

DOE PAGES

Barbaglia, Allison M.; Tamot, Banita; Greve, Veronica; ...

2016-04-28

Global climate changes inversely affect our ability to grow the food required for an increasing world population. To combat future crop loss due to abiotic stress, we need to understand the signals responsible for changes in plant development and the resulting adaptations, especially the signaling molecules traveling long-distance through the plant phloem. Using a proteomics approach, we had identified several putative lipid-binding proteins in the phloem exudates. Simultaneously, we identified several complex lipids as well as jasmonates. These findings prompted us to propose that phloem (phospho-) lipids could act as long-distance developmental signals in response to abiotic stress, and thatmore » they are released, sensed, and moved by phloem lipid-binding proteins (Benning et al., 2012). Indeed, the proteins we identified include lipases that could release a signaling lipid into the phloem, putative receptor components, and proteins that could mediate lipid-movement. To test this possible protein-based lipid-signaling pathway, three of the proteins, which could potentially act in a relay, are characterized here: (I) a putative GDSL-motif lipase (II) a PIG-P-like protein, with a possible receptor-like function; (III) and PLAFP (phloem lipid-associated family protein), a predicted lipid-binding protein of unknown function. Here we show that all three proteins bind lipids, in particular phosphatidic acid (PtdOH), which is known to participate in intracellular stress signaling. Genes encoding these proteins are expressed in the vasculature, a prerequisite for phloem transport. Cellular localization studies show that the proteins are not retained in the endoplasmic reticulum but surround the cell in a spotted pattern that has been previously observed with receptors and plasmodesmatal proteins. Abiotic signals that induce the production of PtdOH also regulate the expression of GDSL-lipase and PLAFP, albeit in opposite patterns. Our findings suggest that while all three proteins are indeed lipid-binding and act in the vasculature possibly in a function related to long-distance signaling, the three proteins do not act in the same but rather in distinct pathways. Furthermore, it points toward PLAFP as a prime candidate to investigate long-distance lipid signaling in the plant drought response.« less
Phloem proteomics reveals new lipid-binding proteins with a putative role in lipid-mediated signaling

DOE Office of Scientific and Technical Information (OSTI.GOV)

Barbaglia, Allison M.; Tamot, Banita; Greve, Veronica

Global climate changes inversely affect our ability to grow the food required for an increasing world population. To combat future crop loss due to abiotic stress, we need to understand the signals responsible for changes in plant development and the resulting adaptations, especially the signaling molecules traveling long-distance through the plant phloem. Using a proteomics approach, we had identified several putative lipid-binding proteins in the phloem exudates. Simultaneously, we identified several complex lipids as well as jasmonates. These findings prompted us to propose that phloem (phospho-) lipids could act as long-distance developmental signals in response to abiotic stress, and thatmore » they are released, sensed, and moved by phloem lipid-binding proteins (Benning et al., 2012). Indeed, the proteins we identified include lipases that could release a signaling lipid into the phloem, putative receptor components, and proteins that could mediate lipid-movement. To test this possible protein-based lipid-signaling pathway, three of the proteins, which could potentially act in a relay, are characterized here: (I) a putative GDSL-motif lipase (II) a PIG-P-like protein, with a possible receptor-like function; (III) and PLAFP (phloem lipid-associated family protein), a predicted lipid-binding protein of unknown function. Here we show that all three proteins bind lipids, in particular phosphatidic acid (PtdOH), which is known to participate in intracellular stress signaling. Genes encoding these proteins are expressed in the vasculature, a prerequisite for phloem transport. Cellular localization studies show that the proteins are not retained in the endoplasmic reticulum but surround the cell in a spotted pattern that has been previously observed with receptors and plasmodesmatal proteins. Abiotic signals that induce the production of PtdOH also regulate the expression of GDSL-lipase and PLAFP, albeit in opposite patterns. Our findings suggest that while all three proteins are indeed lipid-binding and act in the vasculature possibly in a function related to long-distance signaling, the three proteins do not act in the same but rather in distinct pathways. Furthermore, it points toward PLAFP as a prime candidate to investigate long-distance lipid signaling in the plant drought response.« less
Horizontal gene transfer in Histophilus somni and its role in the evolution of pathogenic strain 2336, as determined by comparative genomic analyses

PubMed Central

2011-01-01

Background Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. Results The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemaggutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. Conclusions Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains. PMID:22111657
Horizontal gene transfer in Histophilus somni and its role in the evolution of pathogenic strain 2336, as determined by comparative genomic analyses.

PubMed

Siddaramappa, Shivakumara; Challacombe, Jean F; Duncan, Alison J; Gillaspy, Allison F; Carson, Matthew; Gipson, Jenny; Orvis, Joshua; Zaitshik, Jeremy; Barnes, Gentry; Bruce, David; Chertkov, Olga; Detter, J Chris; Han, Cliff S; Tapia, Roxanne; Thompson, Linda S; Dyer, David W; Inzana, Thomas J

2011-11-23

Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemaggutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains.
Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus.

PubMed

Nguyen, Thong T; Suryamohan, Kushal; Kuriakose, Boney; Janakiraman, Vasantharajan; Reichelt, Mike; Chaudhuri, Subhra; Guillory, Joseph; Divakaran, Neethu; Rabins, P E; Goel, Ridhi; Deka, Bhabesh; Sarkar, Suman; Ekka, Preety; Tsai, Yu-Chih; Vargas, Derek; Santhosh, Sam; Mohan, Sangeetha; Chin, Chen-Shan; Korlach, Jonas; Thomas, George; Babu, Azariah; Seshagiri, Somasekar

2018-06-12

We sequenced the Hyposidra talaca NPV (HytaNPV) double stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculovirus viruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis oblique, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motif consistent with the temporal expression of the genes observed in the RNA-seq data.
Rhodopseudomonas palustris CGA010 Proteome Implicates Extracytoplasmic Function Sigma Factor in Stress Response

DOE PAGES

Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.; ...

2015-04-08

Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σ RPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensusmore » sequence GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σ RPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.« less
Rhodopseudomonas palustris CGA010 Proteome Implicates Extracytoplasmic Function Sigma Factor in Stress Response

DOE Office of Scientific and Technical Information (OSTI.GOV)

Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.

Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σ RPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensusmore » sequence GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σ RPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.« less
High-quality permanent draft genome sequence of the extremely osmotolerant diphenol degrading bacterium Halotalea alkalilenta AW-7T, and emended description of the genus Halotalea

DOE PAGES

Ntougias, Spyridon; Lapidus, Alla; Copeland, Alex; ...

2015-08-13

Members of the genus Halotalea (family Halomonadaceae) are of high significance since they can tolerate the greatest glucose and maltose concentrations ever reported for known bacteria and are involved in the degradation of industrial effluents. Here, the characteristics and the permanent-draft genome sequence and annotation of Halotalea alkalilenta AW-7T are described. The microorganism was sequenced as a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project at the DOE Joint Genome Institute, and it is the only strain within the genus Halotalea having its genome sequenced. The genome is 4,467,826 bp longmore » and consists of 40 scaffolds with 64.62 % average GC content. A total of 4,104 genes were predicted, comprising of 4,028 protein-coding and 76 RNA genes. Most protein-coding genes (87.79 %) were assigned to a putative function. Halotalea alkalilenta AW-7T encodes the catechol and protocatechuate degradation to β-ketoadipate via the β-ketoadipate and protocatechuate ortho-cleavage degradation pathway, and it possesses the genetic ability to detoxify fluoroacetate, cyanate and acrylonitrile. Lastly, an emended description of the genus Halotalea Ntougias et al. 2007 is also provided in order to describe the delayed fermentation ability of the type strain.« less
Computational analysis of ribonomics datasets identifies long non-coding RNA targets of γ-herpesviral miRNAs.

PubMed

Sethuraman, Sunantha; Thomas, Merin; Gay, Lauren A; Renne, Rolf

2018-05-29

Ribonomics experiments involving crosslinking and immuno-precipitation (CLIP) of Ago proteins have expanded the understanding of the miRNA targetome of several organisms. These techniques, collectively referred to as CLIP-seq, have been applied to identifying the mRNA targets of miRNAs expressed by Kaposi's Sarcoma-associated herpes virus (KSHV) and Epstein-Barr virus (EBV). However, these studies focused on identifying only those RNA targets of KSHV and EBV miRNAs that are known to encode proteins. Recent studies have demonstrated that long non-coding RNAs (lncRNAs) are also targeted by miRNAs. In this study, we performed a systematic re-analysis of published datasets from KSHV- and EBV-driven cancers. We used CLIP-seq data from lymphoma cells or EBV-transformed B cells, and a crosslinking, ligation and sequencing of hybrids dataset from KSHV-infected endothelial cells, to identify novel lncRNA targets of viral miRNAs. Here, we catalog the lncRNA targetome of KSHV and EBV miRNAs, and provide a detailed in silico analysis of lncRNA-miRNA binding interactions. Viral miRNAs target several hundred lncRNAs, including a subset previously shown to be aberrantly expressed in human malignancies. In addition, we identified thousands of lncRNAs to be putative targets of human miRNAs, suggesting that miRNA-lncRNA interactions broadly contribute to the regulation of gene expression.
The Complete Mitochondrial Genome of Ctenoptilum vasava (Lepidoptera: Hesperiidae: Pyrginae) and Its Phylogenetic Implication

PubMed Central

Hao, Jiasheng; Sun, Qianqian; Zhao, Huabin; Sun, Xiaoyan; Gai, Yonghua; Yang, Qun

2012-01-01

We here report the first complete mitochondrial (mt) genome of a skipper, Ctenoptilum vasava Moore, 1865 (Lepidoptera: Hesperiidae: Pyrginae). The mt genome of the skipper is a circular molecule of 15,468 bp, containing 2 ribosomal RNA genes, 24 putative transfer RNA (tRNA), genes including an extra copy of trnS (AGN) and a tRNA-like insertion trnL (UUR), 13 protein-coding genes and an AT-rich region. All protein-coding genes (PCGs) are initiated by ATN codons and terminated by the typical stop codon TAA or TAG, except for COII which ends with a single T. The intergenic spacer sequence between trnS (AGN) and ND1 genes also contains the ATACTAA motif. The AT-rich region of 429 bp is comprised of nonrepetitive sequences, including the motif ATAGA followed by an 19 bp poly-T stretch, a microsatellite-like (AT)3 (TA)9 element next to the ATTTA motif, an 11 bp poly-A adjacent to tRNAs. Phylogenetic analyses (ML and BI methods) showed that Papilionoidea is not a natural group, and Hesperioidea is placed within the Papilionoidea as a sister to ((Pieridae + Lycaenidae) + Nymphalidae) while Papilionoidae is paraphyletic to Hesperioidea. This result is remarkably different from the traditional view where Papilionoidea and Hesperioidea are considered as two distinct superfamilies. PMID:22577351
Evidence that the Malaria Parasite Plasmodium falciparum Putative Rhoptry Protein 2 Localizes to the Golgi Apparatus throughout the Erythrocytic Cycle.

PubMed

Hallée, Stéphanie; Richard, Dave

2015-01-01

Invasion of a red blood cell by Plasmodium falciparum merozoites is an essential step in the malaria lifecycle. Several of the proteins involved in this process are stored in the apical complex of the merozoite, a structure containing secretory organelles that are released at specific times during invasion. The molecular players involved in erythrocyte invasion thus represent potential key targets for both therapeutic and vaccine-based strategies to block parasite development. In our quest to identify and characterize new effectors of invasion, we investigated the P. falciparum homologue of a P. berghei protein putatively localized to the rhoptries, the Putative rhoptry protein 2 (PbPRP2). We show that in P. falciparum, the protein colocalizes extensively with the Golgi apparatus across the asexual erythrocytic cycle. Furthermore, imaging of merozoites caught at different times during invasion show that PfPRP2 is not secreted during the process instead staying associated with the Golgi apparatus. Our evidence therefore suggests that PfPRP2 is a Golgi protein and that it is likely not a direct effector in the process of merozoite invasion.

Functional Characterization of LcpA, a Surface-Exposed Protein of Leptospira spp. That Binds the Human Complement Regulator C4BP▿

PubMed Central

Barbosa, Angela S.; Monaris, Denize; Silva, Ludmila B.; Morais, Zenaide M.; Vasconcellos, Sílvio A.; Cianciarullo, Aurora M.; Isaac, Lourdes; Abreu, Patricia A. E.

2010-01-01

We have previously shown that pathogenic leptospiral strains are able to bind C4b binding protein (C4BP). Surface-bound C4BP retains its cofactor activity, indicating that acquisition of this complement regulator may contribute to leptospiral serum resistance. In the present study, the abilities of seven recombinant putative leptospiral outer membrane proteins to interact with C4BP were evaluated. The protein encoded by LIC11947 interacted with this human complement regulator in a dose-dependent manner. The cofactor activity of C4BP bound to immobilized recombinant LIC11947 (rLIC11947) was confirmed by detecting factor I-mediated cleavage of C4b. rLIC11947 was therefore named LcpA (for leptospiral complement regulator-acquiring protein A). LcpA was shown to be an outer membrane protein by using immunoelectron microscopy, cell surface proteolysis, and Triton X-114 fractionation. The gene coding for LcpA is conserved among pathogenic leptospiral strains. This is the first characterization of a Leptospira surface protein that binds to the human complement regulator C4BP in a manner that allows this important regulator to control complement system activation mediated either by the classical pathway or by the lectin pathway. This newly identified protein may play a role in immune evasion by Leptospira spp. and may therefore represent a target for the development of a human vaccine against leptospirosis. PMID:20404075
The predicted secretome and transmembranome of the poultry red mite Dermanyssus gallinae.

PubMed

Schicht, Sabine; Qi, Weihong; Poveda, Lucy; Strube, Christina

2013-09-11

The worldwide distributed hematophagous poultry red mite Dermanyssus gallinae (De Geer, 1778) is one of the most important pests of poultry. Even though 35 acaricide compounds are available, control of D. gallinae remains difficult due to acaricide resistances as well as food safety regulations. The current study was carried out to identify putative excretory/secretory (pES) proteins of D. gallinae since these proteins play an important role in the host-parasite interaction and therefore represent potential targets for the development of novel intervention strategies. Additionally, putative transmembrane proteins (pTM) of D. gallinae were analyzed as representatives of this protein group also serve as promising targets for new control strategies. D. gallinae pES and pTM protein prediction was based on putative protein sequences of whole transcriptome data which was parsed to different bioinformatical servers (SignalP, SecretomeP, TMHMM and TargetP). Subsequently, pES and pTM protein sequences were functionally annotated by different computational tools. Computational analysis of the D. gallinae proteins identified 3,091 pES (5.6%) and 7,361 pTM proteins (13.4%). A significant proportion of pES proteins are considered to be involved in blood feeding and digestion such as salivary proteins, proteases, lipases and carbohydrases. The cysteine proteases cathepsin D and L as well as legumain, enzymes that cleave hemoglobin during blood digestion of the near related ticks, represented 6 of the top-30 BLASTP matches of the poultry red mite's secretome. Identified pTM proteins may be involved in many important biological processes including cell signaling, transport of membrane-impermeable molecules and cell recognition. Ninjurin-like proteins, whose functions in mites are still unknown, represent the most frequently occurring pTM. The current study is the first providing a mite's secretome as well as transmembranome and provides valuable insights into D. gallinae pES and pTM proteins operating in different metabolic pathways. Identifying a variety of molecules putatively involved in blood feeding may significantly contribute to the development of new therapeutic targets or vaccines against this poultry pest.
The predicted secretome and transmembranome of the poultry red mite Dermanyssus gallinae

PubMed Central

2013-01-01

Background The worldwide distributed hematophagous poultry red mite Dermanyssus gallinae (De Geer, 1778) is one of the most important pests of poultry. Even though 35 acaricide compounds are available, control of D. gallinae remains difficult due to acaricide resistances as well as food safety regulations. The current study was carried out to identify putative excretory/secretory (pES) proteins of D. gallinae since these proteins play an important role in the host-parasite interaction and therefore represent potential targets for the development of novel intervention strategies. Additionally, putative transmembrane proteins (pTM) of D. gallinae were analyzed as representatives of this protein group also serve as promising targets for new control strategies. Methods D. gallinae pES and pTM protein prediction was based on putative protein sequences of whole transcriptome data which was parsed to different bioinformatical servers (SignalP, SecretomeP, TMHMM and TargetP). Subsequently, pES and pTM protein sequences were functionally annotated by different computational tools. Results Computational analysis of the D. gallinae proteins identified 3,091 pES (5.6%) and 7,361 pTM proteins (13.4%). A significant proportion of pES proteins are considered to be involved in blood feeding and digestion such as salivary proteins, proteases, lipases and carbohydrases. The cysteine proteases cathepsin D and L as well as legumain, enzymes that cleave hemoglobin during blood digestion of the near related ticks, represented 6 of the top-30 BLASTP matches of the poultry red mite’s secretome. Identified pTM proteins may be involved in many important biological processes including cell signaling, transport of membrane-impermeable molecules and cell recognition. Ninjurin-like proteins, whose functions in mites are still unknown, represent the most frequently occurring pTM. Conclusion The current study is the first providing a mite’s secretome as well as transmembranome and provides valuable insights into D. gallinae pES and pTM proteins operating in different metabolic pathways. Identifying a variety of molecules putatively involved in blood feeding may significantly contribute to the development of new therapeutic targets or vaccines against this poultry pest. PMID:24020355
Discovery: an interactive resource for the rational selection and comparison of putative drug target proteins in malaria

PubMed Central

Joubert, Fourie; Harrison, Claudia M; Koegelenberg, Riaan J; Odendaal, Christiaan J; de Beer, Tjaart AP

2009-01-01

Background Up to half a billion human clinical cases of malaria are reported each year, resulting in about 2.7 million deaths, most of which occur in sub-Saharan Africa. Due to the over-and misuse of anti-malarials, widespread resistance to all the known drugs is increasing at an alarming rate. Rational methods to select new drug target proteins and lead compounds are urgently needed. The Discovery system provides data mining functionality on extensive annotations of five malaria species together with the human and mosquito hosts, enabling the selection of new targets based on multiple protein and ligand properties. Methods A web-based system was developed where researchers are able to mine information on malaria proteins and predicted ligands, as well as perform comparisons to the human and mosquito host characteristics. Protein features used include: domains, motifs, EC numbers, GO terms, orthologs, protein-protein interactions, protein-ligand interactions and host-pathogen interactions among others. Searching by chemical structure is also available. Results An in silico system for the selection of putative drug targets and lead compounds is presented, together with an example study on the bifunctional DHFR-TS from Plasmodium falciparum. Conclusion The Discovery system allows for the identification of putative drug targets and lead compounds in Plasmodium species based on the filtering of protein and chemical properties. PMID:19642978
Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

PubMed Central

Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H

2005-01-01

Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476
Identification and Characterization of Putative Integron-Like Elements of the Heavy-Metal-Hypertolerant Strains of Pseudomonas spp.

PubMed

Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz

2016-11-28

Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas .
In Silico Prediction and Validation of Novel RNA Binding Proteins and Residues in the Human Proteome.

PubMed

Chowdhury, Shomeek; Zhang, Jian; Kurgan, Lukasz

2018-05-28

Deciphering a complete landscape of protein-RNA interactions in the human proteome remains an elusive challenge. We computationally elucidate RNA binding proteins (RBPs) using an approach that complements previous efforts. We employ two modern complementary sequence-based methods that provide accurate predictions from the structured and the intrinsically disordered sequences, even in the absence of sequence similarity to the known RBPs. We generate and analyze putative RNA binding residues on the whole proteome scale. Using a conservative setting that ensures low, 5% false positive rate, we identify 1511 putative RBPs that include 281 known RBPs and 166 RBPs that were previously predicted. We empirically demonstrate that these overlaps are statistically significant. We also validate the putative RBPs based on two major hallmarks of their RNA binding residues: high levels of evolutionary conservation and enrichment in charged amino acids. Moreover, we show that the novel RBPs are significantly under-annotated functionally which coincides with the fact that they were not yet found to interact with RNAs. We provide two examples of our novel putative RBPs for which there is recent evidence of their interactions with RNAs. The dataset of novel putative RBPs and RNA binding residues for the future hypothesis generation is provided in the Supporting Information. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A new polymorphic and multicopy MHC gene family related to nonmammalian class I

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leelayuwat, C.; Degli-Esposti, M.A.; Abraham, L.J.

1994-12-31

The authors have used genomic analysis to characterize a region of the central major histocompatibility complex (MHC) spanning {approximately} 300 kilobases (kb) between TNF and HLA-B. This region has been suggested to carry genetic factors relevant to the development of autoimmune diseases such as myasthenia gravis (MG) and insulin dependent diabetes mellitus (IDDM). Genomic sequence was analyzed for coding potential, using two neural network programs, GRAIL and GeneParser. A genomic probe, JAB, containing putative coding sequences (PERB11) located 60 kb centromeric of HLA-B, was used for northern analysis of human tissues. Multiple transcripts were detected. Southern analysis of genomic DNAmore » and overlapping YAC clones, covering the region from BAT1 to HLA-F, indicated that there are at least five copies of PERB11, four of which are located within this region of the MHC. The partial cDNA sequence of PERB11 was obtained from poly-A RNA derived from skeletal muscle. The putative amino acid sequence of PERB11 shares {approximately} 30% identity to MHC class I molecules from various species, including reptiles, chickens, and frogs, as well as to other MHC class I-like molecules, such as the IgG FcR of the mouse and rat and the human Zn-{alpha}2-glycoprotein. From direct comparison of amino acid sequences, it is concluded that PERB11 is a distinct molecule more closely related to nonmammalian than known mammalian MHC class I molecules. Genomic sequence analysis of PERB11 from five MHC ancestral haplotypes (AH) indicated that the gene is polymorphic at both DNA and protein level. The results suggest that the authors have identified a novel polymorphic gene family with multiple copies within the MHC. 48 refs., 10 figs., 2 tabs.« less
Arabidopsis Polycomb Repressive Complex 2 binding sites contain putative GAGA factor binding motifs within coding regions of genes

PubMed Central

2013-01-01

Background Polycomb Repressive Complex 2 (PRC2) is an essential regulator of gene expression that maintains genes in a repressed state by marking chromatin with trimethylated Histone H3 lysine 27 (H3K27me3). In Arabidopsis, loss of PRC2 function leads to pleiotropic effects on growth and development thought to be due to ectopic expression of seed and embryo-specific genes. While there is some understanding of the mechanisms by which specific genes are targeted by PRC2 in animal systems, it is still not clear how PRC2 is recruited to specific regions of plant genomes. Results We used ChIP-seq to determine the genome-wide distribution of hemagglutinin (HA)-tagged FERTLIZATION INDEPENDENT ENDOSPERM (FIE-HA), the Extra Sex Combs homolog protein present in all Arabidopsis PRC2 complexes. We found that the FIE-HA binding sites co-locate with a subset of the H3K27me3 sites in the genome and that the associated genes were more likely to be de-repressed in mutants of PRC2 components. The FIE-HA binding sites are enriched for three sequence motifs including a putative GAGA factor binding site that is also found in Drosophila Polycomb Response Elements (PREs). Conclusions Our results suggest that PRC2 binding sites in plant genomes share some sequence features with Drosophila PREs. However, unlike Drosophila PREs which are located in promoters and devoid of H3K27me3, Arabidopsis FIE binding sites tend to be in gene coding regions and co-localize with H3K27me3. PMID:24001316
Isolation and Characterization of EstC, a New Cold-Active Esterase from Streptomyces coelicolor A3(2)

PubMed Central

Brault, Guillaume; Shareck, François; Hurtubise, Yves; Lépine, François; Doucet, Nicolas

2012-01-01

The genome sequence of Streptomyces coelicolor A3(2) contains more than 50 genes coding for putative lipolytic enzymes. Many studies have shown the capacity of this actinomycete to store important reserves of intracellular triacylglycerols in nutrient depletion situations. In the present study, we used genome mining of S. coelicolor to identify genes coding for putative, non-secreted esterases/lipases. Two genes were cloned and successfully overexpressed in E. coli as His-tagged fusion proteins. One of the recombinant enzymes, EstC, showed interesting cold-active esterase activity with a strong potential for the production of valuable esters. The purified enzyme displayed optimal activity at 35°C and was cold-active with retention of 25% relative activity at 10°C. Its optimal pH was 8.5–9 but the enzyme kept more than 75% of its maximal activity between pH 7.5 and 10. EstC also showed remarkable tolerance over a wide range of pH values, retaining almost full residual activity between pH 6–11. The enzyme was active toward short-chain p-nitrophenyl esters (C2–C12), displaying optimal activity with the valerate (C5) ester (k cat/K m = 737±77 s−1 mM−1). The enzyme was also very active toward short chain triglycerides such as triacetin (C2:0) and tributyrin (C4:0), in addition to showing good primary alcohol and organic solvent tolerance, suggesting it could function as an interesting candidate for organic synthesis of short-chain esters such as flavors. PMID:22396747
Proteomics reveals novel components of the Anopheles gambiae eggshell

PubMed Central

Amenya, Dolphine A.; Chou, Wayne; Li, Jianyong; Yan, Guiyun; Gershon, Paul D.; James, Anthony A.; Marinotti, Osvaldo

2010-01-01

While genome and transcriptome sequencing has revealed a large number and diversity of Anopheles gambiae predicted proteins, identifying their functions and biosynthetic pathways remains challenging. Applied mass spectrometry based proteomics in conjunction with mosquito genome and transcriptome databases were used to identify 44 proteins as putative components of the eggshell. Among the identified molecules are two vitelline membrane proteins and a group of seven putative chorion proteins. Enzymes with peroxidase, laccase and phenoloxidase activities, likely involved in cross-linking reactions that stabilize the eggshell structure, also were identified. Seven odorant binding proteins were found in association with the mosquito eggshell, although their role has yet to be demonstrated. This analysis fills a considerable gap of knowledge about proteins that build the eggshell of anopheline mosquitoes. PMID:20433845
Transcriptome analysis of the couch potato (CPO) protein reveals an expression pattern associated with early development in the salmon louse Caligus rogercresseyi.

PubMed

Gallardo-Escárate, Cristian; Valenzuela-Muñoz, Valentina; Nuñez-Acuña, Gustavo; Chávez-Mardones, Jacqueline; Maldonado-Aguayo, Waleska

2014-02-15

The couch potato (CPO) protein is a key biomolecule involved in regulating diapause through the RNA-binding process of the peripheral and central nervous systems in insects and also recently discovered in a few crustacean species. As such, ectoparasitic copepods are interesting model species that have no evidence of developmental arrest. The present study is the first to report on the cloning of a putative CPO gene from the salmon louse Caligus rogercresseyi (CrCPO), as identified by high-throughput transcriptome sequencing. In addition, the transcription expression in larvae and adults was evaluated using quantitative real-time PCR. The CrCPO cDNA sequence showed 3261 base pairs (bp), consisting of 713bp of 5' UTR, 1741bp of 3' UTR, and an open reading frame of 807bp encoding for 268 amino acids. The highly conserved RNA binding regions RNP2 (LFVSGL) and RNP1 (SPVGFVTF), as well the dimerization site (LEF), were also found. Furthermore, eight single nucleotide polymorphisms located in the untranslated regions and one located in the coding region were detected. Gene transcription analysis revealed that CrCPO has ubiquitous expression across larval stages and in adult individuals, with the highest expression from nauplius to copepodid stages. The present study suggests a putative biological function of CrCPO associated with the development of the nervous system in salmon lice and contributes molecular evidence for candidate genes related to host-parasite interactions. Copyright © 2013 Elsevier B.V. All rights reserved.
Arrangement of the Clostridium baratii F7 Toxin Gene Cluster with Identification of a σ Factor That Recognizes the Botulinum Toxin Gene Cluster Promoters

DOE PAGES

Dover, Nir; Barash, Jason R.; Burke, Julianne N.; ...

2014-05-22

Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bontmore » gene that is part of a toxin gene cluster that includes several accessory genes. In this paper, we sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ 70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. Finally, this TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.« less
Association of an SNP in a novel DREB2-like gene SiDREB2 with stress tolerance in foxtail millet [Setaria italica (L.)].

PubMed

Lata, Charu; Bhutty, Sarita; Bahadur, Ranjit Prasad; Majee, Manoj; Prasad, Manoj

2011-06-01

The DREB genes code for important plant transcription factors involved in the abiotic stress response and signal transduction. Characterization of DREB genes and development of functional markers for effective alleles is important for marker-assisted selection in foxtail millet. Here the characterization of a cDNA (SiDREB2) encoding a putative dehydration-responsive element-binding protein 2 from foxtail millet and the development of an allele-specific marker (ASM) for dehydration tolerance is reported. A cDNA clone (GenBank accession no. GT090998) coding for a putative DREB2 protein was isolated as a differentially expressed gene from a 6 h dehydration stress SSH library. A 5' RACE (rapid amplification of cDNA ends) was carried out to obtain the full-length cDNA, and sequence analysis showed that SiDREB2 encoded a polypeptide of 234 amino acids with a predicted mol. wt of 25.72 kDa and a theoretical pI of 5.14. A theoretical model of the tertiary structure shows that it has a highly conserved GCC-box-binding N-terminal domain, and an acidic C-terminus that acts as an activation domain for transcription. Based on its similarity to AP2 domains, SiDREB2 was classified into the A-2 subgroup of the DREB subfamily. Quantitative real-time PCR analysis showed significant up-regulation of SiDREB2 by dehydration (polyethylene glycol) and salinity (NaCl), while its expression was less affected by other stresses. A synonymous single nucleotide polymorphism (SNP) associated with dehydration tolerance was detected at the 558th base pair (an A/G transition) in the SiDREB2 gene in a core set of 45 foxtail millet accessions used. Based on the identified SNP, three primers were designed to develop an ASM for dehydration tolerance. The ASM produced a 261 bp fragment in all the tolerant accessions and produced no amplification in the sensitive accessions. The use of this ASM might be faster, cheaper, and more reproducible than other SNP genotyping methods, and thus will enable marker-aided breeding of foxtail millet for dehydration tolerance.
Putative outer membrane proteins of Leptospira interrogans stimulate human umbilical vein endothelial cells (HUVECS) and express during infection.

PubMed

Gómez, Ricardo M; Vieira, Monica L; Schattner, Mirta; Malaver, Elisa; Watanabe, Monica M; Barbosa, Angela S; Abreu, Patricia A E; de Morais, Zenaide M; Cifuente, Javier O; Atzingen, Marina V; Oliveira, Tatiane R; Vasconcellos, Silvio A; Nascimento, Ana L T O

2008-01-01

Cell adhesion molecules (CAMs) are surface receptors present in eukaryotic cells that mediate cell-cell or cell-extracellular matrix interactions. Vascular endothelium stimulation in vitro that lead to the upregulation of CAMs was reported for the pathogenic spirochaetes, including rLIC10365 of Leptospira interrogans. In this study, we report the cloning of LIC10507, LIC10508, LIC10509 genes of L. interrogans using Escherichia coli as a host system. The rational for selecting these sequences is due to their location in L. interrogans serovar Copenhageni genome that has a potential involvement in pathogenesis. The genes encode for predicted lipoproteins with no assigned functions. The purified recombinant proteins were capable to promote the upregulation of intercellular adhesion molecule 1 (ICAM-1) and E-selectin on monolayers of human umbilical vein endothelial cells (HUVECS). In addition, the coding sequences are expressed in the renal tubules of animal during bacterial experimental infection. The proteins are probably located at the outer membrane of the bacteria since they are detected in detergent-phase of L. interrogans Triton X-114 extract. Altogether our data suggest a possible involvement of these proteins during bacterial infection and provide new insights into the role of this region in the pathogenesis of Leptospira.
Regulatory Proteolysis in Arabidopsis-Pathogen Interactions.

PubMed

Pogány, Miklós; Dankó, Tamás; Kámán-Tóth, Evelin; Schwarczinger, Ildikó; Bozsó, Zoltán

2015-09-24

Approximately two and a half percent of protein coding genes in Arabidopsis encode enzymes with known or putative proteolytic activity. Proteases possess not only common housekeeping functions by recycling nonfunctional proteins. By irreversibly cleaving other proteins, they regulate crucial developmental processes and control responses to environmental changes. Regulatory proteolysis is also indispensable in interactions between plants and their microbial pathogens. Proteolytic cleavage is simultaneously used both by plant cells, to recognize and inactivate invading pathogens, and by microbes, to overcome the immune system of the plant and successfully colonize host cells. In this review, we present available results on the group of proteases in the model plant Arabidopsis thaliana whose functions in microbial pathogenesis were confirmed. Pathogen-derived proteolytic factors are also discussed when they are involved in the cleavage of host metabolites. Considering the wealth of review papers available in the field of the ubiquitin-26S proteasome system results on the ubiquitin cascade are not presented. Arabidopsis and its pathogens are conferred with abundant sets of proteases. This review compiles a list of those that are apparently involved in an interaction between the plant and its pathogens, also presenting their molecular partners when available.
In situ localization and tissue distribution of ostreid herpesvirus 1 proteins in infected Pacific oyster, Crassostrea gigas.

PubMed

Martenot, Claire; Segarra, Amélie; Baillon, Laury; Faury, Nicole; Houssin, Maryline; Renault, Tristan

2016-05-01

Immunohistochemistry (IHC) assays were conducted on paraffin sections from experimentally infected spat and unchallenged spat produced in hatchery to determine the tissue distribution of three viral proteins within the Pacific oyster, Crassostrea gigas. Polyclonal antibodies were produced from recombinant proteins corresponding to two putative membrane proteins and one putative apoptosis inhibitor encoded by ORF 25, 72, and 87, respectively. Results were then compared to those obtained by in situ hybridization performed on the same individuals, and showed a substantial agreement according to Landis and Koch numeric scale. Positive signals were mainly observed in connective tissue of gills, mantle, adductor muscle, heart, digestive gland, labial palps, and gonads of infected spat. Positive signals were also reported in digestive epithelia. However, few positive signals were also observed in healthy appearing oysters (unchallenged spat) and could be due to virus persistence after a primary infection. Cellular localization of staining seemed to be linked to the function of the viral protein targeted. A nucleus staining was preferentially observed with antibodies targeting the putative apoptosis inhibitor protein whereas a cytoplasmic localization was obtained using antibodies recognizing putative membrane proteins. The detection of viral proteins was often associated with histopathological changes previously reported during OsHV-1 infection by histology and transmission electron microscopy. Within the 6h after viral suspension injection, positive signals were almost at the maximal level with the three antibodies and all studied organs appeared infected at 28h post viral injection. Connective tissue appeared to be a privileged site for OsHV-1 replication even if positive signals were observed in the epithelium cells of different organs which may be interpreted as a hypothetical portal of entry or release for the virus. IHC constitutes a suited method for analyzing the early infection stages of OsHV-1 infection and a useful tool to investigate interactions between OsHV-1 and its host at a protein level. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.
Tentacle Transcriptome and Venom Proteome of the Pacific Sea Nettle, Chrysaora fuscescens (Cnidaria: Scyphozoa).

PubMed

Ponce, Dalia; Brinkman, Diane L; Potriquet, Jeremy; Mulvenna, Jason

2016-04-05

Jellyfish venoms are rich sources of toxins designed to capture prey or deter predators, but they can also elicit harmful effects in humans. In this study, an integrated transcriptomic and proteomic approach was used to identify putative toxins and their potential role in the venom of the scyphozoan jellyfish Chrysaora fuscescens. A de novo tentacle transcriptome, containing more than 23,000 contigs, was constructed and used in proteomic analysis of C. fuscescens venom to identify potential toxins. From a total of 163 proteins identified in the venom proteome, 27 were classified as putative toxins and grouped into six protein families: proteinases, venom allergens, C-type lectins, pore-forming toxins, glycoside hydrolases and enzyme inhibitors. Other putative toxins identified in the transcriptome, but not the proteome, included additional proteinases as well as lipases and deoxyribonucleases. Sequence analysis also revealed the presence of ShKT domains in two putative venom proteins from the proteome and an additional 15 from the transcriptome, suggesting potential ion channel blockade or modulatory activities. Comparison of these potential toxins to those from other cnidarians provided insight into their possible roles in C. fuscescens venom and an overview of the diversity of potential toxin families in cnidarian venoms.
Whole genome annotation and comparative genomic analyses of bio-control fungus Purpureocillium lilacinum.

PubMed

Prasad, Pushplata; Varshney, Deepti; Adholeya, Alok

2015-11-25

The fungus Purpureocillium lilacinum is widely known as a biological control agent against plant parasitic nematodes. This research article consists of genomic annotation of the first draft of whole genome sequence of P. lilacinum. The study aims to decipher the putative genetic components of the fungus involved in nematode pathogenesis by performing comparative genomic analysis with nine closely related fungal species in Hypocreales. de novo genomic assembly was done and a total of 301 scaffolds were constructed for P. lilacinum genomic DNA. By employing structural genome prediction models, 13, 266 genes coding for proteins were predicted in the genome. Approximately 73% of the predicted genes were functionally annotated using Blastp, InterProScan and Gene Ontology. A 14.7% fraction of the predicted genes shared significant homology with genes in the Pathogen Host Interactions (PHI) database. The phylogenomic analysis carried out using maximum likelihood RAxML algorithm provided insight into the evolutionary relationship of P. lilacinum. In congruence with other closely related species in the Hypocreales namely, Metarhizium spp., Pochonia chlamydosporia, Cordyceps militaris, Trichoderma reesei and Fusarium spp., P. lilacinum has large gene sets coding for G-protein coupled receptors (GPCRs), proteases, glycoside hydrolases and carbohydrate esterases that are required for degradation of nematode-egg shell components. Screening of the genome by Antibiotics & Secondary Metabolite Analysis Shell (AntiSMASH) pipeline indicated that the genome potentially codes for a variety of secondary metabolites, possibly required for adaptation to heterogeneous lifestyles reported for P. lilacinum. Significant up-regulation of subtilisin-like serine protease genes in presence of nematode eggs in quantitative real-time analyses suggested potential role of serine proteases in nematode pathogenesis. The data offer a better understanding of Purpureocillium lilacinum genome and will enhance our understanding on the molecular mechanism involved in nematophagy.
Analysis of complete genome sequence of Neorickettsia risticii: causative agent of Potomac horse fever

PubMed Central

Lin, Mingqun; Zhang, Chunbin; Gibson, Kathryn; Rikihisa, Yasuko

2009-01-01

Neorickettsia risticii is an obligate intracellular bacterium of the trematodes and mammals. Horses develop Potomac horse fever (PHF) when they ingest aquatic insects containing encysted N. risticii-infected trematodes. The complete genome sequence of N. risticii Illinois consists of a single circular chromosome of 879 977 bp and encodes 38 RNA species and 898 proteins. Although N. risticii has limited ability to synthesize amino acids and lacks many metabolic pathways, it is capable of making major vitamins, cofactors and nucleotides. Comparison with its closely related human pathogen N. sennetsu showed that 758 (88.2%) of protein-coding genes are conserved between N. risticii and N. sennetsu. Four-way comparison of genes among N. risticii and other Anaplasmataceae showed that most genes are either shared among Anaplasmataceae (525 orthologs that generally associated with housekeeping functions), or specific to each genome (>200 genes that are mostly hypothetical proteins). Genes potentially involved in the pathogenesis of N. risticii were identified, including those encoding putative outer membrane proteins, two-component systems and a type IV secretion system (T4SS). The bipolar localization of T4SS pilus protein VirB2 on the bacterial surface was demonstrated for the first time in obligate intracellular bacteria. These data provide insights toward genomic potential of N. risticii and intracellular parasitism, and facilitate our understanding of PHF pathogenesis. PMID:19661282

Analysis of complete genome sequence of Neorickettsia risticii: causative agent of Potomac horse fever.

PubMed

Lin, Mingqun; Zhang, Chunbin; Gibson, Kathryn; Rikihisa, Yasuko

2009-10-01

Neorickettsia risticii is an obligate intracellular bacterium of the trematodes and mammals. Horses develop Potomac horse fever (PHF) when they ingest aquatic insects containing encysted N. risticii-infected trematodes. The complete genome sequence of N. risticii Illinois consists of a single circular chromosome of 879 977 bp and encodes 38 RNA species and 898 proteins. Although N. risticii has limited ability to synthesize amino acids and lacks many metabolic pathways, it is capable of making major vitamins, cofactors and nucleotides. Comparison with its closely related human pathogen N. sennetsu showed that 758 (88.2%) of protein-coding genes are conserved between N. risticii and N. sennetsu. Four-way comparison of genes among N. risticii and other Anaplasmataceae showed that most genes are either shared among Anaplasmataceae (525 orthologs that generally associated with housekeeping functions), or specific to each genome (>200 genes that are mostly hypothetical proteins). Genes potentially involved in the pathogenesis of N. risticii were identified, including those encoding putative outer membrane proteins, two-component systems and a type IV secretion system (T4SS). The bipolar localization of T4SS pilus protein VirB2 on the bacterial surface was demonstrated for the first time in obligate intracellular bacteria. These data provide insights toward genomic potential of N. risticii and intracellular parasitism, and facilitate our understanding of PHF pathogenesis.
Genome-wide Expression Profiling, In Vivo DNA Binding Analysis, and Probabilistic Motif Prediction Reveal Novel Abf1 Target Genes during Fermentation, Respiration, and Sporulation in Yeast

PubMed Central

Schlecht, Ulrich; Erb, Ionas; Demougin, Philippe; Robine, Nicolas; Borde, Valérie; van Nimwegen, Erik; Nicolas, Alain

2008-01-01

The autonomously replicating sequence binding factor 1 (Abf1) was initially identified as an essential DNA replication factor and later shown to be a component of the regulatory network controlling mitotic and meiotic cell cycle progression in budding yeast. The protein is thought to exert its functions via specific interaction with its target site as part of distinct protein complexes, but its roles during mitotic growth and meiotic development are only partially understood. Here, we report a comprehensive approach aiming at the identification of direct Abf1-target genes expressed during fermentation, respiration, and sporulation. Computational prediction of the protein's target sites was integrated with a genome-wide DNA binding assay in growing and sporulating cells. The resulting data were combined with the output of expression profiling studies using wild-type versus temperature-sensitive alleles. This work identified 434 protein-coding loci as being transcriptionally dependent on Abf1. More than 60% of their putative promoter regions contained a computationally predicted Abf1 binding site and/or were bound by Abf1 in vivo, identifying them as direct targets. The present study revealed numerous loci previously unknown to be under Abf1 control, and it yielded evidence for the protein's variable DNA binding pattern during mitotic growth and meiotic development. PMID:18305101
Proteomics of the organohalide-respiring Epsilonproteobacterium Sulfurospirillum multivorans adapted to tetrachloroethene and other energy substrates

PubMed Central

Goris, Tobias; Schiffmann, Christian L.; Gadkari, Jennifer; Schubert, Torsten; Seifert, Jana; Jehmlich, Nico; von Bergen, Martin; Diekert, Gabriele

2015-01-01

Organohalide respiration is an environmentally important but poorly characterized type of anaerobic respiration. We compared the global proteome of the versatile organohalide-respiring Epsilonproteobacterium Sulfurospirillum multivorans grown with different electron acceptors (fumarate, nitrate, or tetrachloroethene [PCE]). The most significant differences in protein abundance were found for gene products of the organohalide respiration region. This genomic region encodes the corrinoid and FeS cluster containing PCE reductive dehalogenase PceA and other proteins putatively involved in PCE metabolism such as those involved in corrinoid biosynthesis. The latter gene products as well as PceA and a putative quinol dehydrogenase were almost exclusively detected in cells grown with PCE. This finding suggests an electron flow from the electron donor such as formate or pyruvate via the quinone pool and a quinol dehydrogenase to PceA and the terminal electron acceptor PCE. Two putative accessory proteins, an IscU-like protein and a peroxidase-like protein, were detected with PCE only and might be involved in PceA maturation. The proteome of cells grown with pyruvate instead of formate as electron donor indicates a route of electrons from reduced ferredoxin via an Epsilonproteobacterial complex I and the quinone pool to PCE. PMID:26387727
Crystal structure of secretory abundant heat soluble protein 4 from one of the toughest “water bears” micro‐animals Ramazzottius Varieornatus

PubMed Central

Fukuda, Yohta

2018-01-01

Abstract Though anhydrobiotic tardigrades (micro‐animals also known as water bears) possess many genes of secretory abundant heat soluble (SAHS) proteins unique to Tardigrada, their functions are unknown. A previous crystallographic study revealed that a SAHS protein (RvSAHS1) from one of the toughest tardigrades, Ramazzottius varieornatus, has a β‐barrel architecture similar to fatty acid binding proteins (FABPs) and two putative ligand binding sites (LBS1 and LBS2) where fatty acids can bind. However, some SAHS proteins such as RvSAHS4 have different sets of amino acid residues at LBS1 and LBS2, implying that they prefer other ligands and have different functions. Here RvSAHS4 was crystallized and analyzed under a condition similar to that for RvSAHS1. There was no electron density corresponding to a fatty acid at LBS1 of RvSAHS4, where a putative fatty acid was observed in RvSAHS1. Instead, LBS2 of RvSAHS4, which was composed of uncharged residues, captured a putative polyethylene glycol molecule. These results suggest that RvSAHS4 mainly uses LBS2 for the binding of uncharged molecules. PMID:29493034
Analysis of the Pantoea ananatis pan-genome reveals factors underlying its ability to colonize and interact with plant, insect and vertebrate hosts.

PubMed

De Maayer, Pieter; Chan, Wai Yin; Rubagotti, Enrico; Venter, Stephanus N; Toth, Ian K; Birch, Paul R J; Coutinho, Teresa A

2014-05-27

Pantoea ananatis is found in a wide range of natural environments, including water, soil, as part of the epi- and endophytic flora of various plant hosts, and in the insect gut. Some strains have proven effective as biological control agents and plant-growth promoters, while other strains have been implicated in diseases of a broad range of plant hosts and humans. By analysing the pan-genome of eight sequenced P. ananatis strains isolated from different sources we identified factors potentially underlying its ability to colonize and interact with hosts in both the plant and animal Kingdoms. The pan-genome of the eight compared P. ananatis strains consisted of a core genome comprised of 3,876 protein coding sequences (CDSs) and a sizeable accessory genome consisting of 1,690 CDSs. We estimate that ~106 unique CDSs would be added to the pan-genome with each additional P. ananatis genome sequenced in the future. The accessory fraction is derived mainly from integrated prophages and codes mostly for proteins of unknown function. Comparison of the translated CDSs on the P. ananatis pan-genome with the proteins encoded on all sequenced bacterial genomes currently available revealed that P. ananatis carries a number of CDSs with orthologs restricted to bacteria associated with distinct hosts, namely plant-, animal- and insect-associated bacteria. These CDSs encode proteins with putative roles in transport and metabolism of carbohydrate and amino acid substrates, adherence to host tissues, protection against plant and animal defense mechanisms and the biosynthesis of potential pathogenicity determinants including insecticidal peptides, phytotoxins and type VI secretion system effectors. P. ananatis has an 'open' pan-genome typical of bacterial species that colonize several different environments. The pan-genome incorporates a large number of genes encoding proteins that may enable P. ananatis to colonize, persist in and potentially cause disease symptoms in a wide range of plant and animal hosts.
Comparative Genomics of 12 Strains of Erwinia amylovora Identifies a Pan-Genome with a Large Conserved Core

PubMed Central

Mann, Rachel A.; Smits, Theo H. M.; Bühlmann, Andreas; Blom, Jochen; Goesmann, Alexander; Frey, Jürg E.; Plummer, Kim M.; Beer, Steven V.; Luck, Joanne; Duffy, Brion; Rodoni, Brendan

2013-01-01

The plant pathogen Erwinia amylovora can be divided into two host-specific groupings; strains infecting a broad range of hosts within the Rosaceae subfamily Spiraeoideae (e.g., Malus, Pyrus, Crataegus, Sorbus) and strains infecting Rubus (raspberries and blackberries). Comparative genomic analysis of 12 strains representing distinct populations (e.g., geographic, temporal, host origin) of E. amylovora was used to describe the pan-genome of this major pathogen. The pan-genome contains 5751 coding sequences and is highly conserved relative to other phytopathogenic bacteria comprising on average 89% conserved, core genes. The chromosomes of Spiraeoideae-infecting strains were highly homogeneous, while greater genetic diversity was observed between Spiraeoideae- and Rubus-infecting strains (and among individual Rubus-infecting strains), the majority of which was attributed to variable genomic islands. Based on genomic distance scores and phylogenetic analysis, the Rubus-infecting strain ATCC BAA-2158 was genetically more closely related to the Spiraeoideae-infecting strains of E. amylovora than it was to the other Rubus-infecting strains. Analysis of the accessory genomes of Spiraeoideae- and Rubus-infecting strains has identified putative host-specific determinants including variation in the effector protein HopX1Ea and a putative secondary metabolite pathway only present in Rubus-infecting strains. PMID:23409014
Comparative genomics of 12 strains of Erwinia amylovora identifies a pan-genome with a large conserved core.

PubMed

Mann, Rachel A; Smits, Theo H M; Bühlmann, Andreas; Blom, Jochen; Goesmann, Alexander; Frey, Jürg E; Plummer, Kim M; Beer, Steven V; Luck, Joanne; Duffy, Brion; Rodoni, Brendan

2013-01-01

The plant pathogen Erwinia amylovora can be divided into two host-specific groupings; strains infecting a broad range of hosts within the Rosaceae subfamily Spiraeoideae (e.g., Malus, Pyrus, Crataegus, Sorbus) and strains infecting Rubus (raspberries and blackberries). Comparative genomic analysis of 12 strains representing distinct populations (e.g., geographic, temporal, host origin) of E. amylovora was used to describe the pan-genome of this major pathogen. The pan-genome contains 5751 coding sequences and is highly conserved relative to other phytopathogenic bacteria comprising on average 89% conserved, core genes. The chromosomes of Spiraeoideae-infecting strains were highly homogeneous, while greater genetic diversity was observed between Spiraeoideae- and Rubus-infecting strains (and among individual Rubus-infecting strains), the majority of which was attributed to variable genomic islands. Based on genomic distance scores and phylogenetic analysis, the Rubus-infecting strain ATCC BAA-2158 was genetically more closely related to the Spiraeoideae-infecting strains of E. amylovora than it was to the other Rubus-infecting strains. Analysis of the accessory genomes of Spiraeoideae- and Rubus-infecting strains has identified putative host-specific determinants including variation in the effector protein HopX1(Ea) and a putative secondary metabolite pathway only present in Rubus-infecting strains.
The ANKK1 kinase gene and psychiatric disorders.

PubMed

Ponce, Guillermo; Pérez-González, Rocío; Aragüés, María; Palomo, Tomás; Rodríguez-Jiménez, Roberto; Jiménez-Arriero, Miguel Angel; Hoenicka, Janet

2009-07-01

The TaqIA single nucleotide polymorphism (SNP, rs1800497), which is located in the gene that codes for the putative kinase ANKK1 (ANKK1) near the termination codon of the D2 dopamine receptor gene (DRD2; chromosome 11q22-q23), is the most studied genetic variation in a broad range of psychiatric disorders and personality traits. A large number of individual genetic association studies have found that the TaqIA SNP is linked to alcoholism and antisocial traits. In addition, it has also been related to other conditions such as schizophrenia, eating disorders, and some behavioral childhood disorders. The TaqIA A1 allele is mainly associated with addictions, antisocial disorders, eating disorders, and attention-deficit/hyperactivity disorders, while the A2 allele occurs more frequently in schizophrenic and obsessive-compulsive patients. Current data show that the TaqIA polymorphism may be a marker of both DRD2 and ANKK1 genetic variants. ANKK1 would belong to a family of kinases involved in signal transduction. This raises the question of whether signaling players intervene in the pathophysiology of psychiatric disorders. Basic research on the ANKK1 protein and its putative interaction with the D2 dopamine receptor could shed light on this issue.
Prospecting Biotechnologically-Relevant Monooxygenases from Cold Sediment Metagenomes: An In Silico Approach.

PubMed

Musumeci, Matías A; Lozada, Mariana; Rial, Daniela V; Mac Cormack, Walter P; Jansson, Janet K; Sjöling, Sara; Carroll, JoLynn; Dionisi, Hebe M

2017-04-09

The goal of this work was to identify sequences encoding monooxygenase biocatalysts with novel features by in silico mining an assembled metagenomic dataset of polar and subpolar marine sediments. The targeted enzyme sequences were Baeyer-Villiger and bacterial cytochrome P450 monooxygenases (CYP153). These enzymes have wide-ranging applications, from the synthesis of steroids, antibiotics, mycotoxins and pheromones to the synthesis of monomers for polymerization and anticancer precursors, due to their extraordinary enantio-, regio-, and chemo- selectivity that are valuable features for organic synthesis. Phylogenetic analyses were used to select the most divergent sequences affiliated to these enzyme families among the 264 putative monooxygenases recovered from the ~14 million protein-coding sequences in the assembled metagenome dataset. Three-dimensional structure modeling and docking analysis suggested features useful in biotechnological applications in five metagenomic sequences, such as wide substrate range, novel substrate specificity or regioselectivity. Further analysis revealed structural features associated with psychrophilic enzymes, such as broader substrate accessibility, larger catalytic pockets or low domain interactions, suggesting that they could be applied in biooxidations at room or low temperatures, saving costs inherent to energy consumption. This work allowed the identification of putative enzyme candidates with promising features from metagenomes, providing a suitable starting point for further developments.
Prospecting Biotechnologically-Relevant Monooxygenases from Cold Sediment Metagenomes: An In Silico Approach

PubMed Central

Musumeci, Matías A.; Lozada, Mariana; Rial, Daniela V.; Mac Cormack, Walter P.; Jansson, Janet K.; Sjöling, Sara; Carroll, JoLynn; Dionisi, Hebe M.

2017-01-01

The goal of this work was to identify sequences encoding monooxygenase biocatalysts with novel features by in silico mining an assembled metagenomic dataset of polar and subpolar marine sediments. The targeted enzyme sequences were Baeyer–Villiger and bacterial cytochrome P450 monooxygenases (CYP153). These enzymes have wide-ranging applications, from the synthesis of steroids, antibiotics, mycotoxins and pheromones to the synthesis of monomers for polymerization and anticancer precursors, due to their extraordinary enantio-, regio-, and chemo- selectivity that are valuable features for organic synthesis. Phylogenetic analyses were used to select the most divergent sequences affiliated to these enzyme families among the 264 putative monooxygenases recovered from the ~14 million protein-coding sequences in the assembled metagenome dataset. Three-dimensional structure modeling and docking analysis suggested features useful in biotechnological applications in five metagenomic sequences, such as wide substrate range, novel substrate specificity or regioselectivity. Further analysis revealed structural features associated with psychrophilic enzymes, such as broader substrate accessibility, larger catalytic pockets or low domain interactions, suggesting that they could be applied in biooxidations at room or low temperatures, saving costs inherent to energy consumption. This work allowed the identification of putative enzyme candidates with promising features from metagenomes, providing a suitable starting point for further developments. PMID:28397770
Venom Gland Transcriptomic and Proteomic Analyses of the Enigmatic Scorpion Superstitionia donensis (Scorpiones: Superstitioniidae), with Insights on the Evolution of Its Venom Components.

PubMed

Santibáñez-López, Carlos E; Cid-Uribe, Jimena I; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D

2016-12-09

Venom gland transcriptomic and proteomic analyses have improved our knowledge on the diversity of the heterogeneous components present in scorpion venoms. However, most of these studies have focused on species from the family Buthidae. To gain insights into the molecular diversity of the venom components of scorpions belonging to the family Superstitioniidae, one of the neglected scorpion families, we performed a transcriptomic and proteomic analyses for the species Superstitionia donensis . The total mRNA extracted from the venom glands of two specimens was subjected to massive sequencing by the Illumina protocol, and a total of 219,073 transcripts were generated. We annotated 135 transcripts putatively coding for peptides with identity to known venom components available from different protein databases. Fresh venom collected by electrostimulation was analyzed by LC-MS/MS allowing the identification of 26 distinct components with sequences matching counterparts from the transcriptomic analysis. In addition, the phylogenetic affinities of the found putative calcins, scorpines, La1-like peptides and potassium channel κ toxins were analyzed. The first three components are often reported as ubiquitous in the venom of different families of scorpions. Our results suggest that, at least calcins and scorpines, could be used as molecular markers in phylogenetic studies of scorpion venoms.
Venom Gland Transcriptomic and Proteomic Analyses of the Enigmatic Scorpion Superstitionia donensis (Scorpiones: Superstitioniidae), with Insights on the Evolution of Its Venom Components

PubMed Central

Santibáñez-López, Carlos E.; Cid-Uribe, Jimena I.; Batista, Cesar V. F.; Ortiz, Ernesto; Possani, Lourival D.

2016-01-01

Venom gland transcriptomic and proteomic analyses have improved our knowledge on the diversity of the heterogeneous components present in scorpion venoms. However, most of these studies have focused on species from the family Buthidae. To gain insights into the molecular diversity of the venom components of scorpions belonging to the family Superstitioniidae, one of the neglected scorpion families, we performed a transcriptomic and proteomic analyses for the species Superstitionia donensis. The total mRNA extracted from the venom glands of two specimens was subjected to massive sequencing by the Illumina protocol, and a total of 219,073 transcripts were generated. We annotated 135 transcripts putatively coding for peptides with identity to known venom components available from different protein databases. Fresh venom collected by electrostimulation was analyzed by LC-MS/MS allowing the identification of 26 distinct components with sequences matching counterparts from the transcriptomic analysis. In addition, the phylogenetic affinities of the found putative calcins, scorpines, La1-like peptides and potassium channel κ toxins were analyzed. The first three components are often reported as ubiquitous in the venom of different families of scorpions. Our results suggest that, at least calcins and scorpines, could be used as molecular markers in phylogenetic studies of scorpion venoms. PMID:27941686
A novel polyomavirus from the nasal cavity of a giant panda (Ailuropoda melanoleuca).

PubMed

Qi, Dunwu; Shan, Tongling; Liu, Zhijian; Deng, Xutao; Zhang, Zhihe; Bi, Wenlei; Owens, Jacob Robert; Feng, Feifei; Zheng, Lisong; Huang, Feng; Delwart, Eric; Hou, Rong; Zhang, Wen

2017-10-27

Polyomaviruses infect a wide variety of mammalian and avian hosts with a broad spectrum of outcomes including asymptomatic infection, acute systemic disease, and tumor induction. Viral metagenomics and general PCR methods were used to detected viral nucleic acid in the samples from a diseased and healthy giant pandas. A novel polyomavirus, the giant panda polyomavirus 1 (GPPyV1) from the nasal cavity of a dead giant panda (Ailuropoda melanoleuca) was characterized. The GPPyV1 genome is 5144 bp in size and reveals five putative open-reading frames coding for the classic small and large T antigens in the early region, and the VP1, VP2 and VP3 capsid proteins in the late region. Phylogenetic analyses of the large T antigen of the GPPyV1 indicated GPPyV1 belonged to a putative new species within genus Deltapolyomavirus, clustering with four human polyomavirus species. The GPPyV1 VP1 and VP2 clustered with genus Alphapolyomavirus. Our epidemiologic study indicated that this novel polyomavirus was also detected in nasal swabs and fecal samples collected from captive healthy giant pandas. A novel polyomavirus was detected in giant pandas and its complete genome was characterized, which may cause latency infection in giant pandas.
Comparative analysis of programmed cell death pathways in filamentous fungi.

PubMed

Fedorova, Natalie D; Badger, Jonathan H; Robson, Geoff D; Wortman, Jennifer R; Nierman, William C

2005-12-08

Fungi can undergo autophagic- or apoptotic-type programmed cell death (PCD) on exposure to antifungal agents, developmental signals, and stress factors. Filamentous fungi can also exhibit a form of cell death called heterokaryon incompatibility (HI) triggered by fusion between two genetically incompatible individuals. With the availability of recently sequenced genomes of Aspergillus fumigatus and several related species, we were able to define putative components of fungi-specific death pathways and the ancestral core apoptotic machinery shared by all fungi and metazoa. Phylogenetic profiling of HI-associated proteins from four Aspergilli and seven other fungal species revealed lineage-specific protein families, orphan genes, and core genes conserved across all fungi and metazoa. The Aspergilli-specific domain architectures include NACHT family NTPases, which may function as key integrators of stress and nutrient availability signals. They are often found fused to putative effector domains such as Pfs, SesB/LipA, and a newly identified domain, HET-s/LopB. Many putative HI inducers and mediators are specific to filamentous fungi and not found in unicellular yeasts. In addition to their role in HI, several of them appear to be involved in regulation of cell cycle, development and sexual differentiation. Finally, the Aspergilli possess many putative downstream components of the mammalian apoptotic machinery including several proteins not found in the model yeast, Saccharomyces cerevisiae. Our analysis identified more than 100 putative PCD associated genes in the Aspergilli, which may help expand the range of currently available treatments for aspergillosis and other invasive fungal diseases. The list includes species-specific protein families as well as conserved core components of the ancestral PCD machinery shared by fungi and metazoa.
The complete mitochondrial genome of threatened chocolate mahseer (Neolissochilus hexagonolepis) and its phylogeny.

PubMed

Sahoo, Prabhati Kumari; Goel, Chirag; Kumar, Rohit; Dhama, Nisha; Ali, Shahnawaz; Sarma, Dandadhar; Nanda, Prasanta; Barat, Ashoktaru

2015-10-10

The chocolate mahseer (Neolissochilus hexagonolepis) is an important food and game fish of North Eastern India. To study the phylogenetic status we sequenced the complete mitochondrial genome of N. hexagonolepis. The mitogenome is 16,563 bp in length and composed of 13 protein coding genes, 22 tRNAs, 2 rRNAs and one putative control region. The overall base composition was A 31.8%, T 25.0%, G 15.8%, C 27.4% and A+T content 56.9%, G+C content 43.1%. The phylogenetic analysis using the complete mitochondrial genome revealed that the chocolate mahseer belonged to same clade of mahseer group of fishes but different from genera Barbus and Acrossocheilus. The present study will be helpful for the evolution and conservation genetic studies of N. hexagonolepis. Copyright © 2015 Elsevier B.V. All rights reserved.
Analysis of Putative Apoplastic Effectors from the Nematode, Globodera rostochiensis, and Identification of an Expansin-Like Protein That Can Induce and Suppress Host Defenses

PubMed Central

Ali, Shawkat; Magne, Maxime; Chen, Shiyan; Côté, Olivier; Stare, Barbara Gerič; Obradovic, Natasa; Jamshaid, Lubna; Wang, Xiaohong; Bélair, Guy; Moffett, Peter

2015-01-01

The potato cyst nematode, Globodera rostochiensis, is an important pest of potato. Like other pathogens, plant parasitic nematodes are presumed to employ effector proteins, secreted into the apoplast as well as the host cytoplasm, to alter plant cellular functions and successfully infect their hosts. We have generated a library of ORFs encoding putative G. rostochiensis putative apoplastic effectors in vectors for expression in planta. These clones were assessed for morphological and developmental effects on plants as well as their ability to induce or suppress plant defenses. Several CLAVATA3/ESR-like proteins induced developmental phenotypes, whereas predicted cell wall-modifying proteins induced necrosis and chlorosis, consistent with roles in cell fate alteration and tissue invasion, respectively. When directed to the apoplast with a signal peptide, two effectors, an ubiquitin extension protein (GrUBCEP12) and an expansin-like protein (GrEXPB2), suppressed defense responses including NB-LRR signaling induced in the cytoplasm. GrEXPB2 also elicited defense response in species- and sequence-specific manner. Our results are consistent with the scenario whereby potato cyst nematodes secrete effectors that modulate host cell fate and metabolism as well as modifying host cell walls. Furthermore, we show a novel role for an apoplastic expansin-like protein in suppressing intra-cellular defense responses. PMID:25606855
Analysis of putative apoplastic effectors from the nematode, Globodera rostochiensis, and identification of an expansin-like protein that can induce and suppress host defenses.

PubMed

Ali, Shawkat; Magne, Maxime; Chen, Shiyan; Côté, Olivier; Stare, Barbara Gerič; Obradovic, Natasa; Jamshaid, Lubna; Wang, Xiaohong; Bélair, Guy; Moffett, Peter

2015-01-01

The potato cyst nematode, Globodera rostochiensis, is an important pest of potato. Like other pathogens, plant parasitic nematodes are presumed to employ effector proteins, secreted into the apoplast as well as the host cytoplasm, to alter plant cellular functions and successfully infect their hosts. We have generated a library of ORFs encoding putative G. rostochiensis putative apoplastic effectors in vectors for expression in planta. These clones were assessed for morphological and developmental effects on plants as well as their ability to induce or suppress plant defenses. Several CLAVATA3/ESR-like proteins induced developmental phenotypes, whereas predicted cell wall-modifying proteins induced necrosis and chlorosis, consistent with roles in cell fate alteration and tissue invasion, respectively. When directed to the apoplast with a signal peptide, two effectors, an ubiquitin extension protein (GrUBCEP12) and an expansin-like protein (GrEXPB2), suppressed defense responses including NB-LRR signaling induced in the cytoplasm. GrEXPB2 also elicited defense response in species- and sequence-specific manner. Our results are consistent with the scenario whereby potato cyst nematodes secrete effectors that modulate host cell fate and metabolism as well as modifying host cell walls. Furthermore, we show a novel role for an apoplastic expansin-like protein in suppressing intra-cellular defense responses.
Variant discovery in the sheep milk transcriptome using RNA sequencing.

PubMed

Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Klopp, Christophe; Tosser-Klopp, Gwenola; Arranz, Juan José

2017-02-15

The identification of genetic variation underlying desired phenotypes is one of the main challenges of current livestock genetic research. High-throughput transcriptome sequencing (RNA-Seq) offers new opportunities for the detection of transcriptome variants (SNPs and short indels) in different tissues and species. In this study, we used RNA-Seq on Milk Sheep Somatic Cells (MSCs) with the goal of characterizing the genetic variation within the coding regions of the milk transcriptome in Churra and Assaf sheep, two common dairy sheep breeds farmed in Spain. A total of 216,637 variants were detected in the MSCs transcriptome of the eight ewes analyzed. Among them, a total of 57,795 variants were detected in the regions harboring Quantitative Trait Loci (QTL) for milk yield, protein percentage and fat percentage, of which 21.44% were novel variants. Among the total variants detected, 561 (2.52%) and 1,649 (7.42%) were predicted to produce high or moderate impact changes in the corresponding transcriptional unit, respectively. In the functional enrichment analysis of the genes positioned within selected QTL regions harboring novel relevant functional variants (high and moderate impact), the KEGG pathway with the highest enrichment was "protein processing in endoplasmic reticulum". Additionally, a total of 504 and 1,063 variants were identified in the genes encoding principal milk proteins and molecules involved in the lipid metabolism, respectively. Of these variants, 20 mutations were found to have putative relevant effects on the encoded proteins. We present herein the first transcriptomic approach aimed at identifying genetic variants of the genes expressed in the lactating mammary gland of sheep. Through the transcriptome analysis of variability within regions harboring QTL for milk yield, protein percentage and fat percentage, we have found several pathways and genes that harbor mutations that could affect dairy production traits. Moreover, remarkable variants were also found in candidate genes coding for major milk proteins and proteins related to milk fat metabolism. Several of the SNPs found in this study could be included as suitable markers in genotyping platforms or custom SNP arrays to perform association analyses in commercial populations and apply genomic selection protocols in the dairy production industry.
Long non-coding RNA HOTAIR, a c-Myc activated driver of malignancy, negatively regulates miRNA-130a in gallbladder cancer

PubMed Central

2014-01-01

Background Protein coding genes account for only about 2% of the human genome, whereas the vast majority of transcripts are non-coding RNAs including long non-coding RNAs. A growing volume of literature has proposed that lncRNAs are important players in cancer. HOTAIR was previously shown to be an oncogene and negative prognostic factor in a variety of cancers. However, the factors that contribute to its upregulation and the interaction between HOTAIR and miRNAs are largely unknown. Methods A computational screen of HOTAIR promoter was conducted to search for transcription-factor-binding sites. HOTAIR promoter activities were examined by luciferase reporter assay. The function of the c-Myc binding site in the HOTAIR promoter region was tested by a promoter assay with nucleotide substitutions in the putative E-box. The association of c-Myc with the HOTAIR promoter in vivo was confirmed by chromatin immunoprecipitation assay and Electrophoretic mobility shift assay. A search for miRNAs with complementary base paring with HOTAIR was performed utilizing online software program. Gain and loss of function approaches were employed to investigate the expression changes of HOTAIR or miRNA-130a. The expression levels of HOTAIR, c-Myc and miRNA-130a were examined in 65 matched pairs of gallbladder cancer tissues. The effects of HOTAIR and miRNA-130a on gallbladder cancer cell invasion and proliferation was tested using in vitro cell invasion and flow cytometric assays. Results We demonstrate that HOTAIR is a direct target of c-Myc through interaction with putative c-Myc target response element (RE) in the upstream region of HOTAIR in gallbladder cancer cells. A positive correlation between c-Myc and HOTAIR mRNA levels was observed in gallbladder cancer tissues. We predicted that HOTAIR harbors a miRNA-130a binding site. Our data showed that this binding site is vital for the regulation of miRNA-130a by HOTAIR. Moreover, a negative correlation between HOTAIR and miRNA-130a was observed in gallbladder cancer tissues. Finally, we demonstrate that the oncogenic activity of HOTAIR is in part through its negative regulation of miRNA-130a. Conclusion Together, these results suggest that HOTAIR is a c-Myc-activated driver of malignancy, which acts in part through repression of miRNA-130a. PMID:24953832
Mutations in a novel gene, NHS, cause the pleiotropic effects of Nance-Horan syndrome, including severe congenital cataract, dental anomalies, and mental retardation.

PubMed

Burdon, Kathryn P; McKay, James D; Sale, Michèle M; Russell-Eggitt, Isabelle M; Mackey, David A; Wirth, M Gabriela; Elder, James E; Nicoll, Alan; Clarke, Michael P; FitzGerald, Liesel M; Stankovich, James M; Shaw, Marie A; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K; Thomas, Tim; Gécz, Jozef; Craig, Jamie E

2003-11-01

Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called "NHS," in five families. The NHS gene encompasses approximately 650 kb of genomic DNA, coding for a 1,630-amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development.

Mutations in a Novel Gene, NHS, Cause the Pleiotropic Effects of Nance-Horan Syndrome, Including Severe Congenital Cataract, Dental Anomalies, and Mental Retardation

PubMed Central

Burdon, Kathryn P.; McKay, James D.; Sale, Michèle M.; Russell-Eggitt, Isabelle M.; Mackey, David A.; Wirth, M. Gabriela; Elder, James E.; Nicoll, Alan; Clarke, Michael P.; FitzGerald, Liesel M.; Stankovich, James M.; Shaw, Marie A.; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K.; Thomas, Tim; Gécz, Jozef; Craig, Jamie E.

2003-01-01

Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called “NHS,” in five families. The NHS gene encompasses ∼650 kb of genomic DNA, coding for a 1,630–amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development. PMID:14564667
PepLine: a software pipeline for high-throughput direct mapping of tandem mass spectrometry data on genomic sequences.

PubMed

Ferro, Myriam; Tardif, Marianne; Reguer, Erwan; Cahuzac, Romain; Bruley, Christophe; Vermat, Thierry; Nugues, Estelle; Vigouroux, Marielle; Vandenbrouck, Yves; Garin, Jérôme; Viari, Alain

2008-05-01

PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.
Integrated genome sequence and linkage map of physic nut (Jatropha curcas L.), a biodiesel plant.

PubMed

Wu, Pingzhi; Zhou, Changpin; Cheng, Shifeng; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Chen, Yanbo; Chen, Yan; Ni, Peixiang; Wang, Ying; Xu, Xun; Huang, Ying; Song, Chi; Wang, Zhiwen; Shi, Nan; Zhang, Xudong; Fang, Xiaohua; Yang, Qing; Jiang, Huawu; Chen, Yaping; Li, Meiru; Wang, Ying; Chen, Fan; Wang, Jun; Wu, Guojiang

2015-03-01

The family Euphorbiaceae includes some of the most efficient biomass accumulators. Whole genome sequencing and the development of genetic maps of these species are important components in molecular breeding and genetic improvement. Here we report the draft genome of physic nut (Jatropha curcas L.), a biodiesel plant. The assembled genome has a total length of 320.5 Mbp and contains 27,172 putative protein-coding genes. We established a linkage map containing 1208 markers and anchored the genome assembly (81.7%) to this map to produce 11 pseudochromosomes. After gene family clustering, 15,268 families were identified, of which 13,887 existed in the castor bean genome. Analysis of the genome highlighted specific expansion and contraction of a number of gene families during the evolution of this species, including the ribosome-inactivating proteins and oil biosynthesis pathway enzymes. The genomic sequence and linkage map provide a valuable resource not only for fundamental and applied research on physic nut but also for evolutionary and comparative genomics analysis, particularly in the Euphorbiaceae. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
A Cluster of Cuticle Protein Genes of Drosophila Melanogaster at 65a: Sequence, Structure and Evolution

PubMed Central

Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.

1997-01-01

A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064
Cloning, Sequencing, and Role in Virulence of Two Phospholipases (A1 and C) from Mesophilic Aeromonas sp. Serogroup O:34

PubMed Central

Merino, Susana; Aguilar, Alicia; Nogueras, Maria Mercedes; Regue, Miguel; Swift, Simon; Tomás, Juan M.

1999-01-01

Two different representative recombinant clones encoding Aeromonas hydrophila lipases were found upon screening on tributyrin (phospholipase A1) and egg yolk agar (lecithinase-phospholipase C) plates of a cosmid-based genomic library of Aeromonas hydrophila AH-3 (serogroup O34) introduced into Escherichia coli DH5α. Subcloning, nucleotide sequencing, and in vitro-coupled transcription-translation experiments showed that the phospholipase A1 (pla) and C (plc) genes code for an 83-kDa putative lipoprotein and a 65-kDa protein, respectively. Defined insertion mutants of A. hydrophila AH-3 defective in either pla or plc genes were defective in phospholipase A1 and C activities, respectively. Lecithinase (phospholipase C) was shown to be cytotoxic but nonhemolytic or poorly hemolytic. A. hydrophila AH-3 plc mutants showed a more than 10-fold increase in their 50% lethal dose on fish and mice, and complementation of the plc single gene on these mutants abolished this effect, suggesting that Plc protein is a virulence factor in the mesophilic Aeromonas sp. serogroup O:34 infection process. PMID:10417167
Characterization of the cryptic plasmid pOfk55 from Legionella pneumophila and construction of a pOfk55-derived shuttle vector.

PubMed

Nishida, Takashi; Watanabe, Kenta; Tachibana, Masato; Shimizu, Takashi; Watarai, Masahisa

2017-03-01

In this study, a cryptic plasmid pOfk55 from Legionella pneumophila was isolated and characterized. pOfk55 comprised 2584bp with a GC content of 37.3% and contained three putative open reading frames (ORFs). orf1 encoded a protein of 195 amino acids and the putative protein shared 39% sequence identity with a putative plasmid replication protein RepL. ORF1 was needed for replication in L. pneumophila but pOfk55 did not replicate in Escherichia coli. orf2 and orf3 encoded putative hypothetical proteins of 114 amino acids and 78 amino acids, respectively, but the functions of the putative proteins ORF2 and OFR3 are not clear. The transfer mechanism for pOfk55 was independent on the type IVB secretion system in the original host. A L. pneumophila-E. coli shuttle vector, pNT562 (5058bp, Km R ), was constructed by In-Fusion Cloning of pOfk55 with a kanamycin-resistance gene from pUTmini-Tn5Km and the origin of replication from pBluescript SK(+) (pNT561). Multiple cloning sites from pBluescript SK(+) as well as the tac promoter region and lacI gene from pAM239-GFP were inserted into pNT561 to construct pNT562. The transformation efficiency of pNT562 in L. pneumophila strains ranged from 1.6×10 1 to 1.0×10 5 CFU/ng. The relative number of pNT562 was estimated at 5.7±1.0 copies and 73.6% of cells maintained the plasmid after 1week in liquid culture without kanamycin. A green fluorescent protein (GFP) expression vector, pNT563, was constructed by ligating pNT562 with the gfpmut3 gene from pAM239-GFP. pNT563 was introduced into L. pneumophila Lp02 and E. coli DH5α, and both strains expressed GFP successfully. These results suggest that the shuttle vector is useful for genetic studies in L. pneumophila. Copyright © 2017 Elsevier Inc. All rights reserved.
Selective enrichment of metal-binding proteins based on magnetic core/shell microspheres functionalized with metal cations.

PubMed

Fang, Caiyun; Zhang, Lei; Zhang, Xiaoqin; Lu, Haojie

2015-06-21

Metal binding proteins play many important roles in a broad range of biological processes. Characterization of metal binding proteins is important for understanding their structure and biological functions, thus leading to a clear understanding of metal associated diseases. The present study is the first to investigate the effectiveness of magnetic microspheres functionalized with metal cations (Ca(2+), Cu(2+), Zn(2+) and Fe(3+)) as the absorbent matrix in IMAC technology to enrich metal containing/binding proteins. The putative metal binding proteins in rat liver were then globally characterized by using this strategy which is very easy to handle and can capture a number of metal binding proteins effectively. In total, 185 putative metal binding proteins were identified from rat liver including some known less abundant and membrane-bound metal binding proteins such as Plcg1, Acsl5, etc. The identified proteins are involved in many important processes including binding, catalytic activity, translation elongation factor activity, electron carrier activity, and so on.
Genome of turbot rhabdovirus exhibits unusual non-coding regions and an additional ORF that could be expressed in fish cell.

PubMed

Zhu, Ruo-Lin; Lei, Xiao-Ying; Ke, Fei; Yuan, Xiu-Ping; Zhang, Qi-Ya

2011-02-01

Genomic sequence of Scophthalmus maximus rhabdovirus (SMRV) isolated from diseased turbot has been characterized. The complete genome of SMRV comprises 11,492 nucleotides and encodes five typical rhabdovirus genes N, P, M, G and L. In addition, two open reading frames (ORF) are predicted overlapping with P gene, one upstream of P and smaller than P (temporarily called Ps), and another in P gene which may encodes a protein similar to the vesicular stomatitis virus C protein. The C ORF is contained within the P ORF. The five typical proteins share the highest sequence identities (48.9%) with the corresponding proteins of rhabdoviruses in genus Vesiculovirus. Phylogenetic analysis of partial L protein sequence indicates that SMRV is close to genus Vesiculovirus. The first 13 nucleotides at the ends of the SMRV genome are absolutely inverse complementarity. The gene junctions between the five genes show conserved polyadenylation signal (CATGA(7)) and intergenic dinucleotide (CT) followed by putative transcription initiation sequence A(A/G)(C/G)A(A/G/T), which are different from known rhabdoviruses. The entire Ps ORF was cloned and expressed, and used to generate polyclonal antibody in mice. One obvious band could be detected in SMRV-infected carp leucocyte cells (CLCs) by anti-Ps/C serum via Western blot, and the subcellular localization of Ps-GFP fusion protein exhibited cytoplasm distribution as multiple punctuate or doughnut shaped foci of uneven size. Copyright Â© 2010 Elsevier B.V. All rights reserved.
Mutations in Elongation Factor Ef-1α Affect the Frequency of Frameshifting and Amino Acid Misincorporation in Saccharomyces Cerevisiae

PubMed Central

Sandbaken, M. G.; Culbertson, M. R.

1988-01-01

A mutational analysis of the eukaryotic elongation factor EF-1α indicates that this protein functions to limit the frequency of errors during genetic code translation. We found that both amino acid misincorporation and reading frame errors are controlled by EF-1α. In order to examine the function of this protein, the TEF2 gene, which encodes EF-1α in Saccharomyces cerevisiae, was mutagenized in vitro with hydroxylamine. Sixteen independent TEF2 alleles were isolated by their ability to suppress frameshift mutations. DNA sequence analysis identified eight different sites in the EF-1α protein that elevate the frequency of mistranslation when mutated. These sites are located in two different regions of the protein. Amino acid substitutions located in or near the GTP-binding and hydrolysis domain of the protein cause suppression of frameshift and nonsense mutations. These mutations may effect mistranslation by altering the binding or hydrolysis of GTP. Amino acid substitutions located adjacent to a putative aminoacyl-tRNA binding region also suppress frameshift and nonsense mutations. These mutations may alter the binding of aminoacyl-tRNA by EF-1α. The identification of frameshift and nonsense suppressor mutations in EF-1α indicates a role for this protein in limiting amino acid misincorporation and reading frame errors. We suggest that these types of errors are controlled by a common mechanism or closely related mechanisms. PMID:3066688
Identification of a putative protein profile associated with tamoxifen therapy resistance in breast cancer.

PubMed

Umar, Arzu; Kang, Hyuk; Timmermans, Annemieke M; Look, Maxime P; Meijer-van Gelder, Marion E; den Bakker, Michael A; Jaitly, Navdeep; Martens, John W M; Luider, Theo M; Foekens, John A; Pasa-Tolić, Ljiljana

2009-06-01

Tamoxifen resistance is a major cause of death in patients with recurrent breast cancer. Current clinical factors can correctly predict therapy response in only half of the treated patients. Identification of proteins that are associated with tamoxifen resistance is a first step toward better response prediction and tailored treatment of patients. In the present study we intended to identify putative protein biomarkers indicative of tamoxifen therapy resistance in breast cancer using nano-LC coupled with FTICR MS. Comparative proteome analysis was performed on approximately 5,500 pooled tumor cells (corresponding to approximately 550 ng of protein lysate/analysis) obtained through laser capture microdissection (LCM) from two independently processed data sets (n = 24 and n = 27) containing both tamoxifen therapy-sensitive and therapy-resistant tumors. Peptides and proteins were identified by matching mass and elution time of newly acquired LC-MS features to information in previously generated accurate mass and time tag reference databases. A total of 17,263 unique peptides were identified that corresponded to 2,556 non-redundant proteins identified with > or = 2 peptides. 1,713 overlapping proteins between the two data sets were used for further analysis. Comparative proteome analysis revealed 100 putatively differentially abundant proteins between tamoxifen-sensitive and tamoxifen-resistant tumors. The presence and relative abundance for 47 differentially abundant proteins were verified by targeted nano-LC-MS/MS in a selection of unpooled, non-microdissected discovery set tumor tissue extracts. ENPP1, EIF3E, and GNB4 were significantly associated with progression-free survival upon tamoxifen treatment for recurrent disease. Differential abundance of our top discriminating protein, extracellular matrix metalloproteinase inducer, was validated by tissue microarray in an independent patient cohort (n = 156). Extracellular matrix metalloproteinase inducer levels were higher in therapy-resistant tumors and significantly associated with an earlier tumor progression following first line tamoxifen treatment (hazard ratio, 1.87; 95% confidence interval, 1.25-2.80; p = 0.002). In summary, comparative proteomics performed on laser capture microdissection-derived breast tumor cells using nano-LC-FTICR MS technology revealed a set of putative biomarkers associated with tamoxifen therapy resistance in recurrent breast cancer.
Characterization by Suppression Subtractive Hybridization of Transcripts That Are Differentially Expressed in Leaves of Anthracnose-Resistant Ramie Cultivar.

PubMed

Xuxia, Wang; Jie, Chen; Bo, Wang; Lijun, Liu; Hui, Jiang; Diluo, Tang; Dingxiang, Peng

2012-01-01

For the purpose of screening putative anthracnose resistance-related genes of ramie ( Boehmeria nivea L. Gaud), a cDNA library was constructed by suppression subtractive hybridization using anthracnose-resistant cultivar Huazhu no. 4. The cDNAs from Huazhu no. 4, which were infected with Colletotrichum gloeosporioides , were used as the tester and cDNAs from uninfected Huazhu no. 4 as the driver. Sequencing analysis and homology searching showed that these clones represented 132 single genes, which were assigned to functional categories, including 14 putative cellular functions, according to categories established for Arabidopsis . These 132 genes included 35 disease resistance and stress tolerance-related genes including putative heat-shock protein 90, metallothionein, PR-1.2 protein, catalase gene, WRKY family genes, and proteinase inhibitor-like protein. Partial disease-related genes were further analyzed by reverse transcription PCR and RNA gel blot. These expressed sequence tags are the first anthracnose resistance-related expressed sequence tags reported in ramie.
Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prody, C.A.; Zevin-Sonkin, D.; Gnatt, A.

1987-06-01

To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase and Torpedo electric organ true acetylcholinesterase. Using these probes, the authors isolated several cDNA clones from lambdagt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. Inmore » RNA blots of poly(A)/sup +/ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These finding demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species.« less
Discovery of genes related to insecticide resistance in Bactrocera dorsalis by functional genomic analysis of a de novo assembled transcriptome.

PubMed

Hsu, Ju-Chun; Chien, Ting-Ying; Hu, Chia-Cheng; Chen, Mei-Ju May; Wu, Wen-Jer; Feng, Hai-Tung; Haymer, David S; Chen, Chien-Yu

2012-01-01

Insecticide resistance has recently become a critical concern for control of many insect pest species. Genome sequencing and global quantization of gene expression through analysis of the transcriptome can provide useful information relevant to this challenging problem. The oriental fruit fly, Bactrocera dorsalis, is one of the world's most destructive agricultural pests, and recently it has been used as a target for studies of genetic mechanisms related to insecticide resistance. However, prior to this study, the molecular data available for this species was largely limited to genes identified through homology. To provide a broader pool of gene sequences of potential interest with regard to insecticide resistance, this study uses whole transcriptome analysis developed through de novo assembly of short reads generated by next-generation sequencing (NGS). The transcriptome of B. dorsalis was initially constructed using Illumina's Solexa sequencing technology. Qualified reads were assembled into contigs and potential splicing variants (isotigs). A total of 29,067 isotigs have putative homologues in the non-redundant (nr) protein database from NCBI, and 11,073 of these correspond to distinct D. melanogaster proteins in the RefSeq database. Approximately 5,546 isotigs contain coding sequences that are at least 80% complete and appear to represent B. dorsalis genes. We observed a strong correlation between the completeness of the assembled sequences and the expression intensity of the transcripts. The assembled sequences were also used to identify large numbers of genes potentially belonging to families related to insecticide resistance. A total of 90 P450-, 42 GST-and 37 COE-related genes, representing three major enzyme families involved in insecticide metabolism and resistance, were identified. In addition, 36 isotigs were discovered to contain target site sequences related to four classes of resistance genes. Identified sequence motifs were also analyzed to characterize putative polypeptide translational products and associate them with specific genes and protein functions.
Sequence characterization of S100A8 gene reveals structural differences of protein and transcriptional factor binding sites in water buffalo and yak.

PubMed

Kathiravan, P; Goyal, S; Kataria, R S; Mishra, B P; Jayakumar, S; Joshi, B K

2011-01-01

The present study was undertaken to characterize the structure of S100A8 gene and its promoter in water buffalo and yak. Sequence data of 2.067 kb, 2.071 kb, and 2.052 kb with respect to complete S100A8 gene including 5' flanking region was generated in river buffalo, swamp buffalo, and yak, respectively. BLAST analysis of coding DNA sequences (CDS) of S100A8 gene revealed 95% homology of buffalo sequence with cattle, 85% with pig and horse, 83% with dog, 72-73% with murines, and around 79% with primates and humans. Phylogenetic analysis of predicted CDS revealed distinct clustering of murines, primates, and domestic animals with bovines and bubalines forming a subcluster among farm animals. In silico translation of predicted CDS revealed a sequence of 89 amino acids with 7 amino acid changes between cattle and buffalo and 2 changes between cattle and yak. The search for Pfam family revealed the N-terminal calcium binding domain and the noncanonical EF hand domain in the carboxy terminus, with more variations being observed in the N-terminal domain among different species. Two amino acid changes observed in carboxy terminal EF hand domain resulted in altered secondary structure of yak S100A8 protein. Analysis of S100A8 gene promoter revealed 14 putative motifs for transcriptional factor binding sites. Two putative motifs viz. C/EBP and v-Myb were found to be absent in swamp buffalo as compared to river buffalo and cattle. Differences in the structure of S100A8 protein and the transcriptional factor binding sites identified in the present study need to be analyzed further for their functional significance in yak and swamp buffalo respectively. Copyright © Taylor & Francis Group, LLC
Mitogen-activated protein kinase 1 from disk abalone (Haliotis discus discus): Roles in early development and immunity-related transcriptional responses.

PubMed

Perera, N C N; Godahewa, G I; Lee, Jehee

2016-12-01

Mitogen-activated protein kinase (MAPK) is involved in the regulation of cellular events by mediating signal transduction pathways. MAPK1 is a member of the extracellular-signal regulated kinases (ERKs), playing roles in cell proliferation, differentiation, and development. This is mainly in response to growth factors, mitogens, and many environmental stresses. In the current study, we have characterized the structural features of a homolog of MAPK1 from disk abalone (AbMAPK1). Further, we have unraveled its expressional kinetics against different experimental pathogenic infections or related chemical stimulants. AbMAPK1 harbors a 5' untranslated region (UTR) of 23 bps, a coding sequence of 1104 bps, and a 3' UTR of 448 bp. The putative peptide comprises a predicted molecular mass of 42.2 kDa, with a theoretical pI of 6.28. Based on the in silico analysis, AbMAPK1 possesses two N-glycosylation sites, one S_TK catalytic domain, and a conserved His-Arg-Asp domain (HRD). In addition, a conservative glycine rich ATP-phosphate-binding loop and a threonine-x-tyrosine motif (TEY) important for the autophosphorylation were also identified in the protein. Homology assessment of AbMAPK1 showed several conserved regions, and ark clam (Aplysia californica) showed the highest sequence identity (87.9%). The phylogenetic analysis supported close evolutionary kinship with molluscan orthologs. Constitutive expression of AbMAPK1 was observed in six different tissues of disk abalone, with the highest expression in the digestive tract, followed by the gills and hemocytes. Highest AbMAPK1 mRNA expression level was detected at the trochophore developmental stage, suggesting its role in abalone cell differentiation and proliferation. Significant modulation of AbMAPK1 expression under pathogenic stress suggested its putative involvement in the immune defense mechanism. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Structural and functional characterization of a novel molluskan ortholog of TRAF and TNF receptor-associated protein from disk abalone (Haliotis discus discus).

PubMed

Lee, Youngdeuk; Elvitigala, Don Anushka Sandaruwan; Whang, Ilson; Lee, Sukkyoung; Kim, Hyowon; Zoysa, Mahanama De; Oh, Chulhong; Kang, Do-Hyung; Lee, Jehee

2014-09-01

Immune signaling cascades have an indispensable role in the host defense of almost all the organisms. Tumor necrosis factor (TNF) signaling is considered as a prominent signaling pathway in vertebrate as well as invertebrate species. Within the signaling cascade, TNF receptor-associated factor (TRAF) and TNF receptor-associated protein (TTRAP) has been shown to have a crucial role in the modulation of immune signaling in animals. Here, we attempted to characterize a novel molluskan ortholog of TTRAP (AbTTRAP) from disk abalone (Haliotis discus discus) and analyzed its expression levels under pathogenic stress. The complete coding sequence of AbTTRAP consisted of 1071 nucleotides, coding for a 357 amino acid peptide, with a predicted molecular mass of 40 kDa. According to our in-silico analysis, AbTTRAP resembled the typical TTRAP domain architecture, including a 5'-tyrosyl DNA phosphodiesterase domain. Moreover, phylogenetic analysis revealed its common ancestral invertebrate origin, where AbTTRAP was clustered with molluskan counterparts. Quantitative real time PCR showed universally distributed expression of AbTTRAP in selected tissues of abalone, from which more prominent expression was detected in hemocytes. Upon stimulation with two pathogen-derived mitogens, lipopolysaccharide (LPS) and polyinosinic:polycytidylic acid (poly I:C), transcript levels of AbTTRAP in hemocytes and gill tissues were differentially modulated with time. In addition, the recombinant protein of AbTTRAP exhibited prominent endonuclease activity against abalone genomic DNA, which was enhanced by the presence of Mg(2+) in the medium. Collectively, these results reinforce the existence of the TNF signaling cascade in mollusks like disk abalone, further implicating the putative regulatory behavior of TTRAP in invertebrate host pathology. Copyright © 2014 Elsevier Ltd. All rights reserved.
Understanding Neurodevelopmental Disorders: The Promise of Regulatory Variation in the 3'UTRome.

PubMed

Wanke, Kai A; Devanna, Paolo; Vernes, Sonja C

2018-04-01

Neurodevelopmental disorders have a strong genetic component, but despite widespread efforts, the specific genetic factors underlying these disorders remain undefined for a large proportion of affected individuals. Given the accessibility of exome sequencing, this problem has thus far been addressed from a protein-centric standpoint; however, protein-coding regions only make up ∼1% to 2% of the human genome. With the advent of whole genome sequencing we are in the midst of a paradigm shift as it is now possible to interrogate the entire sequence of the human genome (coding and noncoding) to fill in the missing heritability of complex disorders. These new technologies bring new challenges, as the number of noncoding variants identified per individual can be overwhelming, making it prudent to focus on noncoding regions of known function, for which the effects of variation can be predicted and directly tested to assess pathogenicity. The 3'UTRome is a region of the noncoding genome that perfectly fulfills these criteria and is of high interest when searching for pathogenic variation related to complex neurodevelopmental disorders. Herein, we review the regulatory roles of the 3'UTRome as binding sites for microRNAs or RNA binding proteins, or during alternative polyadenylation. We detail existing evidence that these regions contribute to neurodevelopmental disorders and outline strategies for identification and validation of novel putatively pathogenic variation in these regions. This evidence suggests that studying the 3'UTRome will lead to the identification of new risk factors, new candidate disease genes, and a better understanding of the molecular mechanisms contributing to neurodevelopmental disorders. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Comparison of the Genome Sequence of the Poultry Pathogen Bordetella avium with Those of B. bronchiseptica, B. pertussis, and B. parapertussis Reveals Extensive Diversity in Surface Structures Associated with Host Interaction

PubMed Central

Sebaihia, Mohammed; Preston, Andrew; Maskell, Duncan J.; Kuzmiak, Holly; Connell, Terry D.; King, Natalie D.; Orndorff, Paul E.; Miyamoto, David M.; Thomson, Nicholas R.; Harris, David; Goble, Arlette; Lord, Angela; Murphy, Lee; Quail, Michael A.; Rutter, Simon; Squares, Robert; Squares, Steven; Woodward, John; Parkhill, Julian; Temple, Louise M.

2006-01-01

Bordetella avium is a pathogen of poultry and is phylogenetically distinct from Bordetella bronchiseptica, Bordetella pertussis, and Bordetella parapertussis, which are other species in the Bordetella genus that infect mammals. In order to understand the evolutionary relatedness of Bordetella species and further the understanding of pathogenesis, we obtained the complete genome sequence of B. avium strain 197N, a pathogenic strain that has been extensively studied. With 3,732,255 base pairs of DNA and 3,417 predicted coding sequences, it has the smallest genome and gene complement of the sequenced bordetellae. In this study, the presence or absence of previously reported virulence factors from B. avium was confirmed, and the genetic bases for growth characteristics were elucidated. Over 1,100 genes present in B. avium but not in B. bronchiseptica were identified, and most were predicted to encode surface or secreted proteins that are likely to define an organism adapted to the avian rather than the mammalian respiratory tracts. These include genes coding for the synthesis of a polysaccharide capsule, hemagglutinins, a type I secretion system adjacent to two very large genes for secreted proteins, and unique genes for both lipopolysaccharide and fimbrial biogenesis. Three apparently complete prophages are also present. The BvgAS virulence regulatory system appears to have polymorphisms at a poly(C) tract that is involved in phase variation in other bordetellae. A number of putative iron-regulated outer membrane proteins were predicted from the sequence, and this regulation was confirmed experimentally for five of these. PMID:16885469
Identification and characterization of wheat long non-protein coding RNAs responsive to powdery mildew infection and heat stress by using microarray analysis and SBS sequencing

PubMed Central

2011-01-01

Background Biotic and abiotic stresses, such as powdery mildew infection and high temperature, are important limiting factors for yield and grain quality in wheat production. Emerging evidences suggest that long non-protein coding RNAs (npcRNAs) are developmentally regulated and play roles in development and stress responses of plants. However, identification of long npcRNAs is limited to a few plant species, such as Arabidopsis, rice and maize, no systematic identification of long npcRNAs and their responses to abiotic and biotic stresses is reported in wheat. Results In this study, by using computational analysis and experimental approach we identified 125 putative wheat stress responsive long npcRNAs, which are not conserved among plant species. Among them, some were precursors of small RNAs such as microRNAs and siRNAs, two long npcRNAs were identified as signal recognition particle (SRP) 7S RNA variants, and three were characterized as U3 snoRNAs. We found that wheat long npcRNAs showed tissue dependent expression patterns and were responsive to powdery mildew infection and heat stress. Conclusion Our results indicated that diverse sets of wheat long npcRNAs were responsive to powdery mildew infection and heat stress, and could function in wheat responses to both biotic and abiotic stresses, which provided a starting point to understand their functions and regulatory mechanisms in the future. PMID:21473757
Comparison of Spinach Sex Chromosomes with Sugar Beet Autosomes Reveals Extensive Synteny and Low Recombination at the Male-Determining Locus.

PubMed

Takahata, Satoshi; Yago, Takumi; Iwabuchi, Keisuke; Hirakawa, Hideki; Suzuki, Yutaka; Onodera, Yasuyuki

2016-01-01

Spinach (Spinacia oleracea, 2n = 12) and sugar beet (Beta vulgaris, 2n = 18) are important crop members of the family Chenopodiaceae ss Sugar beet has a basic chromosome number of 9 and a cosexual breeding system, as do most members of the Chenopodiaceae ss. family. By contrast, spinach has a basic chromosome number of 6 and, although certain cultivars and genotypes produce monoecious plants, is considered to be a dioecious species. The loci determining male and monoecious sexual expression were mapped to different loci on the spinach sex chromosomes. In this study, a linkage map with 46 mapped protein-coding sequences was constructed for the spinach sex chromosomes. Comparison of the linkage map with a reference genome sequence of sugar beet revealed that the spinach sex chromosomes exhibited extensive synteny with sugar beet chromosomes 4 and 9. Tightly linked protein-coding genes linked to the male-determining locus in spinach corresponded to genes located in or around the putative pericentromeric and centromeric regions of sugar beet chromosomes 4 and 9, supporting the observation that recombination rates were low in the vicinity of the male-determining locus. The locus for monoecism was confined to a chromosomal segment corresponding to a region of approximately 1.7Mb on sugar beet chromosome 9, which may facilitate future positional cloning of the locus. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Tentacle Transcriptome and Venom Proteome of the Pacific Sea Nettle, Chrysaora fuscescens (Cnidaria: Scyphozoa)

PubMed Central

Ponce, Dalia; Brinkman, Diane L.; Potriquet, Jeremy; Mulvenna, Jason

2016-01-01

Jellyfish venoms are rich sources of toxins designed to capture prey or deter predators, but they can also elicit harmful effects in humans. In this study, an integrated transcriptomic and proteomic approach was used to identify putative toxins and their potential role in the venom of the scyphozoan jellyfish Chrysaora fuscescens. A de novo tentacle transcriptome, containing more than 23,000 contigs, was constructed and used in proteomic analysis of C. fuscescens venom to identify potential toxins. From a total of 163 proteins identified in the venom proteome, 27 were classified as putative toxins and grouped into six protein families: proteinases, venom allergens, C-type lectins, pore-forming toxins, glycoside hydrolases and enzyme inhibitors. Other putative toxins identified in the transcriptome, but not the proteome, included additional proteinases as well as lipases and deoxyribonucleases. Sequence analysis also revealed the presence of ShKT domains in two putative venom proteins from the proteome and an additional 15 from the transcriptome, suggesting potential ion channel blockade or modulatory activities. Comparison of these potential toxins to those from other cnidarians provided insight into their possible roles in C. fuscescens venom and an overview of the diversity of potential toxin families in cnidarian venoms. PMID:27058558
The midgut transcriptome of Phlebotomus (Larroussius) perniciosus, a vector of Leishmania infantum: comparison of sugar fed and blood fed sand flies.

PubMed

Dostálová, Anna; Votýpka, Jan; Favreau, Amanda J; Barbian, Kent D; Volf, Petr; Valenzuela, Jesus G; Jochim, Ryan C

2011-05-10

Parasite-vector interactions are fundamental in the transmission of vector-borne diseases such as leishmaniasis. Leishmania development in the vector sand fly is confined to the digestive tract, where sand fly midgut molecules interact with the parasites. In this work we sequenced and analyzed two midgut-specific cDNA libraries from sugar fed and blood fed female Phlebotomus perniciosus and compared the transcript expression profiles. A total of 4111 high quality sequences were obtained from the two libraries and assembled into 370 contigs and 1085 singletons. Molecules with putative roles in blood meal digestion, peritrophic matrix formation, immunity and response to oxidative stress were identified, including proteins that were not previously reported in sand flies. These molecules were evaluated relative to other published sand fly transcripts. Comparative analysis of the two libraries revealed transcripts differentially expressed in response to blood feeding. Molecules up regulated by blood feeding include a putative peritrophin (PperPer1), two chymotrypsin-like proteins (PperChym1 and PperChym2), a putative trypsin (PperTryp3) and four putative microvillar proteins (PperMVP1, 2, 4 and 5). Additionally, several transcripts were more abundant in the sugar fed midgut, such as two putative trypsins (PperTryp1 and PperTryp2), a chymotrypsin (PperChym3) and a microvillar protein (PperMVP3). We performed a detailed temporal expression profile analysis of the putative trypsin transcripts using qPCR and confirmed the expression of blood-induced and blood-repressed trypsins. Trypsin expression was measured in Leishmania infantum-infected and uninfected sand flies, which identified the L. infantum-induced down regulation of PperTryp3 at 24 hours post-blood meal. This midgut tissue-specific transcriptome provides insight into the molecules expressed in the midgut of P. perniciosus, an important vector of visceral leishmaniasis in the Old World. Through the comparative analysis of the libraries we identified molecules differentially expressed during blood meal digestion. Additionally, this study provides a detailed comparison to transcripts of other sand flies. Moreover, our analysis of putative trypsins demonstrated that L. infantum infection can reduce the transcript abundance of trypsin PperTryp3 in the midgut of P. perniciosus.
A computational search for box C/D snoRNA genes in the Drosophila melanogaster genome.

PubMed

Accardo, M C; Giordano, E; Riccardo, S; Digilio, F A; Iazzetti, G; Calogero, R A; Furia, M

2004-12-12

In eukaryotes, the family of non-coding RNA genes includes a number of genes encoding small nucleolar RNAs (mainly C/D and H/ACA snoRNAs), which act as guides in the maturation or post-transcriptional modifications of target RNA molecules. Since in Drosophila melanogaster (Dm) only few examples of snoRNAs have been identified so far by cDNA libraries screening, integration of the molecular data with in silico identification of these types of genes could throw light on their organization in the Dm genome. We have performed a computational screening of the Dm genome for C/D snoRNA genes, followed by experimental validation of the putative candidates. Few of the 26 confirmed snoRNAs had been recognized by cDNA library analysis. Organization of the Dm genome was also found to be more variegated than previously suspected, with snoRNA genes nested in both the introns and exons of protein-coding genes. This finding suggests that the presence of additional mechanisms of snoRNA biogenesis based on the alternative production of overlapping mRNA/snoRNA molecules. Additional information is available at http://www.bioinformatica.unito.it/bioinformatics/snoRNAs.
Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates.

PubMed

Warren, Ian A; Naville, Magali; Chalopin, Domitille; Levin, Perrine; Berger, Chloé Suzanne; Galiana, Delphine; Volff, Jean-Nicolas

2015-09-01

Since their discovery, a growing body of evidence has emerged demonstrating that transposable elements are important drivers of species diversity. These mobile elements exhibit a great variety in structure, size and mechanisms of transposition, making them important putative actors in organism evolution. The vertebrates represent a highly diverse and successful lineage that has adapted to a wide range of different environments. These animals also possess a rich repertoire of transposable elements, with highly diverse content between lineages and even between species. Here, we review how transposable elements are driving genomic diversity and lineage-specific innovation within vertebrates. We discuss the large differences in TE content between different vertebrate groups and then go on to look at how they affect organisms at a variety of levels: from the structure of chromosomes to their involvement in the regulation of gene expression, as well as in the formation and evolution of non-coding RNAs and protein-coding genes. In the process of doing this, we highlight how transposable elements have been involved in the evolution of some of the key innovations observed within the vertebrate lineage, driving the group's diversity and success.
Origin and Functional Prediction of Pollen Allergens in Plants1[OPEN

PubMed Central

Chen, Miaolin; Xu, Jie; Ren, Kang; Searle, Iain

2016-01-01

Pollen allergies have long been a major pandemic health problem for human. However, the evolutionary events and biological function of pollen allergens in plants remain largely unknown. Here, we report the genome-wide prediction of pollen allergens and their biological function in the dicotyledonous model plant Arabidopsis (Arabidopsis thaliana) and the monocotyledonous model plant rice (Oryza sativa). In total, 145 and 107 pollen allergens were predicted from rice and Arabidopsis, respectively. These pollen allergens are putatively involved in stress responses and metabolic processes such as cell wall metabolism during pollen development. Interestingly, these putative pollen allergen genes were derived from large gene families and became diversified during evolution. Sequence analysis across 25 plant species from green alga to angiosperms suggest that about 40% of putative pollen allergenic proteins existed in both lower and higher plants, while other allergens emerged during evolution. Although a high proportion of gene duplication has been observed among allergen-coding genes, our data show that these genes might have undergone purifying selection during evolution. We also observed that epitopes of an allergen might have a biological function, as revealed by comprehensive analysis of two known allergens, expansin and profilin. This implies a crucial role of conserved amino acid residues in both in planta biological function and allergenicity. Finally, a model explaining how pollen allergens were generated and maintained in plants is proposed. Prediction and systematic analysis of pollen allergens in model plants suggest that pollen allergens were evolved by gene duplication and then functional specification. This study provides insight into the phylogenetic and evolutionary scenario of pollen allergens that will be helpful to future characterization and epitope screening of pollen allergens. PMID:27436829
Identification of Putative Precursor Genes for the Biosynthesis of Cannabinoid-Like Compound in Radula marginata

PubMed Central

Hussain, Tajammul; Plunkett, Blue; Ejaz, Mahwish; Espley, Richard V.; Kayser, Oliver

2018-01-01

The liverwort Radula marginata belongs to the bryophyte division of land plants and is a prospective alternate source of cannabinoid-like compounds. However, mechanistic insights into the molecular pathways directing the synthesis of these cannabinoid-like compounds have been hindered due to the lack of genetic information. This prompted us to do deep sequencing, de novo assembly and annotation of R. marginata transcriptome, which resulted in the identification and validation of the genes for cannabinoid biosynthetic pathway. In total, we have identified 11,421 putative genes encoding 1,554 enzymes from 145 biosynthetic pathways. Interestingly, we have identified all the upstream genes of the central precursor of cannabinoid biosynthesis, cannabigerolic acid (CBGA), including its two first intermediates, stilbene acid (SA) and geranyl diphosphate (GPP). Expression of all these genes was validated using quantitative real-time PCR. We have characterized the protein structure of stilbene synthase (STS), which is considered as a homolog of olivetolic acid in R. marginata. Moreover, the metabolomics approach enabled us to identify CBGA-analogous compounds using electrospray ionization mass spectrometry (ESI-MS/MS) and gas chromatography mass spectrometry (GC-MS). Transcriptomic analysis revealed 1085 transcription factors (TF) from 39 families. Comparative analysis showed that six TF families have been uniquely predicted in R. marginata. In addition, the bioinformatics analysis predicted a large number of simple sequence repeats (SSRs) and non-coding RNAs (ncRNAs). Our results collectively provide mechanistic insights into the putative precursor genes for the biosynthesis of cannabinoid-like compounds and a novel transcriptomic resource for R. marginata. The large-scale transcriptomic resource generated in this study would further serve as a reference transcriptome to explore the Radulaceae family.
Origin and Functional Prediction of Pollen Allergens in Plants.

PubMed

Chen, Miaolin; Xu, Jie; Devis, Deborah; Shi, Jianxin; Ren, Kang; Searle, Iain; Zhang, Dabing

2016-09-01

Pollen allergies have long been a major pandemic health problem for human. However, the evolutionary events and biological function of pollen allergens in plants remain largely unknown. Here, we report the genome-wide prediction of pollen allergens and their biological function in the dicotyledonous model plant Arabidopsis (Arabidopsis thaliana) and the monocotyledonous model plant rice (Oryza sativa). In total, 145 and 107 pollen allergens were predicted from rice and Arabidopsis, respectively. These pollen allergens are putatively involved in stress responses and metabolic processes such as cell wall metabolism during pollen development. Interestingly, these putative pollen allergen genes were derived from large gene families and became diversified during evolution. Sequence analysis across 25 plant species from green alga to angiosperms suggest that about 40% of putative pollen allergenic proteins existed in both lower and higher plants, while other allergens emerged during evolution. Although a high proportion of gene duplication has been observed among allergen-coding genes, our data show that these genes might have undergone purifying selection during evolution. We also observed that epitopes of an allergen might have a biological function, as revealed by comprehensive analysis of two known allergens, expansin and profilin. This implies a crucial role of conserved amino acid residues in both in planta biological function and allergenicity. Finally, a model explaining how pollen allergens were generated and maintained in plants is proposed. Prediction and systematic analysis of pollen allergens in model plants suggest that pollen allergens were evolved by gene duplication and then functional specification. This study provides insight into the phylogenetic and evolutionary scenario of pollen allergens that will be helpful to future characterization and epitope screening of pollen allergens. © 2016 American Society of Plant Biologists. All rights reserved.
Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

PubMed

Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

2013-12-01

MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple. © 2013.
Three sorghum serpin recombinant proteins inhibit midgut trypsin activity and growth of corn earworm

USDA-ARS?s Scientific Manuscript database

The sorghum (Sorghum bicolor) genome contains at least 17 putative serpin (serine protease inhibitor) open reading frames, some of which are induced by pathogens. Recent transcriptome studies found that most of the putative serpins are expressed but their roles are unknown. Four sorghum serpins were...
Functional Characterization of Four Putative δ1-Pyrroline-5-Carboxylate Reductases from Bacillus subtilis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Forlani, Giuseppe; Nocek, Boguslaw; Chakravarthy, Srinivas

In most living organisms, the amino acid proline is synthesized starting from both glutamate and ornithine. In prokaryotes, in the absence of an ornithine cyclodeaminase that has been identified to date only in a small number of soil and plant bacteria, these pathways share the last step, the reduction of delta(1)-pyrroline-5-carboxylate (P5C) catalyzed by P5C reductase (EC 1.5.1.2). In several species, multiple forms of P5C reductase have been reported, possibly reflecting the dual function of proline. Aside from its common role as a building block of proteins, proline is indeed also involved in the cellular response to osmotic and oxidativemore » stress conditions. Genome analysis of Bacillus subtilis identifies the presence of four genes (ProH, ProI, ProG, and ComER) that, based on bioinformatic and phylogenic studies, were defined as respectively coding a putative P5C reductase. Here we describe the cloning, heterologous expression, functional analysis and small-angle X-ray scattering studies of the four affinity-purified proteins. Results showed that two of them, namely ProI and ComER, lost their catalytic efficiency or underwent subfunctionalization. In the case of ComER, this could be likely explained by the loss of the ability to form a dimer, which has been previously shown to be an essential structural feature of the catalytically active P5C reductase. The properties of the two active enzymes are consistent with a constitutive role for ProG, and suggest that ProH expression may be beneficial to satisfy an increased need for proline.« less
Functional Characterization of Four Putative δ1-Pyrroline-5-Carboxylate Reductases from Bacillus subtilis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Forlani, Giuseppe; Nocek, Boguslaw; Chakravarthy, Srinivas

In most living organisms, the amino acid proline is synthesized starting from both glutamate and ornithine. In prokaryotes, in the absence of an ornithine cyclodeaminase that has been identified to date only in a small number of soil and plant bacteria, these pathways share the last step, the reduction of δ1-pyrroline-5-carboxylate (P5C) catalyzed by P5C reductase (EC 1.5.1.2). In several species, multiple forms of P5C reductase have been reported, possibly reflecting the dual function of proline. Aside from its common role as a building block of proteins, proline is indeed also involved in the cellular response to osmotic and oxidativemore » stress conditions. Genome analysis of Bacillus subtilis identifies the presence of four genes (ProH, ProI, ProG, and ComER) that, based on bioinformatic and phylogenic studies, were defined as respectively coding a putative P5C reductase. Here we describe the cloning, heterologous expression, functional analysis and small-angle X-ray scattering studies of the four affinity-purified proteins. Results showed that two of them, namely ProI and ComER, lost their catalytic efficiency or underwent subfunctionalization. In the case of ComER, this could be likely explained by the loss of the ability to form a dimer, which has been previously shown to be an essential structural feature of the catalytically active P5C reductase. The properties of the two active enzymes are consistent with a constitutive role for ProG, and suggest that ProH expression may be beneficial to satisfy an increased need for proline.« less
Characterization of uncultivable bat influenza virus using a replicative synthetic virus.

PubMed

Zhou, Bin; Ma, Jingjiao; Liu, Qinfang; Bawa, Bhupinder; Wang, Wei; Shabman, Reed S; Duff, Michael; Lee, Jinhwa; Lang, Yuekun; Cao, Nan; Nagy, Abdou; Lin, Xudong; Stockwell, Timothy B; Richt, Juergen A; Wentworth, David E; Ma, Wenjun

2014-10-01

Bats harbor many viruses, which are periodically transmitted to humans resulting in outbreaks of disease (e.g., Ebola, SARS-CoV). Recently, influenza virus-like sequences were identified in bats; however, the viruses could not be cultured. This discovery aroused great interest in understanding the evolutionary history and pandemic potential of bat-influenza. Using synthetic genomics, we were unable to rescue the wild type bat virus, but could rescue a modified bat-influenza virus that had the HA and NA coding regions replaced with those of A/PR/8/1934 (H1N1). This modified bat-influenza virus replicated efficiently in vitro and in mice, resulting in severe disease. Additional studies using a bat-influenza virus that had the HA and NA of A/swine/Texas/4199-2/1998 (H3N2) showed that the PR8 HA and NA contributed to the pathogenicity in mice. Unlike other influenza viruses, engineering truncations hypothesized to reduce interferon antagonism into the NS1 protein didn't attenuate bat-influenza. In contrast, substitution of a putative virulence mutation from the bat-influenza PB2 significantly attenuated the virus in mice and introduction of a putative virulence mutation increased its pathogenicity. Mini-genome replication studies and virus reassortment experiments demonstrated that bat-influenza has very limited genetic and protein compatibility with Type A or Type B influenza viruses, yet it readily reassorts with another divergent bat-influenza virus, suggesting that the bat-influenza lineage may represent a new Genus/Species within the Orthomyxoviridae family. Collectively, our data indicate that the bat-influenza viruses recently identified are authentic viruses that pose little, if any, pandemic threat to humans; however, they provide new insights into the evolution and basic biology of influenza viruses.
Characterization of Uncultivable Bat Influenza Virus Using a Replicative Synthetic Virus

PubMed Central

Bawa, Bhupinder; Wang, Wei; Shabman, Reed S.; Duff, Michael; Lee, Jinhwa; Lang, Yuekun; Cao, Nan; Nagy, Abdou; Lin, Xudong; Stockwell, Timothy B.; Richt, Juergen A.; Wentworth, David E.; Ma, Wenjun

2014-01-01

Bats harbor many viruses, which are periodically transmitted to humans resulting in outbreaks of disease (e.g., Ebola, SARS-CoV). Recently, influenza virus-like sequences were identified in bats; however, the viruses could not be cultured. This discovery aroused great interest in understanding the evolutionary history and pandemic potential of bat-influenza. Using synthetic genomics, we were unable to rescue the wild type bat virus, but could rescue a modified bat-influenza virus that had the HA and NA coding regions replaced with those of A/PR/8/1934 (H1N1). This modified bat-influenza virus replicated efficiently in vitro and in mice, resulting in severe disease. Additional studies using a bat-influenza virus that had the HA and NA of A/swine/Texas/4199-2/1998 (H3N2) showed that the PR8 HA and NA contributed to the pathogenicity in mice. Unlike other influenza viruses, engineering truncations hypothesized to reduce interferon antagonism into the NS1 protein didn't attenuate bat-influenza. In contrast, substitution of a putative virulence mutation from the bat-influenza PB2 significantly attenuated the virus in mice and introduction of a putative virulence mutation increased its pathogenicity. Mini-genome replication studies and virus reassortment experiments demonstrated that bat-influenza has very limited genetic and protein compatibility with Type A or Type B influenza viruses, yet it readily reassorts with another divergent bat-influenza virus, suggesting that the bat-influenza lineage may represent a new Genus/Species within the Orthomyxoviridae family. Collectively, our data indicate that the bat-influenza viruses recently identified are authentic viruses that pose little, if any, pandemic threat to humans; however, they provide new insights into the evolution and basic biology of influenza viruses. PMID:25275541
Sugarcane genes differentially expressed in response to Puccinia melanocephala infection: identification and transcript profiling.

PubMed

Oloriz, María I; Gil, Víctor; Rojas, Luis; Portal, Orelvis; Izquierdo, Yovanny; Jiménez, Elio; Höfte, Monica

2012-05-01

Brown rust caused by the fungus Puccinia melanocephala is a major disease of sugarcane (Saccharum spp.). A sugarcane mutant, obtained by chemical mutagenesis of the susceptible variety B4362, showed a post-haustorial hypersensitive response (HR)-mediated resistance to the pathogen and was used to identify genes differentially expressed in response to P. melanocephala via suppression subtractive hybridization (SSH). Tester cDNA was derived from the brown rust-resistant mutant after inoculation with P. melanocephala, while driver cDNAs were obtained from the non-inoculated resistant mutant and the inoculated susceptible donor variety B4362. Database comparisons of the sequences of the SSH recombinant clones revealed that, of a subset of 89 non-redundant sequences, 88% had similarity to known functional genes, while 12% were of unknown function. Thirteen genes were selected for transcript profiling in the resistant mutant and the susceptible donor variety. Genes involved in glycolysis and C4 carbon fixation were up-regulated in both interactions probably due to disturbance of sugarcane carbon metabolism by the pathogen. Genes related with the nascent polypeptide associated complex, post-translational proteome modulation and autophagy were transcribed at higher levels in the compatible interaction. Up-regulation of a putative L-isoaspartyl O-methyltransferase S-adenosylmethionine gene in the compatible interaction may point to fungal manipulation of the cytoplasmatic methionine cycle. Genes coding for a putative no apical meristem protein, S-adenosylmethionine decarboxylase, non-specific lipid transfer protein, and GDP-L-galactose phosphorylase involved in ascorbic acid biosynthesis were up-regulated in the incompatible interaction at the onset of haustorium formation, and may contribute to the HR-mediated defense response in the rust-resistant mutant.
Centrocins: isolation and characterization of novel dimeric antimicrobial peptides from the green sea urchin, Strongylocentrotus droebachiensis.

PubMed

Li, Chun; Haug, Tor; Moe, Morten K; Styrvold, Olaf B; Stensvåg, Klara

2010-09-01

As immune effector molecules, antimicrobial peptides (AMPs) play an important role in the invertebrate immune system. Here, we present two novel AMPs, named centrocins 1 (4.5kDa) and 2 (4.4kDa), purified from coelomocyte extracts of the green sea urchin, Strongylocentrotus droebachiensis. The native peptides are cationic and show potent activities against Gram-positive and Gram-negative bacteria. The centrocins have an intramolecular heterodimeric structure, containing a heavy chain (30 amino acids) and a light chain (12 amino acids). The cDNA encoding the peptides and genomic sequences were cloned and sequenced. One putative isoform (centrocin 1b) was identified and one intron was found in the genes coding for the centrocins. The full length protein sequence of centrocin 1 consists of 119 amino acids, whereas centrocin 2 consists of 118 amino acids which both include a preprosequence of 51 or 50 amino acids for centrocins 1 and 2, respectively, and an interchain of 24 amino acids between the heavy and light chain. The difference of molecular mass between the native centrocins and the deduced sequences from cDNA indicates that the native centrocins contain a post-translational brominated tryptophan. In addition, two amino acids at the C-terminal, Gly-Arg, were removed from the light chains during the post-translational processing. The separate peptide chains of centrocin 1 were synthesized and the heavy chain alone was shown to be sufficient for antimicrobial activity. The genome of the closely related species, the purple sea urchin (S. purpuratus), was shown to contain two putative proteins with high similarity to the centrocins. Copyright 2010 Elsevier Ltd. All rights reserved.
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters.

PubMed

Dallery, Jean-Félix; Lapalu, Nicolas; Zampounis, Antonios; Pigné, Sandrine; Luyten, Isabelle; Amselem, Joëlle; Wittenberg, Alexander H J; Zhou, Shiguo; de Queiroz, Marisa V; Robin, Guillaume P; Auger, Annie; Hainaut, Matthieu; Henrissat, Bernard; Kim, Ki-Tae; Lee, Yong-Hwan; Lespinet, Olivier; Schwartz, David C; Thon, Michael R; O'Connell, Richard J

2017-08-29

The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.
Molecular cloning of the potato Gro1-4 gene conferring resistance to pathotype Ro1 of the root cyst nematode Globodera rostochiensis, based on a candidate gene approach.

PubMed

Paal, Jürgen; Henselewski, Heike; Muth, Jost; Meksem, Khalid; Menéndez, Cristina M; Salamini, Francesco; Ballvora, Agim; Gebhardt, Christiane

2004-04-01

The endoparasitic root cyst nematode Globodera rostochiensis causes considerable damage in potato cultivation. In the past, major genes for nematode resistance have been introgressed from related potato species into cultivars. Elucidating the molecular basis of resistance will contribute to the understanding of nematode-plant interactions and assist in breeding nematode-resistant cultivars. The Gro1 resistance locus to G. rostochiensis on potato chromosome VII co-localized with a resistance-gene-like (RGL) DNA marker. This marker was used to isolate from genomic libraries 15 members of a closely related candidate gene family. Analysis of inheritance, linkage mapping, and sequencing reduced the number of candidate genes to three. Complementation analysis by stable potato transformation showed that the gene Gro1-4 conferred resistance to G. rostochiensis pathotype Ro1. Gro1-4 encodes a protein of 1136 amino acids that contains Toll-interleukin 1 receptor (TIR), nucleotide-binding (NB), leucine-rich repeat (LRR) homology domains and a C-terminal domain with unknown function. The deduced Gro1-4 protein differed by 29 amino acid changes from susceptible members of the Gro1 gene family. Sequence characterization of 13 members of the Gro1 gene family revealed putative regulatory elements and a variable microsatellite in the promoter region, insertion of a retrotransposon-like element in the first intron, and a stop codon in the NB coding region of some genes. Sequence analysis of RT-PCR products showed that Gro1-4 is expressed, among other members of the family including putative pseudogenes, in non-infected roots of nematode-resistant plants. RT-PCR also demonstrated that members of the Gro1 gene family are expressed in most potato tissues.
Evolution of coding and non-coding genes in HOX clusters of a marsupial.

PubMed

Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

2012-06-18

The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.
Evolution of coding and non-coding genes in HOX clusters of a marsupial

PubMed Central

2012-01-01

Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672
Complete genome-wide screening and subtractive genomic approach revealed new virulence factors, potential drug targets against bio-war pathogen Brucella melitensis 16M

PubMed Central

Pradeepkiran, Jangampalli Adi; Sainath, Sri Bhashyam; Kumar, Konidala Kranthi; Bhaskar, Matcha

2015-01-01

Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes) to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50%) to Silicibacter pomeroyi DUF1285 family protein (2RE3). A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the glycerol structural analogs from the PubChem database. We identified five best inhibitors with strong affinities, stable interactions, and also with reliable drug-like properties. Hence, these leads might be used as the most effective inhibitors of modeled protein. The outcome of the present work of virtual screening of putative gene targets might facilitate design of potential drugs for better treatment against brucellosis. PMID:25834405

First isolation of West Nile virus from a dromedary camel

PubMed Central

Joseph, Sunitha; Wernery, Ulrich; Teng, Jade LL; Wernery, Renate; Huang, Yi; Patteril, Nissy AG; Chan, Kwok-Hung; Elizabeth, Shyna K; Fan, Rachel YY; Lau, Susanna KP; Kinne, Jörg; Woo, Patrick CY

2016-01-01

Although antibodies against West Nile virus (WNV) have been detected in the sera of dromedaries in the Middle East, North Africa and Spain, no WNV has been isolated or amplified from dromedary or Bactrian camels. In this study, WNV was isolated from Vero cells inoculated with both nasal swab and pooled trachea/lung samples from a dromedary calf in Dubai. Complete-genome sequencing and phylogenetic analysis using the near-whole-genome polyprotein revealed that the virus belonged to lineage 1a. There was no clustering of the present WNV with other WNVs isolated in other parts of the Middle East. Within lineage 1a, the dromedary WNV occupied a unique position, although it was most closely related to other WNVs of cluster 2. Comparative analysis revealed that the putative E protein encoded by the genome possessed the original WNV E protein glycosylation motif NYS at E154–156, which contained the N-linked glycosylation site at N-154 associated with increased WNV pathogenicity and neuroinvasiveness. In the putative NS1 protein, the A70S substitution observed in other cluster 2 WNVs and P250, which has been implicated in neuroinvasiveness, were present. In addition, the foo motif in the putative NS2A protein, which has been implicated in neuroinvasiveness, was detected. Notably, the amino-acid residues at 14 positions in the present dromedary WNV genome differed from those in most of the closely related WNV strains in cluster 2 of lineage 1a, with the majority of these differences observed in the putative E and NS5 proteins. The present study is the first to demonstrate the isolation of WNV from dromedaries. This finding expands the possible reservoirs of WNV and sources of WNV infection. PMID:27273223
Putative bacterial volatile-mediated growth in soybean (Glycine max L. Merrill) and expression of induced proteins under salt stress.

PubMed

Vaishnav, A; Kumari, S; Jain, S; Varma, A; Choudhary, D K

2015-08-01

Plant root-associated rhizobacteria elicit plant immunity referred to as induced systemic tolerance (IST) against multiple abiotic stresses. Among multibacterial determinants involved in IST, the induction of IST and promotion of growth by putative bacterial volatile compounds (VOCs) is reported in the present study. To characterize plant proteins induced by putative bacterial VOCs, proteomic analysis was performed by MALDI-MS/MS after exposure of soybean seedlings to a new strain of plant growth promoting rhizobacteria (PGPR) Pseudomonas simiae strain AU. Furthermore, expression analysis by Western blotting confirmed that the vegetative storage protein (VSP), gamma-glutamyl hydrolase (GGH) and RuBisCo large chain proteins were significantly up-regulated by the exposure to AU strain and played a major role in IST. VSP has preponderant roles in N accumulation and mobilization, acid phosphatase activity and Na(+) homeostasis to sustain plant growth under stress condition. More interestingly, plant exposure to the bacterial strain significantly reduced Na(+) and enhanced K(+) and P content in root of soybean seedlings under salt stress. In addition, high accumulation of proline and chlorophyll content also provided evidence of protection against osmotic stress during the elicitation of IST by bacterial exposure. The present study reported for the first time that Ps. simiae produces a putative volatile blend that can enhance soybean seedling growth and elicit IST against 100 mmol l(-1) NaCl stress condition. The identification of such differentially expressed proteins provide new targets for future studies that will allow assessment of their physiological roles and significance in the response of glycophytes to stresses. Further work should uncover more about the chemical side of VOC compounds and a detailed study about their molecular mechanism responsible for plant growth. © 2015 The Society for Applied Microbiology.
Structural analysis of the receptor binding domain of botulinum neurotoxin serotype D

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Yanfeng; Buchko, Garry W.; Qin, Lin

2010-10-28

Botulinum neurotoxins (BoNTs) are the most toxic proteins known. The mechanism for entry into neuronal cells for serotypes A, B, E, F, and G involves a well understood dual receptor (protein and ganglioside) process, however, the mechanism of entry for serotypes C and D remains unclear. To provide structural insights into how BoNT/D enters neuronal cells, the crystal structure of the receptor binding domain (S863-E1276) for this serotype (BoNT/D-HCR) was determined at 1.65 Å resolution. While BoNT/D-HCR adopts an overall fold similar to that observed in other known BoNT HCRs, several major structural differences are present. These structural differences aremore » located at, or near, putative receptor binding sites and may be responsible for BoNT/D host preferences. Two loops, S1195-I1204 and K1236-N1244, located on both sides of the putative protein receptor binding pocket, are displaced >10 Å relative to the corresponding residues in the crystal structures of BoNT/B and G. Obvious clashes were observed in the putative protein receptor binding site when the BoNT/B protein receptor synaptotagmin II was modeled into the BoNT/D-HCR structure. Although a ganglioside binding site has never been unambiguously identified in BoNT/D-HCR, a shallow cavity in an analogous location to the other BoNT serotypes HCR domains is observed in BoNT/D-HCR that has features compatible with membrane binding. A portion of a loop near the putative receptor binding site, K1236-N1244, is hydrophobic and solvent-exposed and may directly bind membrane lipids. Liposome-binding experiments with BoNT/D-HCR demonstrate that this membrane lipid may be phosphatidylethanolamine.« less
Structural Analysis of the Receptor Binding Domain of Botulinum Neurotoxin Serotype D

DOE Office of Scientific and Technical Information (OSTI.GOV)

Y Zhang; G Buchko; L Qin

2011-12-31

Botulinum neurotoxins (BoNTs) are the most toxic proteins known. The mechanism for entry into neuronal cells for serotypes A, B, E, F, and G involves a well understood dual receptor (protein and ganglioside) process, however, the mechanism of entry for serotypes C and D remains unclear. To provide structural insights into how BoNT/D enters neuronal cells, the crystal structure of the receptor binding domain (S863-E1276) for this serotype (BoNT/D-HCR) was determined at 1.65{angstrom} resolution. While BoNT/D-HCR adopts an overall fold similar to that observed in other known BoNT HCRs, several major structural differences are present. These structural differences are locatedmore » at, or near, putative receptor binding sites and may be responsible for BoNT/D host preferences. Two loops, S1195-I1204 and K1236-N1244, located on both sides of the putative protein receptor binding pocket, are displaced >10{angstrom} relative to the corresponding residues in the crystal structures of BoNT/B and G. Obvious clashes were observed in the putative protein receptor binding site when the BoNT/B protein receptor synaptotagmin II was modeled into the BoNT/D-HCR structure. Although a ganglioside binding site has never been unambiguously identified in BoNT/D-HCR, a shallow cavity in an analogous location to the other BoNT serotypes HCR domains is observed in BoNT/D-HCR that has features compatible with membrane binding. A portion of a loop near the putative receptor binding site, K1236-N1244, is hydrophobic and solvent-exposed and may directly bind membrane lipids. Liposome-binding experiments with BoNT/D-HCR demonstrate that this membrane lipid may be phosphatidylethanolamine.« less
In silico analysis to identify vaccine candidates common to multiple serotypes of Shigella and evaluation of their immunogenicity.

PubMed

Pahil, Sapna; Taneja, Neelam; Ansari, Hifzur Rahman; Raghava, G P S

2017-01-01

Shigellosis or bacillary dysentery is an important cause of diarrhea, with the majority of the cases occurring in developing countries. Considering the high disease burden, increasing antibiotic resistance, serotype-specific immunity and the post-infectious sequelae associated with shigellosis, there is a pressing need of an effective vaccine against multiple serotypes of the pathogen. In the present study, we used bio-informatics approach to identify antigens shared among multiple serotypes of Shigella spp. This approach led to the identification of many immunogenic peptides. The five most promising peptides based on MHC binding efficiency were a putative lipoprotein (EL PGI I), a putative heat shock protein (EL PGI II), Spa32 (EL PGI III), IcsB (EL PGI IV) and a hypothetical protein (EL PGI V). These peptides were synthesized and the immunogenicity was evaluated in BALB/c mice by ELISA and cytokine assays. The putative heat shock protein (HSP) and the hypothetical protein elicited good humoral response, whereas putative lipoprotein, Spa32 and IcsB elicited good T-cell response as revealed by increased IFN-γ and TNF-α cytokine levels. The patient sera from confirmed cases of shigellosis were also evaluated for the presence of peptide specific antibodies with significant IgG and IgA antibodies against the HSP and the hypothetical protein, bestowing them as potential future vaccine candidates. The antigens reported in this study are novel and have not been tested as vaccine candidates against Shigella. This study offers time and cost-effective way of identifying unprecedented immunogenic antigens to be used as potential vaccine candidates. Moreover, this approach should easily be extendable to find new potential vaccine candidates for other pathogenic bacteria.
Differential splicing of human androgen receptor pre-mRNA in X-linked reifenstein syndrome, because of a deletion involving a putative branch site

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ris-Stalpers, C.; Verleun-Mooijman, M.C.T.; Blaeij, T.J.P. de

1994-04-01

The analysis of the androgen receptor (AR) gene, mRNA, and protein in a subject with X-linked Reifenstein syndrome (partial androgen insensitivity) is reported. The presence of two mature AR transcripts in genital skin fibroblasts of the patient is established, and, by reverse transcriptase-PCR and RNase transcription analysis, the wild-type transcript and a transcript in which exon 3 sequences are absent without disruption of the translational reading frame are identified. Sequencing and hybridization analysis show a deletion of >6 kb in intron 2 of the human AR gene, starting 18 bp upstream of exon 3. The deletion includes the putative branch-pointmore » sequence (BPS) but not the acceptor splice site on the intron 2/exon 3 boundary. The deletion of the putative intron 2 BPS results in 90% inhibition of wild-type splicing. The mutant transcript encodes an AR protein lacking the second zinc finger of the DNA-binding domain. Western/immunoblotting analysis is used to show that the mutant AR protein is expressed in genital skin fibroblasts of the patient. The residual 10% wild-type transcript can be the result of the use of a cryptic BPS located 63 bp upstream of the intron 2/exon 3 boundary of the mutant AR gene. The mutated AR protein has no transcription-activating potential and does not influence the transactivating properties of the wild-type AR, as tested in cotransfection studies. It is concluded that the partial androgen-insensitivity syndrome of this patient is the consequence of the limited amount of wild-type AR protein expressed in androgen target cells, resulting from the deletion of the intron 2 putative BPS. 42 refs., 6 figs., 1 tab.« less
Involvement of Two Latex-Clearing Proteins during Rubber Degradation and Insights into the Subsequent Degradation Pathway Revealed by the Genome Sequence of Gordonia polyisoprenivorans Strain VH2

PubMed Central

Hiessl, Sebastian; Schuldes, Jörg; Thürmer, Andrea; Halbsguth, Tobias; Bröker, Daniel; Angelov, Angel; Liebl, Wolfgang; Daniel, Rolf

2012-01-01

The increasing production of synthetic and natural poly(cis-1,4-isoprene) rubber leads to huge challenges in waste management. Only a few bacteria are known to degrade rubber, and little is known about the mechanism of microbial rubber degradation. The genome of Gordonia polyisoprenivorans strain VH2, which is one of the most effective rubber-degrading bacteria, was sequenced and annotated to elucidate the degradation pathway and other features of this actinomycete. The genome consists of a circular chromosome of 5,669,805 bp and a circular plasmid of 174,494 bp with average GC contents of 67.0% and 65.7%, respectively. It contains 5,110 putative protein-coding sequences, including many candidate genes responsible for rubber degradation and other biotechnically relevant pathways. Furthermore, we detected two homologues of a latex-clearing protein, which is supposed to be a key enzyme in rubber degradation. The deletion of these two genes for the first time revealed clear evidence that latex-clearing protein is essential for the microbial utilization of rubber. Based on the genome sequence, we predict a pathway for the microbial degradation of rubber which is supported by previous and current data on transposon mutagenesis, deletion mutants, applied comparative genomics, and literature search. PMID:22327575
Mycobacterium ahvazicum sp. nov., the nineteenth species of the Mycobacterium simiae complex.

PubMed

Bouam, Amar; Heidarieh, Parvin; Shahraki, Abodolrazagh Hashemi; Pourahmad, Fazel; Mirsaeidi, Mehdi; Hashemzadeh, Mohamad; Baptiste, Emeline; Armstrong, Nicholas; Levasseur, Anthony; Robert, Catherine; Drancourt, Michel

2018-03-07

Four slowly growing mycobacteria isolates were isolated from the respiratory tract and soft tissue biopsies collected in four unrelated patients in Iran. Conventional phenotypic tests indicated that these four isolates were identical to Mycobacterium lentiflavum while 16S rRNA gene sequencing yielded a unique sequence separated from that of M. lentiflavum. One representative strain AFP-003 T was characterized as comprising a 6,121,237-bp chromosome (66.24% guanosine-cytosine content) encoding for 5,758 protein-coding genes, 50 tRNA and one complete rRNA operon. A total of 2,876 proteins were found to be associated with the mobilome, including 195 phage proteins. A total of 1,235 proteins were found to be associated with virulence and 96 with toxin/antitoxin systems. The genome of AFP-003 T has the genetic potential to produce secondary metabolites, with 39 genes found to be associated with polyketide synthases and non-ribosomal peptide syntases and 11 genes encoding for bacteriocins. Two regions encoding putative prophages and three OriC regions separated by the dnaA gene were predicted. Strain AFP-003 T genome exhibits 86% average nucleotide identity with Mycobacterium genavense genome. Genetic and genomic data indicate that strain AFP-003 T is representative of a novel Mycobacterium species that we named Mycobacterium ahvazicum, the nineteenth species of the expanding Mycobacterium simiae complex.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Schriner, J.E.; Yi, W.; Hofmann, S.L.

Palmitoyl-protein thioesterase (PPT) is a small glycoprotein that removes palmitate groups from cysteine residues in lipid-modified proteins. We recently reported mutations in PPT in patients with infantile neuronal ceroid lipofuscinosis (INCL), a severe neurodegenerative disorder. INCL is characterized by the accumulation of proteolipid storage material in brain and other tissues, suggesting that the disease is a consequence of abnormal catabolism of acylated proteins. In the current paper, we report the sequence of the human PPT cDNA and the structure of the human PPT gene. The cDNA predicts a protein of 306 amino acids that contains a 25-amino-acid signal peptide, threemore » N-linked glycosylation sites, and consensus motifs characteristic of thioesterases. Northern analysis of a human tissue blot revealed ubiquitous expression of a single 2.5-kb mRNA, with highest expression in lung, brain, and heart. The human PPT gene spans 25 kb and is composed of seven coding exons and a large eighth exon, containing the entire 3{prime}-untranslated region of 1388 bp. An Alu repeat and promoter elements corresponding to putative binding sites for several general transcription factors were identified in the 1060 nucleotides upstream of the transcription start site. The human PPT cDNA sequence and gene structure will provide the means for the identification of further causative mutations in INCL and facilitate genetic screening in selected high-risk populations. 31 refs., 5 figs., 1 tab.« less
Insight into the transcriptome of Arthrobotrys conoides using high throughput sequencing.

PubMed

Ramesh, Pandit; Reena, Patel; Amitbikram, Mohapatra; Chaitanya, Joshi; Anju, Kunjadia

2015-12-01

Arthrobotrys conoides is a nematode-trapping fungus belonging to Orbiliales, Ascomycota group, and traps prey nematodes by means of adhesive network. Fungus has a potential to be used as a biocontrol agent against plant parasitic nematodes. In the present study, we characterized the transcriptome of A. conoides using high-throughput sequencing technology and characterized its virulence unigenes. Total 7,255 cDNA contigs with an average length of 425 bp were generated and 6184 (61.81%) transcripts were functionally annotated and characterized. Majority of unigenes were found analogous to the genes of plant pathogenic fungi. A total of 1749 transcripts were found to be orthologous with eukaryotic proteins of KOG database. Several carbohydrate active enzymes and peptidases were identified. We also analyzed classically and nonclassically secreted proteins and confirmed by BLASTP against fungal secretome database. A total of 916 contigs were analogous to 556 unique proteins of Pathogen Host Interaction (PHI) database. Further, we identified 91 unigenes homologous to the database of fungal virulence factor (DFVF). A total of 104 putative protein kinases coding transcripts were identified by BLASTP against KinBase database, which are major players in signaling pathways. This study provides a comprehensive look at the transcriptome of A. conoides and the identified unigenes might have a role in catching and killing prey nematodes by A. conoides. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Functional Genomics Analysis of Singapore Grouper Iridovirus: Complete Sequence Determination and Proteomic Analysis

PubMed Central

Song, Wen Jun; Qin, Qi Wei; Qiu, Jin; Huang, Can Hua; Wang, Fan; Hew, Choy Leong

2004-01-01

Here we report the complete genome sequence of Singapore grouper iridovirus (SGIV). Sequencing of the random shotgun and restriction endonuclease genomic libraries showed that the entire SGIV genome consists of 140,131 nucleotide bp. One hundred sixty-two open reading frames (ORFs) from the sense and antisense DNA strands, coding for lengths varying from 41 to 1,268 amino acids, were identified. Computer-assisted analyses of the deduced amino acid sequences revealed that 77 of the ORFs exhibited homologies to known virus genes, 23 of which matched functional iridovirus proteins. Forty-two putative conserved domains or signatures were detected in the National Center for Biotechnology Information CD-Search database and PROSITE database. An assortment of enzyme activities involved in DNA replication, transcription, nucleotide metabolism, cell signaling, etc., were identified. Viruses were cultured on a cell line derived from the embryonated egg of the grouper Epinephelus tauvina, isolated, and purified by sucrose gradient ultracentrifugation. The protein extract from the purified virions was analyzed by polyacrylamide gel electrophoresis followed by in-gel digestion of protein bands. Matrix-assisted laser desorption ionization-time of flight mass spectrometry and database searching led to identification of 26 proteins. Twenty of these represented novel or previously unidentified genes, which were further confirmed by reverse transcription-PCR (RT-PCR) and DNA sequencing of their respective RT-PCR products. PMID:15507645
SITEHOUND-web: a server for ligand binding site identification in protein structures.

PubMed

Hernandez, Marylens; Ghersi, Dario; Sanchez, Roberto

2009-07-01

SITEHOUND-web (http://sitehound.sanchezlab.org) is a binding-site identification server powered by the SITEHOUND program. Given a protein structure in PDB format SITEHOUND-web will identify regions of the protein characterized by favorable interactions with a probe molecule. These regions correspond to putative ligand binding sites. Depending on the probe used in the calculation, sites with preference for different ligands will be identified. Currently, a carbon probe for identification of binding sites for drug-like molecules, and a phosphate probe for phosphorylated ligands (ATP, phoshopeptides, etc.) have been implemented. SITEHOUND-web will display the results in HTML pages including an interactive 3D representation of the protein structure and the putative sites using the Jmol java applet. Various downloadable data files are also provided for offline data analysis.
Genes Involved in Anaerobic Metabolism of Phenol in the Bacterium Thauera aromatica

PubMed Central

Breinig, Sabine; Schiltz, Emile; Fuchs, Georg

2000-01-01

Genes involved in the anaerobic metabolism of phenol in the denitrifying bacterium Thauera aromatica have been studied. The first two committed steps in this metabolism appear to be phosphorylation of phenol to phenylphosphate by an unknown phosphoryl donor (“phenylphosphate synthase”) and subsequent carboxylation of phenylphosphate to 4-hydroxybenzoate under release of phosphate (“phenylphosphate carboxylase”). Both enzyme activities are strictly phenol induced. Two-dimensional gel electrophoresis allowed identification of several phenol-induced proteins. Based on N-terminal and internal amino acid sequences of such proteins, degenerate oligonucleotides were designed to identify the corresponding genes. A chromosomal DNA segment of about 14 kbp was sequenced which contained 10 genes transcribed in the same direction. These are organized in two adjacent gene clusters and include the genes coding for five identified phenol-induced proteins. Comparison with sequences in the databases revealed the following similarities: the gene products of two open reading frames (ORFs) are each similar to either the central part and N-terminal part of phosphoenolpyruvate synthases. We propose that these ORFs are components of the phenylphosphate synthase system. Three ORFs showed similarity to the ubiD gene product, 3-octaprenyl-4-hydroxybenzoate carboxy lyase; UbiD catalyzes the decarboxylation of a 4-hydroxybenzoate analogue in ubiquinone biosynthesis. Another ORF was similar to the ubiX gene product, an isoenzyme of UbiD. We propose that (some of) these four proteins are involved in the carboxylation of phenylphosphate. A 700-bp PCR product derived from one of these ORFs cross-hybridized with DNA from different Thauera and Azoarcus strains, even from those which have not been reported to grow with phenol. One ORF showed similarity to the mutT gene product, and three ORFs showed no strong similarities to sequences in the databases. Upstream of the first gene cluster, an ORF which is transcribed in the opposite direction codes for a protein highly similar to the DmpR regulatory protein of Pseudomonas putida. DmpR controls transcription of the genes of aerobic phenol metabolism, suggesting a similar regulation of anaerobic phenol metabolism by the putative regulator. PMID:11004186
PM19, a barley (Hordeum vulgare L.) gene encoding a putative plasma membrane protein, is expressed during embryo development and dormancy.

PubMed

Ranford, Julia C; Bryce, James H; Morris, Peter C

2002-01-01

A barley (Hordeum vulgare L.) cDNA, PM19, encoding a putative plasma membrane protein was isolated through differential screening of a dormant wild oat embryo library. PM19 is expressed in barley embryos from mid-embryogenesis up to maturity. PM19 mRNA levels decline upon germination, whereas dormant embryos retained high levels of message for up to 72 h of imbibition. PM19 mRNA levels also remained high or were reinduced in non-dormant embryos by treatments that prevented germination (250 mm NaCl, 10% sorbitol, or 50 microm ABA). The PM19 protein sequence is highly conserved in monocotyledonous and dicotyledonous plants.
Current Research on Non-Coding Ribonucleic Acid (RNA).

PubMed

Wang, Jing; Samuels, David C; Zhao, Shilin; Xiang, Yu; Zhao, Ying-Yong; Guo, Yan

2017-12-05

Non-coding ribonucleic acid (RNA) has without a doubt captured the interest of biomedical researchers. The ability to screen the entire human genome with high-throughput sequencing technology has greatly enhanced the identification, annotation and prediction of the functionality of non-coding RNAs. In this review, we discuss the current landscape of non-coding RNA research and quantitative analysis. Non-coding RNA will be categorized into two major groups by size: long non-coding RNAs and small RNAs. In long non-coding RNA, we discuss regular long non-coding RNA, pseudogenes and circular RNA. In small RNA, we discuss miRNA, transfer RNA, piwi-interacting RNA, small nucleolar RNA, small nuclear RNA, Y RNA, single recognition particle RNA, and 7SK RNA. We elaborate on the origin, detection method, and potential association with disease, putative functional mechanisms, and public resources for these non-coding RNAs. We aim to provide readers with a complete overview of non-coding RNAs and incite additional interest in non-coding RNA research.
Frog virus 3 ORF 53R, a putative myristoylated membrane protein, is essential for virus replication in vitro

DOE Office of Scientific and Technical Information (OSTI.GOV)

Whitley, Dexter S.; Yu, Kwang; Sample, Robert C.

2010-09-30

Although previous work identified 12 complementation groups with possible roles in virus assembly, currently only one frog virus 3 protein, the major capsid protein (MCP), has been linked with virion formation. To identify other proteins required for assembly, we used an antisense morpholino oligonucleotide to target 53R, a putative myristoylated membrane protein, and showed that treatment resulted in marked reductions in 53R levels and a 60% drop in virus titers. Immunofluorescence assays confirmed knock down and showed that 53R was found primarily within viral assembly sites, whereas transmission electron microscopy detected fewer mature virions and, in some cells, dense granularmore » bodies that may represent unencapsidated DNA-protein complexes. Treatment with a myristoylation inhibitor (2-hydroxymyristic acid) resulted in an 80% reduction in viral titers. Collectively, these data indicate that 53R is an essential viral protein that is required for replication in vitro and suggest it plays a critical role in virion formation.« less
Cloning of Bordetella pertussis putative outer protein D (BopD) and Leucin/Isoleucine/Valin binding protein (LivJ)

NASA Astrophysics Data System (ADS)

Öztürk, Burcu Emine Tefon

2017-04-01

Whooping cough also known as pertussis is a contagious acute upper respiratory disease primarily caused by Bordetella pertussis. It is known that this disease may be fatal especially in infants and recently, the number of pertussis cases has been increased. Despite the fact that there are numbers of acellular vaccines on the market, the current acellular vaccine compositions are inadequate for providing sustainable immunity and avoiding subclinical disease cases. Hence, exploring novel proteins with high immune protective capacities is essential to enhance the clinical efficacy of current vaccines. In this study, genes of selected immunogenic proteins via -omics studies, namely Putative outer protein D (BopD) and Leucin/Isoleucine/Valin Binding Protein (LivJ) were first cloned into pGEM-T Easy vector and transformed to into E. coli DH5α cells and then cloned into the expression vector pET-28a(+) and transformed into E. coli BL21 (DE3) cells to express the proteins.
A Novel Collection of snRNA-Like Promoters with Tissue-Specific Transcription Properties

PubMed Central

Garritano, Sonia; Gigoni, Arianna; Costa, Delfina; Malatesta, Paolo; Florio, Tullio; Cancedda, Ranieri; Pagano, Aldo

2012-01-01

We recently identified a novel dataset of snRNA-like trascriptional units in the human genome. The investigation of a subset of these elements showed that they play relevant roles in physiology and/or pathology. In this work we expand our collection of small RNAs taking advantage of a newly developed algorithm able to identify genome sequence stretches with RNA polymerase (pol) III type 3 promoter features thus constituting putative pol III binding sites. The bioinformatic analysis of a subset of these elements that map in introns of protein-coding genes in antisense configuration suggest their association with alternative splicing, similarly to other recently characterized small RNAs. Interestingly, the analysis of the transcriptional activity of these novel promoters shows that they are active in a cell-type specific manner, in accordance with the emerging body of evidence of a tissue/cell-specific activity of pol III. PMID:23109855
A novel collection of snRNA-like promoters with tissue-specific transcription properties.

PubMed

Garritano, Sonia; Gigoni, Arianna; Costa, Delfina; Malatesta, Paolo; Florio, Tullio; Cancedda, Ranieri; Pagano, Aldo

2012-01-01

We recently identified a novel dataset of snRNA-like trascriptional units in the human genome. The investigation of a subset of these elements showed that they play relevant roles in physiology and/or pathology. In this work we expand our collection of small RNAs taking advantage of a newly developed algorithm able to identify genome sequence stretches with RNA polymerase (pol) III type 3 promoter features thus constituting putative pol III binding sites. The bioinformatic analysis of a subset of these elements that map in introns of protein-coding genes in antisense configuration suggest their association with alternative splicing, similarly to other recently characterized small RNAs. Interestingly, the analysis of the transcriptional activity of these novel promoters shows that they are active in a cell-type specific manner, in accordance with the emerging body of evidence of a tissue/cell-specific activity of pol III.
Genetic Variation among Plasmodium vivax Isolates Adapted to Non-Human Primates and the Implication for Vaccine Development

PubMed Central

Ntumngia, Francis B.; McHenry, Amy M.; Barnwel, John W.; Cole-Tobian, Jennifer; King, Christopher L.; Adams, John H.

2009-01-01

Plasmodium vivax Duffy binding protein (DBP) is vital for parasite development, thereby making this molecule a good vaccine candidate. Preclinical development of a P. vivax vaccine often involves use of primate models prior to testing efficacy in humans, but primate isolates are poorly characterized. We analyzed the complete gene coding for the DBP in several P. vivax isolates that are used for experimental primate infections and compared these sequences with the Salvador I DBP isolate, which is being used for vaccine development. Our results affirm that primate-adapted isolates are genetically similar to P. vivax circulating in humans, but variability is greatest in the putative target of protective antibodies. In addition, some P. vivax isolates contain multiple genetically different clones. Testing a DBP vaccine may therefore be complicated by heterogeneity and diversity of the P. vivax isolates available for in vivo challenge. PMID:19190217

G = MAT: linking transcription factor expression and DNA binding data.

PubMed

Tretyakov, Konstantin; Laur, Sven; Vilo, Jaak

2011-01-31

Transcription factors are proteins that bind to motifs on the DNA and thus affect gene expression regulation. The qualitative description of the corresponding processes is therefore important for a better understanding of essential biological mechanisms. However, wet lab experiments targeted at the discovery of the regulatory interplay between transcription factors and binding sites are expensive. We propose a new, purely computational method for finding putative associations between transcription factors and motifs. This method is based on a linear model that combines sequence information with expression data. We present various methods for model parameter estimation and show, via experiments on simulated data, that these methods are reliable. Finally, we examine the performance of this model on biological data and conclude that it can indeed be used to discover meaningful associations. The developed software is available as a web tool and Scilab source code at http://biit.cs.ut.ee/gmat/.
G = MAT: Linking Transcription Factor Expression and DNA Binding Data

PubMed Central

Tretyakov, Konstantin; Laur, Sven; Vilo, Jaak

2011-01-01

Transcription factors are proteins that bind to motifs on the DNA and thus affect gene expression regulation. The qualitative description of the corresponding processes is therefore important for a better understanding of essential biological mechanisms. However, wet lab experiments targeted at the discovery of the regulatory interplay between transcription factors and binding sites are expensive. We propose a new, purely computational method for finding putative associations between transcription factors and motifs. This method is based on a linear model that combines sequence information with expression data. We present various methods for model parameter estimation and show, via experiments on simulated data, that these methods are reliable. Finally, we examine the performance of this model on biological data and conclude that it can indeed be used to discover meaningful associations. The developed software is available as a web tool and Scilab source code at http://biit.cs.ut.ee/gmat/. PMID:21297945
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome.

PubMed

Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C; Szuter, Elise M; Hagan, Richard W; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M; Nelson, David R; Rosendale, Andrew J; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R; Ioannidis, Panagiotis; Waterhouse, Robert M; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J Spencer; Gondhalekar, Ameya D; Scharf, Michael E; Peterson, Brittany F; Raje, Kapil R; Hottel, Benjamin A; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S T; Duncan, Elizabeth J; Murali, Shwetha C; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C; Muzny, Donna M; Wheeler, David; Panfilio, Kristen A; Vargas Jentzsch, Iris M; Vargo, Edward L; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T; Anderson, Michelle A E; Jones, Jeffery W; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D; Attardo, Geoffrey M; Robertson, Hugh M; Zdobnov, Evgeny M; Ribeiro, Jose M C; Gibbs, Richard A; Werren, John H; Palli, Subba R; Schal, Coby; Richards, Stephen

2016-02-02

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host-symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont-bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite.
The complete mitogenome of brown trout (Salmo trutta fario) and its phylogeny.

PubMed

Sahoo, Prabhati K; Singh, Lalit; Sharma, Lata; Kumar, Rohit; Singh, Vijay K; Ali, S; Singh, Atul K; Barat, Ashoktaru

2016-11-01

The complete mitochondrial genome of Salmo trutta fario, commonly known as brown trout, was sequenced using NGS technology. The mitochondrial genome size was determined to be 16 677 bp and composed of 13 protein-coding gene (PCG), 22 tRNAs, 2 rRNA genes, and 1 putative control region. The overall mitogenome composition of S. trutta fario is A: 28.13%, G: 16.44%, C: 29.47%, and T: 25.96% with A + T content of 54.09% and G + C content of 45.91%. The gene arrangement and the order are similar to other vertebrates. The phylogenetic tree constructed using 42 complete mitogenomes of Salmonidae fishes confirmed the position of the present species under the genus Salmo of subfamily Salmoninae. NGS platform was proved to be a rapid and time-saving technology to reveal complete mitogenomes.
Nucleotide sequences of bovine alpha S1- and kappa-casein cDNAs.

PubMed Central

Stewart, A F; Willis, I M; Mackinlay, A G

1984-01-01

The nucleotide sequences corresponding to bovine alpha S1- and kappa-casein mRNAs are presented. An unusual alpha S1-casein cDNA has been characterised whose 5' end commences upstream from its putative TATA box. The alpha S1-casein mRNA is compared to rat alpha-casein mRNA and two components of divergence are identified. Firstly, the two sequences have diverged at a high point mutation rate and the rate of amino acid replacement by this mechanism is at least as great as the rate of divergence of any other part of the mRNAs. Secondly, the protein coding sequence has been subjected to several insertion/deletion events, one of which may be an example of exon shuffling . The kappa-casein mRNA sequence verifies the proposition that it has arisen from a different ancestral gene to the other caseins. Images PMID:6328443
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

PubMed Central

Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen

2016-01-01

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814
Proliferating cell nuclear antigen (Pcna) as a direct downstream target gene of Hoxc8

DOE Office of Scientific and Technical Information (OSTI.GOV)

Min, Hyehyun; Lee, Ji-Yeon; Bok, Jinwoong

2010-02-19

Hoxc8 is a member of Hox family transcription factors that play crucial roles in spatiotemporal body patterning during embryogenesis. Hox proteins contain a conserved 61 amino acid homeodomain, which is responsible for recognition and binding of the proteins onto Hox-specific DNA binding motifs and regulates expression of their target genes. Previously, using proteome analysis, we identified Proliferating cell nuclear antigen (Pcna) as one of the putative target genes of Hoxc8. Here, we asked whether Hoxc8 regulates Pcna expression by directly binding to the regulatory sequence of Pcna. In mouse embryos at embryonic day 11.5, the expression pattern of Pcna wasmore » similar to that of Hoxc8 along the anteroposterior body axis. Moreover, Pcna transcript levels as well as cell proliferation rate were increased by overexpression of Hoxc8 in C3H10T1/2 mouse embryonic fibroblast cells. Characterization of 2.3 kb genomic sequence upstream of Pcna coding region revealed that the upstream sequence contains several Hox core binding sequences and one Hox-Pbx binding sequence. Direct binding of Hoxc8 proteins to the Pcna regulatory sequence was verified by chromatin immunoprecipitation assay. Taken together, our data suggest that Pcna is a direct downstream target of Hoxc8.« less
Cloning and expression analysis of a new anther-specific gene CaMF4 in Capsicum annuum.

PubMed

Hao, Xuefeng; Chen, Changming; Chen, Guoju; Cao, Bihao; Lei, Jianjun

2017-03-01

Our previous study on the genic male sterile-fertile line 114AB of Capsicum annuum indicated a diversity of differentially expressed cDNA fragments in fertile and sterile lines. In this study, a transcript-derived fragment (TDF), male fertile 4 (CaMF4) was chosen for further investigation to observe that this specific fragment accumulates in the flower buds of the fertile line. The full genomic DNA sequence of CaMF4 was 894 bp in length, containing two exons and one intron, and the complete coding sequence encoded a putative 11.53 kDa protein of 109 amino acids. The derived protein of CaMF4 shared similarity with the members of PGPS/D3 protein family. The expression of CaMF4 was detected in both the flower buds at stage 8 and open flowers of the male fertile line. In contrast to this observation, expression of CaMF4 was not detected in any organs of the male sterile line. Further analysis revealed that CaMF4 was expressed particularly in anthers of the fertile line. Our results suggest that CaMF4 is an anther-specific gene and might be indispensable for anther or pollen development in C. annuum.
MicroRNAs: regulators of gene expression and cell differentiation

PubMed Central

Shivdasani, Ramesh A.

2006-01-01

The existence and roles of a class of abundant regulatory RNA molecules have recently come into sharp focus. Micro-RNAs (miRNAs) are small (approximately 22 bases), non–protein-coding RNAs that recognize target sequences of imperfect complementarity in cognate mRNAs and either destabilize them or inhibit protein translation. Although mechanisms of miRNA biogenesis have been elucidated in some detail, there is limited appreciation of their biological functions. Reported examples typically focus on miRNA regulation of a single tissue-restricted transcript, often one encoding a transcription factor, that controls a specific aspect of development, cell differentiation, or physiology. However, computational algorithms predict up to hundreds of putative targets for individual miRNAs, single transcripts may be regulated by multiple miRNAs, and miRNAs may either eliminate target gene expression or serve to finetune transcript and protein levels. Theoretical considerations and early experimental results hence suggest diverse roles for miRNAs as a class. One appealing possibility, that miRNAs eliminate low-level expression of unwanted genes and hence refine unilineage gene expression, may be especially amenable to evaluation in models of hematopoiesis. This review summarizes current understanding of miRNA mechanisms, outlines some of the important outstanding questions, and describes studies that attempt to define miRNA functions in hematopoiesis. PMID:16882713
Bitis gabonica (Gaboon viper) snake venom gland: toward a catalog for the full- length transcripts (cDNA) and proteins

PubMed Central

Francischetti, Ivo M. B.; My-Pham, Van; Harrison, Jim; Garfield, Mark K.; Ribeiro, José M. C.

2010-01-01

The venom gland of the snake Bitis gabonica (Gaboon viper) was used for the first time to construct a unidirectional cDNA phage library followed by high-throughput sequencing and bioinformatic analysis. Hundreds of cDNAs were obtained and clustered into contigs. We found mostly novel full-length cDNA coding for metalloproteases (P-II and P-III classes), Lys49-phospholipase A2, serine proteases with essential mutations in the active site, Kunitz protease inhibitors, several C-type lectins, bradykinin-potentiating peptide, vascular endothelial growth factor, nucleotidases and nucleases, nerve growth factor, and L-amino acid oxidases. Two new members of the recently described short coding region family of disintegrin, displaying RGD and MLD motifs are reported. In addition, we have identified for the first time a cytokine-like molecule and a multi-Kunitz protease inhibitor in snake venoms. The CLUSTAL alignment and the unrooted cladograms for selected families of B. gabonica venom proteins are also presented. A significant number of sequences were devoid of database matches, suggesting that their biologic function remains to be identified. This paper also reports the N-terminus of the 15 most abundant venom proteins and the sequences matching their corresponding transcripts. The electronic version of this manuscript, available on request, contains spreadsheets with hyperlinks to FASTA-formatted files for each contig and the best match to the GenBank and Conserved Domain Databases, in addition to CLUSTAL alignments of each contig. We have thus generated a comprehensive catalog of the B. gabonica venom gland, containing for each secreted protein: i) the predicted molecular weight, ii) the predicted isoelectric point, iii) the accession number, and iv) the putative function. The role of these molecules is discussed in the context of the envenomation caused by the Gaboon viper. PMID:15276202
Comparative genome analysis reveals genetic adaptation to versatile environmental conditions and importance of biofilm lifestyle in Comamonas testosteroni.

PubMed

Wu, Yichao; Arumugam, Krithika; Tay, Martin Qi Xiang; Seshan, Hari; Mohanty, Anee; Cao, Bin

2015-04-01

Comamonas testosteroni is an important environmental bacterium capable of degrading a variety of toxic aromatic pollutants and has been demonstrated to be a promising biocatalyst for environmental decontamination. This organism is often found to be among the primary surface colonizers in various natural and engineered ecosystems, suggesting an extraordinary capability of this organism in environmental adaptation and biofilm formation. The goal of this study was to gain genetic insights into the adaption of C. testosteroni to versatile environments and the importance of a biofilm lifestyle. Specifically, a draft genome of C. testosteroni I2 was obtained. The draft genome is 5,778,710 bp in length and comprises 110 contigs. The average G+C content was 61.88 %. A total of 5365 genes with 5263 protein-coding genes were predicted, whereas 4324 (80.60 % of total genes) protein-encoding genes were associated with predicted functions. The catabolic genes responsible for biodegradation of steroid and other aromatic compounds on draft genome were identified. Plasmid pI2 was found to encode a complete pathway for aniline degradation and a partial catabolic pathway for chloroaniline. This organism was found to be equipped with a sophisticated signaling system which helps it find ideal niches and switch between planktonic and biofilm lifestyles. A large number of putative multi-drug-resistant genes coding for abundant outer membrane transporters, chaperones, and heat shock proteins for the protection of cellular function were identified in the genome of strain I2. In addition, the genome of strain I2 was predicted to encode several proteins involved in producing, secreting, and uptaking siderophores under iron-limiting conditions. The genome of strain I2 contains a number of genes responsible for the synthesis and secretion of exopolysaccharides, an extracellular component essential for biofilm formation. Overall, our results reveal the genomic features underlying the adaption of C. testosteroni to versatile environments and highlighting the importance of its biofilm lifestyle.
A high-throughput venom-gland transcriptome for the Eastern Diamondback Rattlesnake (Crotalus adamanteus) and evidence for pervasive positive selection across toxin classes.

PubMed

Rokyta, Darin R; Wray, Kenneth P; Lemmon, Alan R; Lemmon, Emily Moriarty; Caudle, S Brian

2011-04-01

Despite causing considerable human mortality and morbidity, animal toxins represent a valuable source of pharmacologically active macromolecules, a unique system for studying molecular adaptation, and a powerful framework for examining structure-function relationships in proteins. Snake venoms are particularly useful in the latter regard as they consist primarily of a moderate number of proteins and peptides that have been found to belong to just a handful of protein families. As these proteins and peptides are produced in dedicated glands, transcriptome sequencing has proven to be an effective approach to identifying the expressed toxin genes. We generated a venom-gland transcriptome for the Eastern Diamondback Rattlesnake (Crotalus adamanteus) using Roche 454 sequencing technology. In the current work, we focus on transcripts encoding toxins. We identified 40 unique toxin transcripts, 30 of which have full-length coding sequences, and 10 have only partial coding sequences. These toxins account for 24% of the total sequencing reads. We found toxins from 11 previously described families of snake-venom toxins and have discovered two putative, previously undescribed toxin classes. The most diverse and highly expressed toxin classes in the C. adamanteus venom-gland transcriptome are the serine proteinases, metalloproteinases, and C-type lectins. The serine proteinases are the most abundant class, accounting for 35% of the toxin sequencing reads. Metalloproteinases are the most diverse; 11 different forms have been identified. Using our sequences and those available in public databases, we detected positive selection in seven of the eight toxin families for which sufficient sequences were available for the analysis. We find that the vast majority of the genes that contribute directly to this vertebrate trait show evidence for a role for positive selection in their evolutionary history. Copyright © 2011 Elsevier Ltd. All rights reserved.
Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

PubMed Central

Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

2013-01-01

In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105
Construction of a full-length enriched cDNA library and preliminary analysis of expressed sequence tags from Bengal Tiger Panthera tigris tigris.

PubMed

Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

2013-05-24

In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.
Biotechnology Conference: Protein Engineering Held in Oxford, United Kingdom on 5-8 April 1987.

DTIC Science & Technology

1987-07-27

engineered by protein engineering was reported by J. new variants which are now being checked. Brange (Novo Research Institute, Bags- Studies of a cassette...to Brange . Therefore, multidomain protein consisting of five Brange and his group applied protein en- putative domains: the fribonectin finger
A comparative hidden Markov model analysis pipeline identifies proteins characteristic of cereal-infecting fungi

PubMed Central

2013-01-01

Background Fungal pathogens cause devastating losses in economically important cereal crops by utilising pathogen proteins to infect host plants. Secreted pathogen proteins are referred to as effectors and have thus far been identified by selecting small, cysteine-rich peptides from the secretome despite increasing evidence that not all effectors share these attributes. Results We take advantage of the availability of sequenced fungal genomes and present an unbiased method for finding putative pathogen proteins and secreted effectors in a query genome via comparative hidden Markov model analyses followed by unsupervised protein clustering. Our method returns experimentally validated fungal effectors in Stagonospora nodorum and Fusarium oxysporum as well as the N-terminal Y/F/WxC-motif from the barley powdery mildew pathogen. Application to the cereal pathogen Fusarium graminearum reveals a secreted phosphorylcholine phosphatase that is characteristic of hemibiotrophic and necrotrophic cereal pathogens and shares an ancient selection process with bacterial plant pathogens. Three F. graminearum protein clusters are found with an enriched secretion signal. One of these putative effector clusters contains proteins that share a [SG]-P-C-[KR]-P sequence motif in the N-terminal and show features not commonly associated with fungal effectors. This motif is conserved in secreted pathogenic Fusarium proteins and a prime candidate for functional testing. Conclusions Our pipeline has successfully uncovered conservation patterns, putative effectors and motifs of fungal pathogens that would have been overlooked by existing approaches that identify effectors as small, secreted, cysteine-rich peptides. It can be applied to any pathogenic proteome data, such as microbial pathogen data of plants and other organisms. PMID:24252298
Profiling Antibody Responses to Infections by Chlamydia abortus Enables Identification of Potential Virulence Factors and Candidates for Serodiagnosis

PubMed Central

Forsbach-Birk, Vera; Foddis, Corinna; Simnacher, Ulrike; Wilkat, Max; Longbottom, David; Walder, Gernot; Benesch, Christiane; Ganter, Martin; Sachse, Konrad; Essig, Andreas

2013-01-01

Enzootic abortion of ewes (EAE) due to infection with the obligate intracellular pathogen Chlamydia (C.) abortus is an important zoonosis leading to considerable economic loss to agriculture worldwide. The pathogen can be transmitted to humans and may lead to serious infection in pregnant women. Knowledge about epidemiology, clinical course and transmission to humans is hampered by the lack of reliable diagnostic tools. Immunoreactive proteins, which are expressed in infected animals and humans, may serve as novel candidates for diagnostic marker proteins and represent putative virulence factors. In order to broaden the spectrum of immunogenic C. abortus proteins we applied 2D immunoblot analysis and screening of an expression library using human and animal sera. We have identified 48 immunoreactive proteins representing potential diagnostic markers and also putative virulence factors, such as CAB080 (homologue of the “macrophage infectivity potentiator”, MIP), CAB167 (homologue of the “translocated actin recruitment protein”, TARP), CAB712 (homologue of the “chlamydial protease-like activity factor”, CPAF), CAB776 (homologue of the “Polymorphic membrane protein D”, PmpD), and the “hypothetical proteins” CAB063, CAB408 and CAB821, which are predicted to be type III secreted. We selected two putative virulence factors for further characterization, i.e. CAB080 (cMIP) and CAB063, and studied their expression profiles at transcript and protein levels. Analysis of the subcellular localization of both proteins throughout the developmental cycle revealed CAB063 being the first C. abortus protein shown to be translocated to the host cell nucleus. PMID:24260366
Chimeras taking shape: Potential functions of proteins encoded by chimeric RNA transcripts

PubMed Central

Frenkel-Morgenstern, Milana; Lacroix, Vincent; Ezkurdia, Iakes; Levin, Yishai; Gabashvili, Alexandra; Prilusky, Jaime; del Pozo, Angela; Tress, Michael; Johnson, Rory; Guigo, Roderic; Valencia, Alfonso

2012-01-01

Chimeric RNAs comprise exons from two or more different genes and have the potential to encode novel proteins that alter cellular phenotypes. To date, numerous putative chimeric transcripts have been identified among the ESTs isolated from several organisms and using high throughput RNA sequencing. The few corresponding protein products that have been characterized mostly result from chromosomal translocations and are associated with cancer. Here, we systematically establish that some of the putative chimeric transcripts are genuinely expressed in human cells. Using high throughput RNA sequencing, mass spectrometry experimental data, and functional annotation, we studied 7424 putative human chimeric RNAs. We confirmed the expression of 175 chimeric RNAs in 16 human tissues, with an abundance varying from 0.06 to 17 RPKM (Reads Per Kilobase per Million mapped reads). We show that these chimeric RNAs are significantly more tissue-specific than non-chimeric transcripts. Moreover, we present evidence that chimeras tend to incorporate highly expressed genes. Despite the low expression level of most chimeric RNAs, we show that 12 novel chimeras are translated into proteins detectable in multiple shotgun mass spectrometry experiments. Furthermore, we confirm the expression of three novel chimeric proteins using targeted mass spectrometry. Finally, based on our functional annotation of exon organization and preserved domains, we discuss the potential features of chimeric proteins with illustrative examples and suggest that chimeras significantly exploit signal peptides and transmembrane domains, which can alter the cellular localization of cognate proteins. Taken together, these findings establish that some chimeric RNAs are translated into potentially functional proteins in humans. PMID:22588898
The Craterostigma plantagineum glycine-rich protein CpGRP1 interacts with a cell wall-associated protein kinase 1 (CpWAK1) and accumulates in leaf cell walls during dehydration.

PubMed

Giarola, Valentino; Krey, Stephanie; von den Driesch, Barbara; Bartels, Dorothea

2016-04-01

Craterostigma plantagineum tolerates extreme desiccation. Leaves of this plant shrink and extensively fold during dehydration and expand again during rehydration, preserving their structural integrity. Genes were analysed that may participate in the reversible folding mechanism. Analysis of transcripts abundantly expressed in desiccated leaves identified a gene putatively coding for an apoplastic glycine-rich protein (CpGRP1). We studied the expression, regulation and subcellular localization of CpGRP1 and its ability to interact with a cell wall-associated protein kinase (CpWAK1) to understand the role of CpGRP1 in the cell wall during dehydration. The CpGRP1 protein accumulates in the apoplast of desiccated leaves. Analysis of the promoter revealed that the gene expression is mainly regulated at the transcriptional level, is independent of abscisic acid (ABA) and involves a drought-responsive cis-element (DRE). CpGRP1 interacts with CpWAK1 which is down-regulated in response to dehydration. Our data suggest a role of the CpGRP1-CpWAK1 complex in dehydration-induced morphological changes in the cell wall during dehydration in C. plantagineum. Cell wall pectins and dehydration-induced pectin modifications are predicted to be involved in the activity of the CpGRP1-CpWAK1 complex. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leung, Elo; Huang, Amy; Cadag, Eithon

In this study, we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resultingmore » functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Lastly, PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequencebased genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.« less

Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations

DOE PAGES

Leung, Elo; Huang, Amy; Cadag, Eithon; ...

2016-01-20

In this study, we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resultingmore » functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Lastly, PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequencebased genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.« less
Lipid transfer particle from the silkworm, Bombyx mori, is a novel member of the apoB/large lipid transfer protein family[S

PubMed Central

Yokoyama, Hiroshi; Yokoyama, Takeru; Yuasa, Masashi; Fujimoto, Hirofumi; Sakudoh, Takashi; Honda, Naoko; Fugo, Hajime; Tsuchida, Kozo

2013-01-01

Lipid transfer particle (LTP) is a high-molecular-weight, very high-density lipoprotein known to catalyze the transfer of lipids between a variety of lipoproteins, including both insects and vertebrates. Studying the biosynthesis and regulation pathways of LTP in detail has not been possible due to a lack of information regarding the apoproteins. Here, we sequenced the cDNA and deduced amino acid sequences for three apoproteins of LTP from the silkworm (Bombyx mori). The three subunit proteins of the LTP are coded by two genes, apoLTP-II/I and apoLTP-III. ApoLTP-I and apoLTP-II are predicted to be generated by posttranslational cleavage of the precursor protein, apoLTP-II/I. Clusters of amphipathic secondary structure within apoLTP-II/I are similar to Homo sapiens apolipoprotein B (apoB) and insect lipophorins. The apoLTP-II/I gene is a novel member of the apoB/large lipid transfer protein gene family. ApoLTP-III has a putative conserved juvenile hormone-binding protein superfamily domain. Expression of apoLTP-II/I and apoLTP-III genes was synchronized and both genes were primarily expressed in the fat body at the stage corresponding to increased lipid transport needs. We are now in a position to study in detail the physiological role of LTP and its biosynthesis and assembly. PMID:23812557
Streptococcus iniae SF1: Complete Genome Sequence, Proteomic Profile, and Immunoprotective Antigens

PubMed Central

Zhang, Bao-cun; Zhang, Jian; Sun, Li

2014-01-01

Streptococcus iniae is a Gram-positive bacterium that is reckoned one of the most severe aquaculture pathogens. It has a broad host range among farmed marine and freshwater fish and can also cause zoonotic infection in humans. Here we report for the first time the complete genome sequence as well as the host factor-induced proteomic profile of a pathogenic S. iniae strain, SF1, a serotype I isolate from diseased fish. SF1 possesses a single chromosome of 2,149,844 base pairs, which contains 2,125 predicted protein coding sequences (CDS), 12 rRNA genes, and 45 tRNA genes. Among the protein-encoding CDS are genes involved in resource acquisition and utilization, signal sensing and transduction, carbohydrate metabolism, and defense against host immune response. Potential virulence genes include those encoding adhesins, autolysins, toxins, exoenzymes, and proteases. In addition, two putative prophages and a CRISPR-Cas system were found in the genome, the latter containing a CRISPR locus and four cas genes. Proteomic analysis detected 21 secreted proteins whose expressions were induced by host serum. Five of the serum-responsive proteins were subjected to immunoprotective analysis, which revealed that two of the proteins were highly protective against lethal S. iniae challenge when used as purified recombinant subunit vaccines. Taken together, these results provide an important molecular basis for future study of S. iniae in various aspects, in particular those related to pathogenesis and disease control. PMID:24621602
The Interactions between the Long Non-coding RNA NERDL and Its Target Gene Affect Wood Formation in Populus tomentosa

PubMed Central

Shi, Wan; Quan, Mingyang; Du, Qingzhang; Zhang, Deqiang

2017-01-01

Long non-coding RNAs (lncRNAs) are important regulatory factors for plant growth and development, but little is known about the allelic interactions of lncRNAs with mRNA in perennial plants. Here, we analyzed the interaction of the NERD (Needed for RDR2-independent DNA methylation) Populus tomentosa gene PtoNERD with its putative regulator, the lncRNA NERDL (NERD-related lncRNA), which partially overlaps with the promoter region of this gene. Expression analysis in eight tissues showed a positive correlation between NERDL and PtoNERD (r = 0.62), suggesting that the interaction of NERDL with its putative target might be involved in wood formation. We conducted association mapping in a natural population of P. tomentosa (435 unrelated individuals) to evaluate genetic variation and the interaction of the lncRNA NERDL with PtoNERD. Using additive and dominant models, we identified 30 SNPs (P < 0.01) associated with five tree growth and wood property traits. Each SNP explained 3.90–8.57% of phenotypic variance, suggesting that NERDL and its putative target play a common role in wood formation. Epistasis analysis uncovered nine SNP-SNP association pairs between NERDL and PtoNERD, with an information gain of -7.55 to 2.16%, reflecting the strong interactions between NERDL and its putative target. This analysis provides a powerful method for deciphering the genetic interactions of lncRNAs with mRNA and dissecting the complex genetic network of quantitative traits in trees. PMID:28674544
In silico Prediction, in vitro Antibacterial Spectrum, and Physicochemical Properties of a Putative Bacteriocin Produced by Lactobacillus rhamnosus Strain L156.4

PubMed Central

Oliveira, Letícia de C.; Silveira, Aline M. M.; Monteiro, Andréa de S.; dos Santos, Vera L.; Nicoli, Jacques R.; Azevedo, Vasco A. de C.; Soares, Siomar de C.; Dias-Souza, Marcus V.; Nardi, Regina M. D.

2017-01-01

A bacteriocinogenic Lactobacillus rhamnosus L156.4 strain isolated from the feces of NIH mice was identified by 16S rRNA gene sequencing and MALDI-TOF mass spectrometry. The entire genome was sequenced using Illumina, annotated in the PGAAP, and RAST servers, and deposited. Conserved genes associated with bacteriocin synthesis were predicted using BAGEL3, leading to the identification of an open reading frame (ORF) that shows homology with the L. rhamnosus GG (ATCC 53103) prebacteriocin gene. The encoded protein contains a conserved protein motif associated a structural gene of the Enterocin A superfamily. We found ORFs related to the prebacteriocin, immunity protein, ABC transporter proteins, and regulatory genes with 100% identity to those of L. rhamnosus HN001. In this study, we provide evidence of a putative bacteriocin produced by L. rhamnosus L156.4 that was further confirmed by in vitro assays. The antibacterial activity of the substances produced by this strain was evaluated using the deferred agar-spot and spot-on-the lawn assays, and a wide antimicrobial activity spectrum against human and foodborne pathogens was observed. The physicochemical characterization of the putative bacteriocin indicated that it was sensitive to proteolytic enzymes, heat stable and maintained its antibacterial activity in a pH ranging from 3 to 9. The activity against Lactobacillus fermentum, which was used as an indicator strain, was detected during bacterial logarithmic growth phase, and a positive correlation was confirmed between bacterial growth and production of the putative bacteriocin. After a partial purification from cell-free supernatant by salt precipitation, the putative bacteriocin migrated as a diffuse band of approximately 1.0–3.0 kDa by SDS-PAGE. Additional studies are being conducted to explore its use in the food industry for controlling bacterial growth and for probiotic applications. PMID:28579977
Understanding the molecular basis of plant growth promotional effect of Pseudomonas fluorescens on rice through protein profiling.

PubMed

Kandasamy, Saveetha; Loganathan, Karthiba; Muthuraj, Raveendran; Duraisamy, Saravanakumar; Seetharaman, Suresh; Thiruvengadam, Raguchander; Ponnusamy, Balasubramanian; Ramasamy, Samiyappan

2009-12-24

Plant Growth Promoting Rhizobacteria (PGPR), Pseudomonas fluorescens strain KH-1 was found to exhibit plant growth promotional activity in rice under both in-vitro and in-vivo conditions. But the mechanism underlying such promotional activity of P. fluorescens is not yet understood clearly. In this study, efforts were made to elucidate the molecular responses of rice plants to P. fluorescens treatment through protein profiling. Two-dimensional polyacrylamide gel electrophoresis strategy was adopted to identify the PGPR responsive proteins and the differentially expressed proteins were analyzed by mass spectrometry. Priming of P. fluorescens, 23 different proteins found to be differentially expressed in rice leaf sheaths and MS analysis revealed the differential expression of some important proteins namely putative p23 co-chaperone, Thioredoxin h- rice, Ribulose-bisphosphate carboxylase large chain precursor, Nucleotide diPhosphate kinase, Proteosome sub unit protein and putative glutathione S-transferase protein. Functional analyses of the differential proteins were reported to be directly or indirectly involved in growth promotion in plants. Thus, this study confirms the primary role of PGPR strain KH-1 in rice plant growth promotion.
Correlation approach to identify coding regions in DNA sequences

NASA Technical Reports Server (NTRS)

Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.

1994-01-01

Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.
Regulation of the aceI multidrug efflux pump gene in Acinetobacter baumannii.

PubMed

Liu, Qi; Hassan, Karl A; Ashwood, Heather E; Gamage, Hasinika K A H; Li, Liping; Mabbutt, Bridget C; Paulsen, Ian T

2018-06-01

To investigate the function of AceR, a putative transcriptional regulator of the chlorhexidine efflux pump gene aceI in Acinetobacter baumannii. Chlorhexidine susceptibility and chlorhexidine induction of aceI gene expression were determined by MIC and quantitative real-time PCR, respectively, in A. baumannii WT and ΔaceR mutant strains. Recombinant AceR was prepared as both a full-length protein and as a truncated protein, AceR (86-299), i.e. AceRt, which has the DNA-binding domain deleted. The binding interaction of the purified AceR protein and its putative operator region was investigated by electrophoretic mobility shift assays and DNase I footprinting assays. The binding of AceRt with its putative ligand chlorhexidine was examined using surface plasmon resonance and tryptophan fluorescence quenching assays. MIC determination assays indicated that the ΔaceI and ΔaceR mutant strains both showed lower resistance to chlorhexidine than the parental strain. Chlorhexidine-induced expression of aceI was abolished in a ΔaceR background. Electrophoretic mobility shift assays and DNase I footprinting assays demonstrated chlorhexidine-stimulated binding of AceR with two sites upstream of the putative aceI promoter. Surface plasmon resonance and tryptophan fluorescence quenching assays suggested that the purified ligand-binding domain of the AceR protein was able to bind with chlorhexidine with high affinity. This study provides strong evidence that AceR is an activator of aceI gene expression when challenged with chlorhexidine. This study is the first characterization, to our knowledge, of a regulator controlling expression of a PACE family multidrug efflux pump.
Electron Microscopy Structural Insights into CPAP Oligomeric Behavior: A Plausible Assembly Process of a Supramolecular Scaffold of the Centrosome

PubMed Central

Alvarez-Cabrera, Ana L.; Delgado, Sandra; Gil-Carton, David; Mortuza, Gulnahar B.; Montoya, Guillermo; Sorzano, Carlos O. S.; Tang, Tang K.; Carazo, Jose M.

2017-01-01

Centrosomal P4.1-associated protein (CPAP) is a cell cycle regulated protein fundamental for centrosome assembly and centriole elongation. In humans, the region between residues 897–1338 of CPAP mediates interactions with other proteins and includes a homodimerization domain. CPAP mutations cause primary autosomal recessive microcephaly and Seckel syndrome. Despite of the biological/clinical relevance of CPAP, its mechanistic behavior remains unclear and its C-terminus (the G-box/TCP domain) is the only part whose structure has been solved. This situation is perhaps due in part to the challenges that represent obtaining the protein in a soluble, homogeneous state for structural studies. Our work constitutes a systematic structural analysis on multiple oligomers of HsCPAP897−1338, using single-particle electron microscopy (EM) of negatively stained (NS) samples. Based on image classification into clearly different regular 3D maps (putatively corresponding to dimers and tetramers) and direct observation of individual images representing other complexes of HsCPAP897−1338 (i.e., putative flexible monomers and higher-order multimers), we report a dynamic oligomeric behavior of this protein, where different homo-oligomers coexist in variable proportions. We propose that dimerization of the putative homodimer forms a putative tetramer which could be the structural unit for the scaffold that either tethers the pericentriolar material to centrioles or promotes procentriole elongation. A coarse fitting of atomic models into the NS 3D maps at resolutions around 20 Å is performed only to complement our experimental data, allowing us to hypothesize on the oligomeric composition of the different complexes. In this way, the current EM work represents an initial step toward the structural characterization of different oligomers of CPAP, suggesting further insights to understand how this protein works, contributing to the elucidation of control mechanisms for centriole biogenesis. PMID:28396859
Genetic structure and viability selection in the golden eagle (Aquila chrysaetos), a vagile raptor with a Holarctic distribution

USGS Publications Warehouse

Doyle, Jacqueline M.; Katzner, Todd E.; Roemer, Gary; Cain, James W.; Millsap, Brian; McIntyre, Carol; Sonsthagen, Sarah A.; Fernandez, Nadia B.; Wheeler, Maria; Bulut, Zafer; Bloom, Peter; DeWoody, J. Andrew

2016-01-01

Molecular markers can reveal interesting aspects of organismal ecology and evolution, especially when surveyed in rare or elusive species. Herein, we provide a preliminary assessment of golden eagle (Aquila chrysaetos) population structure in North America using novel single nucleotide polymorphisms (SNPs). These SNPs included one molecular sexing marker, two mitochondrial markers, 85 putatively neutral markers that were derived from noncoding regions within large intergenic intervals, and 74 putatively nonneutral markers found in or very near protein-coding genes. We genotyped 523 eagle samples at these 162 SNPs and quantified genotyping error rates and variability at each marker. Our samples corresponded to 344 individual golden eagles as assessed by unique multilocus genotypes. Observed heterozygosity of known adults was significantly higher than of chicks, as was the number of heterozygous loci, indicating that mean zygosity measured across all 159 autosomal markers was an indicator of fitness as it is associated with eagle survival to adulthood. Finally, we used chick samples of known provenance to test for population differentiation across portions of North America and found pronounced structure among geographic sampling sites. These data indicate that cryptic genetic population structure is likely widespread in the golden eagle gene pool, and that extensive field sampling and genotyping will be required to more clearly delineate management units within North America and elsewhere.
Characterization of two new putative adhesins of Leptospira interrogans.

PubMed

Figueredo, Jupciana M; Siqueira, Gabriela H; de Souza, Gisele O; Heinemann, Marcos B; Vasconcellos, Silvio A; Chapola, Erica G B; Nascimento, Ana L T O

2017-01-01

We here report the characterization of two novel proteins encoded by the genes LIC11122 and LIC12287, identified in the genome sequences of Leptospira interrogans, annotated, respectively, as a putative sigma factor and a hypothetical protein. The CDSs LIC11122 and LIC12287 have signal peptide SPII and SPI and are predicted to be located mainly at the cytoplasmic membrane of the bacteria. The genes were cloned and the proteins expressed using Escherichia coli. Proteinase K digestion showed that both proteins are surface exposed. Evaluation of interaction of recombinant proteins with extracellular matrix components revealed that they are laminin binding and they were called Lsa19 (LIC11122) and Lsa14 (LIC12287), for Leptospiral-surface adhesin of 19 and 14 kDa, respectively. The bindings were dose-dependent on protein concentration, reaching saturation, fulfilling the ligand-binding criteria. Reactivity of the recombinant proteins with leptospirosis human sera has shown that Lsa19 and, to a lesser extent, Lsa14, are recognized by antibodies, suggesting that, most probably, Lsa19 is expressed during infection. The proteins interact with plasminogen and generate plasmin in the presence of urokinase-type plasminogen activator. Plasmin generation in Leptospira has been associated with tissue penetration and immune evasion strategies. The presence of a sigma factor on the cell surface playing a secondary role, probably mediating host -pathogen interaction, suggests that LIC11122 is a moonlighting protein candidate. Although the biological significance of these putative adhesins will require the generation of mutants, our data suggest that Lsa19 is a potential candidate for future evaluation of its role in adhesion/colonization activities during L. interrogans infection.
Identification of novel putative-binding proteins for cellular prion protein and a specific interaction with the STIP1 homology and U-Box-containing protein 1

PubMed Central

Gimenez, Ana Paula Lappas; Richter, Larissa Morato Luciani; Atherino, Mariana Campos; Beirão, Breno Castello Branco; Fávaro, Celso; Costa, Michele Dietrich Moura; Zanata, Silvio Marques; Malnic, Bettina; Mercadante, Adriana Frohlich

2015-01-01

ABSTRACT Prion diseases involve the conversion of the endogenous cellular prion protein, PrPC, into a misfolded infectious isoform, PrPSc. Several functions have been attributed to PrPC, and its role has also been investigated in the olfactory system. PrPC is expressed in both the olfactory bulb (OB) and olfactory epithelium (OE) and the nasal cavity is an important route of transmission of diseases caused by prions. Moreover, Prnp−/− mice showed impaired behavior in olfactory tests. Given the high PrPC expression in OE and its putative role in olfaction, we screened a mouse OE cDNA library to identify novel PrPC-binding partners. Ten different putative PrPC ligands were identified, which were involved in functions such as cellular proliferation and apoptosis, cytoskeleton and vesicle transport, ubiquitination of proteins, stress response, and other physiological processes. In vitro binding assays confirmed the interaction of PrPC with STIP1 homology and U-Box containing protein 1 (Stub1) and are reported here for the first time. Stub1 is a co-chaperone with ubiquitin E3-ligase activity, which is associated with neurodegenerative diseases characterized by protein misfolding and aggregation. Physiological and pathological implications of PrPC-Stub1 interaction are under investigation. The PrPC-binding proteins identified here are not exclusive to the OE, suggesting that these interactions may occur in other tissues and play general biological roles. These data corroborate the proposal that PrPC is part of a multiprotein complex that modulates several cellular functions and provide a platform for further studies on the physiological and pathological roles of prion protein. PMID:26237451
Identification and Characterization of Two Temperature-Induced Surface-Associated Proteins of Streptococcus suis with High Homologies to Members of the Arginine Deiminase System of Streptococcus pyogenes

PubMed Central

Winterhoff, Nora; Goethe, Ralph; Gruening, Petra; Rohde, Manfred; Kalisz, Henryk; Smith, Hilde E.; Valentin-Weigand, Peter

2002-01-01

The present study was performed to identify stress-induced putative virulence proteins of Streptococcus suis. For this, protein expression patterns of streptococci grown at 32, 37, and 42°C were compared by one- and two-dimensional gel electrophoresis. Temperature shifts from 32 and 37 to 42°C induced expression of two cell wall-associated proteins with apparent molecular masses of approximately 47 and 53 kDa. Amino-terminal sequence analysis of the two proteins indicated homologies of the 47-kDa protein with an ornithine carbamoyltransferase (OCT) from Streptococcus pyogenes and of the 53-kDa protein with the streptococcal acid glycoprotein (SAGP) from S. pyogenes, an arginine deiminase (AD) recently proposed as a putative virulence factor. Cloning and sequencing the genes encoding the putative OCT and AD of S. suis, octS and adiS, respectively, revealed that they had 81.2 (octS) and 80.2% (adiS) identity with the respective genes of S. pyogenes. Both genes belong to the AD system, also found in other bacteria. Southern hybridization analysis demonstrated the presence of the adiS gene in all 42 serotype 2 and 9 S. suis strains tested. In 9 of these 42 strains, selected randomly, we confirmed expression of the AdiS protein, homologous to SAGP, by immunoblot analysis using a specific antiserum against the SAGP of S. pyogenes. In all strains AD activity was detected. Furthermore, by immunoelectron microscopy using the anti-S. pyogenes SAGP antiserum we were able to demonstrate that the AdiS protein is expressed on the streptococcal surface in association with the capsular polysaccharides but is not coexpressed with them. PMID:12446626
Functional Analysis of the Gene Cluster Involved in Production of the Bacteriocin Circularin A by Clostridium beijerinckii ATCC 25752

PubMed Central

Kemperman, Robèr; Jonker, Marnix; Nauta, Arjen; Kuipers, Oscar P.; Kok, Jan

2003-01-01

A region of 12 kb flanking the structural gene of the cyclic antibacterial peptide circularin A of Clostridium beijerinckii ATCC 25752 was sequenced, and the putative proteins involved in the production and secretion of circularin A were identified. The genes are tightly organized in overlapping open reading frames. Heterologous expression of circularin A in Enterococcus faecalis was achieved, and five genes were identified as minimally required for bacteriocin production and secretion. Two of the putative proteins, CirB and CirC, are predicted to contain membrane-spanning domains, while CirD contains a highly conserved ATP-binding domain. Together with CirB and CirC, this ATP-binding protein is involved in the production of circularin A. The fifth gene, cirE, confers immunity towards circularin A when expressed in either Lactococcus lactis or E. faecalis and is needed in order to allow the bacteria to produce bacteriocin. Additional resistance against circularin A is conferred by the activity of the putative transporter consisting of CirB and CirD. PMID:14532033
DNA-Binding Properties of African Swine Fever Virus pA104R, a Histone-Like Protein Involved in Viral Replication and Transcription.

PubMed

Frouco, Gonçalo; Freitas, Ferdinando B; Coelho, João; Leitão, Alexandre; Martins, Carlos; Ferreira, Fernando

2017-06-15

African swine fever virus (ASFV) codes for a putative histone-like protein (pA104R) with extensive sequence homology to bacterial proteins that are implicated in genome replication and packaging. Functional characterization of purified recombinant pA104R revealed that it binds to single-stranded DNA (ssDNA) and double-stranded DNA (dsDNA) over a wide range of temperatures, pH values, and salt concentrations and in an ATP-independent manner, with an estimated binding site size of about 14 to 16 nucleotides. Using site-directed mutagenesis, the arginine located in pA104R's DNA-binding domain, at position 69, was found to be relevant for efficient DNA-binding activity. Together, pA104R and ASFV topoisomerase II (pP1192R) display DNA-supercoiling activity, although none of the proteins by themselves do, indicating that the two cooperate in this process. In ASFV-infected cells, A104R transcripts were detected from 2 h postinfection (hpi) onward, reaching a maximum concentration around 16 hpi. pA104R was detected from 12 hpi onward, localizing with viral DNA replication sites and being found exclusively in the Triton-insoluble fraction. Small interfering RNA (siRNA) knockdown experiments revealed that pA104R plays a critical role in viral DNA replication and gene expression, with transfected cells showing lower viral progeny numbers (up to a reduction of 82.0%), lower copy numbers of viral genomes (-78.3%), and reduced transcription of a late viral gene (-47.6%). Taken together, our results strongly suggest that pA104R participates in the modulation of viral DNA topology, probably being involved in viral DNA replication, transcription, and packaging, emphasizing that ASFV mutants lacking the A104R gene could be used as a strategy to develop a vaccine against ASFV. IMPORTANCE Recently reintroduced in Europe, African swine fever virus (ASFV) causes a fatal disease in domestic pigs, causing high economic losses in affected countries, as no vaccine or treatment is currently available. Remarkably, ASFV is the only known mammalian virus that putatively codes for a histone-like protein (pA104R) that shares extensive sequence homology with bacterial histone-like proteins. In this study, we characterized the DNA-binding properties of pA104R, analyzed the functional importance of two conserved residues, and showed that pA104R and ASFV topoisomerase II cooperate and display DNA-supercoiling activity. Moreover, pA104R is expressed during the late phase of infection and accumulates in viral DNA replication sites, and its downregulation revealed that pA104R is required for viral DNA replication and transcription. These results suggest that pA104R participates in the modulation of viral DNA topology and genome packaging, indicating that A104R deletion mutants may be a good strategy for vaccine development against ASFV. Copyright © 2017 American Society for Microbiology.
Chamber Specific Gene Expression Landscape of the Zebrafish Heart

PubMed Central

Singh, Angom Ramcharan; Sivadas, Ambily; Sabharwal, Ankit; Vellarikal, Shamsudheen Karuthedath; Jayarajan, Rijith; Verma, Ankit; Kapoor, Shruti; Joshi, Adita; Scaria, Vinod; Sivasubbu, Sridhar

2016-01-01

The organization of structure and function of cardiac chambers in vertebrates is defined by chamber-specific distinct gene expression. This peculiarity and uniqueness of the genetic signatures demonstrates functional resolution attributed to the different chambers of the heart. Altered expression of the cardiac chamber genes can lead to individual chamber related dysfunctions and disease patho-physiologies. Information on transcriptional repertoire of cardiac compartments is important to understand the spectrum of chamber specific anomalies. We have carried out a genome wide transcriptome profiling study of the three cardiac chambers in the zebrafish heart using RNA sequencing. We have captured the gene expression patterns of 13,396 protein coding genes in the three cardiac chambers—atrium, ventricle and bulbus arteriosus. Of these, 7,260 known protein coding genes are highly expressed (≥10 FPKM) in the zebrafish heart. Thus, this study represents nearly an all-inclusive information on the zebrafish cardiac transcriptome. In this study, a total of 96 differentially expressed genes across the three cardiac chambers in zebrafish were identified. The atrium, ventricle and bulbus arteriosus displayed 20, 32 and 44 uniquely expressing genes respectively. We validated the expression of predicted chamber-restricted genes using independent semi-quantitative and qualitative experimental techniques. In addition, we identified 23 putative novel protein coding genes that are specifically restricted to the ventricle and not in the atrium or bulbus arteriosus. In our knowledge, these 23 novel genes have either not been investigated in detail or are sparsely studied. The transcriptome identified in this study includes 68 differentially expressing zebrafish cardiac chamber genes that have a human ortholog. We also carried out spatiotemporal gene expression profiling of the 96 differentially expressed genes throughout the three cardiac chambers in 11 developmental stages and 6 tissue types of zebrafish. We hypothesize that clustering the differentially expressed genes with both known and unknown functions will deliver detailed insights on fundamental gene networks that are important for the development and specification of the cardiac chambers. It is also postulated that this transcriptome atlas will help utilize zebrafish in a better way as a model for studying cardiac development and to explore functional role of gene networks in cardiac disease pathogenesis. PMID:26815362
The prediction of a pathogenesis-related secretome of Puccinia helianthi through high-throughput transcriptome analysis.

PubMed

Jing, Lan; Guo, Dandan; Hu, Wenjie; Niu, Xiaofan

2017-03-11

Many plant pathogen secretory proteins are known to be elicitors or pathogenic factors,which play an important role in the host-pathogen interaction process. Bioinformatics approaches make possible the large scale prediction and analysis of secretory proteins from the Puccinia helianthi transcriptome. The internet-based software SignalP v4.1, TargetP v1.01, Big-PI predictor, TMHMM v2.0 and ProtComp v9.0 were utilized to predict the signal peptides and the signal peptide-dependent secreted proteins among the 35,286 ORFs of the P. helianthi transcriptome. 908 ORFs (accounting for 2.6% of the total proteins) were identified as putative secretory proteins containing signal peptides. The length of the majority of proteins ranged from 51 to 300 amino acids (aa), while the signal peptides were from 18 to 20 aa long. Signal peptidase I (SpI) cleavage sites were found in 463 of these putative secretory signal peptides. 55 proteins contained the lipoprotein signal peptide recognition site of signal peptidase II (SpII). Out of 908 secretory proteins, 581 (63.8%) have functions related to signal recognition and transduction, metabolism, transport and catabolism. Additionally, 143 putative secretory proteins were categorized into 27 functional groups based on Gene Ontology terms, including 14 groups in biological process, seven in cellular component, and six in molecular function. Gene ontology analysis of the secretory proteins revealed an enrichment of hydrolase activity. Pathway associations were established for 82 (9.0%) secretory proteins. A number of cell wall degrading enzymes and three homologous proteins specific to Phytophthora sojae effectors were also identified, which may be involved in the pathogenicity of the sunflower rust pathogen. This investigation proposes a new approach for identifying elicitors and pathogenic factors. The eventual identification and characterization of 908 extracellularly secreted proteins will advance our understanding of the molecular mechanisms of interactions between sunflower and rust pathogen and will enhance our ability to intervene in disease states.
[Prokaryotic expression of the major antigenic domain of equine arteritis virus GL protein and the establishment of putative indirect ELISA assay].

PubMed

Liang, Cheng-Zhu; Cao, Rui-Bing; Wei, Jian-Chao; Zhu, Lai-Hua; Chen, Pu-Yan

2006-06-01

According to the antigenic analysis of equine arteritis virus (EAV) GL protein, one pair of primers were designed, with which the gene fragment coding the high antigenic domain of EAV GL protein was amplified from the EAV genome. The cloned gene was digested with BamH I and Xho I and then inserted into pET-32a and resulted pET-GL1. The pET-GL1 was transformed into the host cell BL21(DE3) and the expression was optimized including cultivation temperature and concentration of IPTG. The aim protein was highly expressed and the obtained recombinant protein manifested well reactiongenicity as was confirmed by Western blot. The recombinant GL1 protein was purified by the means of His * Bind resin protein purification procedure. Then an indirect ELISA was established to detect antibody against EAV with the purified GL1 protein as the coating antigen. The result showed that the optimal concentration of coated antigen was 9.65 microg/mL and the optimal dilution of serum was 1:80. The positive criterion of this ELISA assay is OD (the tested serum) > 0.4 and OD (the tested serum) /OD (the negative serum) > 2.0. The iGL-ELISA was evaluated versus micro-virus neutralization test. The ELISA was performed on 900 sera from which were preserved by this lab during horse entry/exit inspection, the agreement (94.1%) of these test were considered suitable for individual serological detection. In another test which 180 sera samples were detected by iGL-ELISA and INGEZIM ELISA kit respectively. The agreement ratio between the two methods is 95.6%.
Crystal structure at 2.8 A of Huntingtin-interacting protein 1 (HIP1) coiled-coil domain reveals a charged surface suitable for HIP1 protein interactor (HIPPI).

PubMed

Niu, Qian; Ybe, Joel A

2008-02-01

Huntington's disease is a genetic neurological disorder that is triggered by the dissociation of the huntingtin protein (htt) from its obligate interaction partner Huntingtin-interacting protein 1 (HIP1). The release of the huntingtin protein permits HIP1 protein interactor (HIPPI) to bind to its recognition site on HIP1 to form a HIPPI/HIP1 complex that recruits procaspase-8 to begin the process of apoptosis. The interaction module between HIPPI and HIP1 was predicted to resemble a death-effector domain. Our 2.8-A crystal structure of the HIP1 371-481 subfragment that includes F432 and K474, which is important for HIPPI binding, is not a death-effector domain but is a partially opened coiled coil. The HIP1 371-481 model reveals a basic surface that we hypothesize to be suitable for binding HIPPI. There is an opened region next to the putative HIPPI site that is highly negatively charged. The acidic residues in this region are highly conserved in HIP1 and a related protein, HIP1R, from different organisms but are not conserved in the yeast homologue of HIP1, sla2p. We have modeled approximately 85% of the coiled-coil domain by joining our new HIP1 371-481 structure to the HIP1 482-586 model (Protein Data Bank code: 2NO2). Finally, the middle of this coiled-coil domain may be intrinsically flexible and suggests a new interaction model where HIPPI binds to a U-shaped HIP1 molecule.
Computational analysis identifies putative prognostic biomarkers of pathological scarring in skin wounds.

PubMed

Nagaraja, Sridevi; Chen, Lin; DiPietro, Luisa A; Reifman, Jaques; Mitrophanov, Alexander Y

2018-02-20

Pathological scarring in wounds is a prevalent clinical outcome with limited prognostic options. The objective of this study was to investigate whether cellular signaling proteins could be used as prognostic biomarkers of pathological scarring in traumatic skin wounds. We used our previously developed and validated computational model of injury-initiated wound healing to simulate the time courses for platelets, 6 cell types, and 21 proteins involved in the inflammatory and proliferative phases of wound healing. Next, we analysed thousands of simulated wound-healing scenarios to identify those that resulted in pathological (i.e., excessive) scarring. Then, we identified candidate proteins that were elevated (or decreased) at the early stages of wound healing in those simulations and could therefore serve as predictive biomarkers of pathological scarring outcomes. Finally, we performed logistic regression analysis and calculated the area under the receiver operating characteristic curve to quantitatively assess the predictive accuracy of the model-identified putative biomarkers. We identified three proteins (interleukin-10, tissue inhibitor of matrix metalloproteinase-1, and fibronectin) whose levels were elevated in pathological scars as early as 2 weeks post-wounding and could predict a pathological scarring outcome occurring 40 days after wounding with 80% accuracy. Our method for predicting putative prognostic wound-outcome biomarkers may serve as an effective means to guide the identification of proteins predictive of pathological scarring.

Bacteriophage SP6 encodes a second tailspike protein that recognizes Salmonella enterica serogroups C{sub 2} and C{sub 3}

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gebhart, Dana; Williams, Steven R.; Scholl, Dean,

SP6 is a salmonella phage closely related to coliphage K1-5. K1-5 is notable in that it encodes two polysaccharide-degrading tailspike proteins, an endosialidase that allows it to infect E. coli K1, and a lyase that enables it to infect K5 strains. SP6 is similar to K1-5 except that it encodes a P22-like endorhamnosidase tailspike, gp46, allowing it to infect group B Salmonella. We show here that SP6 can also infect Salmonella serogroups C{sub 2} and C{sub 3} and that a mutation in a putative second tailspike, gp47, eliminates this specificity. Gene 47 was fused to the coding region of themore » N-terminal portion of the Pseudomonas aeruginosa R2 pyocin tail fiber and expressed in trans such that the fusion protein becomes incorporated into pyocin particles. These pyocins, termed AvR2-SP47, killed serogroups C{sub 2} and C{sub 3}Salmonella. We conclude that SP6 encodes two tail proteins providing it a broad host range among Salmonella enterica. - Highlights: • SP6 is a “dual specificity” bacteriophage that encodes two different receptor binding proteins giving it a broad host range. • These receptor binding proteins can be used to re-target the spectrum of R-type bacteriocins to Salmonella enterica. • Both SP6 and the engineered R-type bacteriocins can kill the Salmonella serovars most associated with human disease making them attractive for development as antimicrobial agents.« less
Correlation between structure, protein composition, morphogenesis and cytopathology of Glossina pallidipes salivary gland hypertrophy virus.

PubMed

Kariithi, Henry M; van Lent, Jan W M; Boeren, Sjef; Abd-Alla, Adly M M; Ince, Ikbal Agah; van Oers, Monique M; Vlak, Just M

2013-01-01

The Glossina pallidipes salivary gland hypertrophy virus (GpSGHV) is a dsDNA virus with rod-shaped, enveloped virions. Its 190 kb genome contains 160 putative protein-coding ORFs. Here, the structural components, protein composition and associated aspects of GpSGHV morphogenesis and cytopathology were investigated. Four morphologically distinct structures: the nucleocapsid, tegument, envelope and helical surface projections, were observed in purified GpSGHV virions by electron microscopy. Nucleocapsids were present in virogenic stroma within the nuclei of infected salivary gland cells, whereas enveloped virions were located in the cytoplasm. The cytoplasm of infected cells appeared disordered and the plasma membranes disintegrated. Treatment of virions with 1 % NP-40 efficiently partitioned the virions into envelope and nucleocapsid fractions. The fractions were separated by SDS-PAGE followed by in-gel trypsin digestion and analysis of the tryptic peptides by liquid chromatography coupled to electrospray and tandem mass spectrometry. Using the MaxQuant program with Andromeda as a database search engine, a total of 45 viral proteins were identified. Of these, ten and 15 were associated with the envelope and the nucleocapsid fractions, respectively, whilst 20 were detected in both fractions, most likely representing tegument proteins. In addition, 51 host-derived proteins were identified in the proteome of the virus particle, 13 of which were verified to be incorporated into the mature virion using a proteinase K protection assay. This study provides important information about GpSGHV biology and suggests options for the development of future anti-GpSGHV strategies by interfering with virus-host interactions.
Crystal structure at 2.8Å of Huntingtin-interacting protein 1 (HIP1) coiled-coil domain reveals a charged surface suitable for HIP-protein interactor (HIPPI)

PubMed Central

Niu, Qian; Ybe, Joel A.

2008-01-01

Summary Huntington’s disease is a genetic neurological disorder that is triggered by the dissociation of the huntingtin protein (htt) from its obligate interaction partner Huntingtin-interacting protein 1 (HIP1). The release of htt permits HIP-protein interactor (HIPPI) to bind to its recognition site on HIP1 to form a HIPPI/HIP1 complex that recruits Procaspase-8 to begin the process of apoptosis. The interaction module between HIPPI and HIP1 was predicted to resemble a death-effector domain (DED). Our 2.8 Å crystal structure of the HIP1 371-481 sub-fragment that includes F432 and K474 important for HIPPI binding is not a DED, but is a partially opened coiled-coil. The HIP1 371-481 model reveals a basic surface we hypothesize is suitable for binding HIPPI. There is an opened region next to the putative HIPPI site that is highly negatively charged. The acidic residues in this region are highly conserved in HIP1 and a related protein, HIP1R from different organisms, but are not conserved in the yeast homolog of HIP1, sla2p. We have modeled ∼85% of the coiled-coil domain by joining our new HIP1 371-481 structure to the HIP1 482-586 model (PDB code: 2NO2). Finally, the middle of this coiled-coil domain may be intrinsically flexible and suggests a new interaction model where HIPPI binds to a “U” shaped HIP1 molecule. PMID:18155047
A MADS box protein interacts with a mating-type protein and is required for fruiting body development in the homothallic ascomycete Sordaria macrospora.

PubMed

Nolting, Nicole; Pöggeler, Stefanie

2006-07-01

MADS box transcription factors control diverse developmental processes in plants, metazoans, and fungi. To analyze the involvement of MADS box proteins in fruiting body development of filamentous ascomycetes, we isolated the mcm1 gene from the homothallic ascomycete Sordaria macrospora, which encodes a putative homologue of the Saccharomyces cerevisiae MADS box protein Mcm1p. Deletion of the S. macrospora mcm1 gene resulted in reduced biomass, increased hyphal branching, and reduced hyphal compartment length during vegetative growth. Furthermore, the S. macrospora Deltamcm1 strain was unable to produce fruiting bodies or ascospores during sexual development. A yeast two-hybrid analysis in conjugation with in vitro analyses demonstrated that the S. macrospora MCM1 protein can interact with the putative transcription factor SMTA-1, encoded by the S. macrospora mating-type locus. These results suggest that the S. macrospora MCM1 protein is involved in the transcriptional regulation of mating-type-specific genes as well as in fruiting body development.
In Silico Analysis of Small RNAs Suggest Roles for Novel and Conserved miRNAs in the Formation of Epigenetic Memory in Somatic Embryos of Norway Spruce.

PubMed

Yakovlev, Igor A; Fossdal, Carl G

2017-01-01

Epigenetic memory in Norway spruce affects the timing of bud burst and bud set, vitally important adaptive traits for this long-lived forest species. Epigenetic memory is established in response to the temperature conditions during embryogenesis. Somatic embryogenesis at different epitype inducing (EpI) temperatures closely mimics the natural processes of epigenetic memory formation in seeds, giving rise to epigenetically different clonal plants in a reproducible and predictable manner, with respect to altered bud phenology. MicroRNAs (miRNAs) and other small non-coding RNAs (sRNAs) play an essential role in the regulation of plant gene expression and may affect this epigenetic mechanism. We used NGS sequencing and computational in silico methods to identify and profile conserved and novel miRNAs among small RNAs in embryogenic tissues of Norway spruce at three EpI temperatures (18, 23 and 28°C). We detected three predominant classes of sRNAs related to a length of 24 nt, followed by a 21-22 nt class and a third 31 nt class of sRNAs. More than 2100 different miRNAs within the prevailing length 21-22 nt were identified. Profiling these putative miRNAs allowed identification of 1053 highly expressed miRNAs, including 523 conserved and 530 novels. 654 of these miRNAs were found to be differentially expressed (DEM) depending on EpI temperature. For most DEMs, we defined their putative mRNA targets. The targets represented mostly by transcripts of multiple-repeats proteins, like TIR, NBS-LRR, PPR and TPR repeat, Clathrin/VPS proteins, Myb-like, AP2, etc. Notably, 124 DE miRNAs targeted 203 differentially expressed epigenetic regulators. Developing Norway spruce embryos possess a more complex sRNA structure than that reported for somatic tissues. A variety of the predicted miRNAs showed distinct EpI temperature dependent expression patterns. These putative EpI miRNAs target spruce genes with a wide range of functions, including genes known to be involved in epigenetic regulation, which in turn could provide a feedback process leading to the formation of epigenetic marks. We suggest that TIR, NBS and LRR domain containing proteins could fulfill more general functions for signal transduction from external environmental stimuli and conversion them into molecular response. Fine-tuning of the miRNA production likely participates in both developmental regulation and epigenetic memory formation in Norway spruce.
Open reading frames in a 4556 nucleotide sequence within MDV-1 BamHI-D DNA fragment: evidence for splicing of mRNA from a new viral glycoprotein gene.

PubMed

Becker, Y; Asher, Y; Tabor, E; Davidson, I; Malkinson, M

1994-01-01

A DNA segment of the MDV-1 BamHI-D fragment was sequenced, and the open reading frames (ORFs) present in the 4556 nucleotide fragment were analyzed by computer programs. Computer analysis identified 19 putative ORFs in the sequence ranging from a coding capacity of 37 amino acids (aa) (ORF-1a) to 684aa (ORF-1). The special properties of four ORFs (1a, 1, 2, and 3) were investigated. Two adjacent ORFs, ORF-1a and ORF-1, were found by computer analysis to have the properties of two introns encoding a glycoprotein: ORF-1a encodes an aa sequence with the properties of a signal peptide, and ORF-1 encodes a polypeptide with a membrane anchor domain and putative N-glycosylation sites in the aa sequence. ORF-1a and ORF-1 were found to be transcribed in MDV-1-infected cells. Two RNA transcripts were detected: a precursor RNA and its spliced form. Both are transcribed from a promoter located 5' to ORF-1a, and splice donor and acceptor sites are used to splice the mRNA after cleavage of a 71-nucleotide sequence. This finding suggest that ORF-1a and ORF-1 are two introns of a new MDV-1 glycoprotein gene. The DNA sequence containing ORF-1 was transiently expressed in COS-1 cells, and the viral protein produced in these cells was found to react with anti-MDV serotype-1 Antigen B-specific monoclonal antibodies. These studies indicate that the protein encoded by ORF-1 has antigenic properties resembling Antigen B of MDV-1. A gene homologous to ORF-1 was detected in the genome of both MDV-2(SB1) and MDV-3(HVT), which serve as commercial vaccine strains. Two additional ORFs were noted in the 4556 nucleotide sequence: ORF-2, which encodes a 333 aa polypeptide initiating in the UL and terminating in the TRL prior to the putative origin of replication, and ORF-3, which encodes a 155 aa polypeptide that is partly homologous to the phosphoprotein pp38 encoded by the BamHI-H sequence. The 65 N-terminal aa of the two gene products are identical, both being derived from the nucleotide sequences in the TRL and IRL, respectively. Additional homologous aa sequences are the hydrophobic aa domain in the middle of both proteins. The functions of ORF-2, ORF-3, and additional ORFs are under study.
A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.

PubMed

Hezroni, Hadas; Ben-Tov Perry, Rotem; Meir, Zohar; Housman, Gali; Lubelsky, Yoav; Ulitsky, Igor

2017-08-30

Only a small portion of human long non-coding RNAs (lncRNAs) appear to be conserved outside of mammals, but the events underlying the birth of new lncRNAs in mammals remain largely unknown. One potential source is remnants of protein-coding genes that transitioned into lncRNAs. We systematically compare lncRNA and protein-coding loci across vertebrates, and estimate that up to 5% of conserved mammalian lncRNAs are derived from lost protein-coding genes. These lncRNAs have specific characteristics, such as broader expression domains, that set them apart from other lncRNAs. Fourteen lncRNAs have sequence similarity with the loci of the contemporary homologs of the lost protein-coding genes. We propose that selection acting on enhancer sequences is mostly responsible for retention of these regions. As an example of an RNA element from a protein-coding ancestor that was retained in the lncRNA, we describe in detail a short translated ORF in the JPX lncRNA that was derived from an upstream ORF in a protein-coding gene and retains some of its functionality. We estimate that ~ 55 annotated conserved human lncRNAs are derived from parts of ancestral protein-coding genes, and loss of coding potential is thus a non-negligible source of new lncRNAs. Some lncRNAs inherited regulatory elements influencing transcription and translation from their protein-coding ancestors and those elements can influence the expression breadth and functionality of these lncRNAs.
[Isolation of ABA-regulated genes in Oryza sativa through fluorescent differential display PCR (FDD-PCR)].

PubMed

Xu, Shou Ling; Shen, Si Shi; Xu, Zhi Hong; Xue, Hong Wei

2002-12-01

Abscisic acid (ABA) was critical in plant seed development and response to environmental factors such as stress situations. To study the possible ABA related signaling transduction pathways, we tried to isolate the ABA-regulated genes through fluorescent differential display PCR (FDD-PCR) technology using rice seedling as materials (treated with ABA for 2, 4, 8 and 12h). In the 17 fragments isolated, 14 and 3 clones were up-and down-regulated respectively. Sequence analyses revealed that the encoded proteins were involved in photosynthesis (7 fragments), signal transduction (1 fragments), transcription (2 fragments), metabolism and resistance (6 fragments), and unknown protein (1 fragments). 3 clones, encoding putative alpha/beta hydrolase fold, putative vacuolar H+ -ATPase B subunit, putative tyrosine phosphatase, were confirmed to be regulated under ABA treatment by RT-PCR and northern blot analysis. FDD-PCR and possible functional mechanisms of ABA were discussed.
Strigolactone-Induced Putative Secreted Protein 1 Is Required for the Establishment of Symbiosis by the Arbuscular Mycorrhizal Fungus Rhizophagus irregularis.

PubMed

Tsuzuki, Syusaku; Handa, Yoshihiro; Takeda, Naoya; Kawaguchi, Masayoshi

2016-04-01

Arbuscular mycorrhizal (AM) symbiosis is the most widespread association between plants and fungi. To provide novel insights into the molecular mechanisms of AM symbiosis, we screened and investigated genes of the AM fungus Rhizophagus irregularis that contribute to the infection of host plants. R. irregularis genes involved in the infection were explored by RNA-sequencing (RNA-seq) analysis. One of the identified genes was then characterized by a reverse genetic approach using host-induced gene silencing (HIGS), which causes RNA interference in the fungus via the host plant. The RNA-seq analysis revealed that 19 genes are up-regulated by both treatment with strigolactone (SL) (a plant symbiotic signal) and symbiosis. Eleven of the 19 genes were predicted to encode secreted proteins and, of these, SL-induced putative secreted protein 1 (SIS1) showed the largest induction under both conditions. In hairy roots of Medicago truncatula, SIS1 expression is knocked down by HIGS, resulting in significant suppression of colonization and formation of stunted arbuscules. These results suggest that SIS1 is a putative secreted protein that is induced in a wide spatiotemporal range including both the presymbiotic and symbiotic stages and that SIS1 positively regulates colonization of host plants by R. irregularis.
cncRNAs: Bi-functional RNAs with protein coding and non-coding functions

PubMed Central

Kumari, Pooja; Sampath, Karuna

2015-01-01

For many decades, the major function of mRNA was thought to be to provide protein-coding information embedded in the genome. The advent of high-throughput sequencing has led to the discovery of pervasive transcription of eukaryotic genomes and opened the world of RNA-mediated gene regulation. Many regulatory RNAs have been found to be incapable of protein coding and are hence termed as non-coding RNAs (ncRNAs). However, studies in recent years have shown that several previously annotated non-coding RNAs have the potential to encode proteins, and conversely, some coding RNAs have regulatory functions independent of the protein they encode. Such bi-functional RNAs, with both protein coding and non-coding functions, which we term as ‘cncRNAs’, have emerged as new players in cellular systems. Here, we describe the functions of some cncRNAs identified from bacteria to humans. Because the functions of many RNAs across genomes remains unclear, we propose that RNAs be classified as coding, non-coding or both only after careful analysis of their functions. PMID:26498036
Regulation of neural macroRNAs by the transcriptional repressor REST

PubMed Central

Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J.; Stanton, Lawrence W.; Lipovich, Leonard

2009-01-01

The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs (“macroRNAs”), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer. PMID:19050060
Regulation of neural macroRNAs by the transcriptional repressor REST.

PubMed

Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J; Stanton, Lawrence W; Lipovich, Leonard

2009-01-01

The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs ("macroRNAs"), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer.
Identification and allelic dissection uncover roles of lncRNAs in secondary growth of Populus tomentosa.

PubMed

Zhou, Daling; Du, Qingzhang; Chen, Jinhui; Wang, Qingshi; Zhang, Deqiang

2017-10-01

Long non-coding RNAs (lncRNAs) function in various biological processes. However, their roles in secondary growth of plants remain poorly understood. Here, 15,691 lncRNAs were identified from vascular cambium, developing xylem, and mature xylem of Populus tomentosa with high and low biomass using RNA-seq, including 1,994 lncRNAs that were differentially expressed (DE) among the six libraries. 3,569 cis-regulated and 3,297 trans-regulated protein-coding genes were predicted as potential target genes (PTGs) of the DE lncRNAs to participate in biological regulation. Then, 476 and 28 lncRNAs were identified as putative targets and endogenous target mimics (eTMs) of Populus known microRNAs (miRNAs), respectively. Genome re-sequencing of 435 individuals from a natural population of P. tomentosa found 34,015 single nucleotide polymorphisms (SNPs) within 178 lncRNA loci and 522 PTGs. Single-SNP associations analysis detected 2,993 associations with 10 growth and wood-property traits under additive and dominance model. Epistasis analysis identified 17,656 epistatic SNP pairs, providing evidence for potential regulatory interactions between lncRNAs and their PTGs. Furthermore, a reconstructed epistatic network, representing interactions of 8 lncRNAs and 15 PTGs, might enrich regulation roles of genes in the phenylpropanoid pathway. These findings may enhance our understanding of non-coding genes in plants. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues.

PubMed Central

Prody, C A; Zevin-Sonkin, D; Gnatt, A; Goldberg, O; Soreq, H

1987-01-01

To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase (BtChoEase; EC 3.1.1.8) and Torpedo electric organ "true" acetylcholinesterase (AcChoEase; EC 3.1.1.7). Using these probes, we isolated several cDNA clones from lambda gt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species. Images PMID:3035536
Proteins involved in neuronal differentiation of neuroblastoma cell line N1E-115.

PubMed

Oh, Ji-Eun; Freilinger, Angelika; Gelpi, Ellen; Pollak, Arnold; Hengstschläger, Markus; Lubec, Gert

2007-06-01

Neuronal differentiation (ND) represents a well-defined phenomenon in biological terms but proteins involved have not been studied systematically. We therefore aimed to study ND by retinoic acid (RA) in a widely used neuroblastoma cell line by comparative proteomics. The ND was induced in the N1E-115 cell line by serum deprivation and RA treatment. Undifferentiated cells and cells undergoing serum deprivation served as controls. Protein extracts were run on 2-DE followed by MALDI-TOF or MALDI-TOF-TOF analysis. Quantification was carried out using specific software and stringent statistical analysis was performed. Tubulin beta 5, cat eye syndrome critical region protein 5 homolog, putative GTP-binding protein PTD004 homolog, and the metabolic proteins glyceraldehyde-3-phosphate dehydrogenase and transketolase were differentially regulated. Differential protein levels of cytoskeleton proteins including tubulins and metabolic proteins have been reported to be regulated by ND. Herein, specific signaling differences as reflected by putative GTP-binding protein PTD004 changes in differentiated cells are shown and a possible role for the Cat eye syndrome critical region protein 5 homolog is proposed. The protein disulfide isomerase associated 3 protein fits the already proposed findings of chaperon regulation by ND. The study forms the molecular basis for further evaluation of the functional roles of the differentially expressed proteins in ND.
Binding of hnRNP H and U2AF65 to Respective G-codes and a Poly-Uridine Tract Collaborate in the N50-5'ss Selection of the REST N Exon in H69 Cells

PubMed Central

Ortuño-Pineda, Carlos; Galindo-Rosales, José Manuel; Calderón-Salinas, José Victor; Villegas-Sepúlveda, Nicolás; Saucedo-Cárdenas, Odila; De Nova-Ocampo, Mónica; Valdés, Jesús

2012-01-01

The splicing of the N exon in the pre-mRNA coding for the RE1-silencing transcription factor (REST) results in a truncated protein that modifies the expression pattern of some of its target genes. A weak 3'ss, three alternative 5'ss (N4-, N50-, and N62-5'ss) and a variety of putative target sites for splicing regulatory proteins are found around the N exon; two GGGG codes (G2-G3) and a poly-Uridine tract (N-PU) are found in front of the N50-5'ss. In this work we analyzed some of the regulatory factors and elements involved in the preferred selection of the N50-5'ss (N50 activation) in the small cell lung cancer cell line H69. Wild type and mutant N exon/β-globin minigenes recapitulated N50 exon splicing in H69 cells, and showed that the N-PU and the G2-G3 elements are required for N50 exon splicing. Biochemical and knockdown experiments identified these elements as U2AF65 and hnRNP H targets, respectively, and that they are also required for N50 exon activation. Compared to normal MRC5 cells, and in keeping with N50 exon activation, U2AF65, hnRNP H and other splicing factors were highly expressed in H69 cells. CLIP experiments revealed that hnRNP H RNA-binding occurs first and is a prerequisite for U2AF65 RNA binding, and EMSA and CLIP experiments suggest that U2AF65-RNA recognition displaces hnRNP H and helps to recruit other splicing factors (at least U1 70K) to the N50-5'ss. Our results evidenced novel hnRNP H and U2AF65 functions: respectively, U2AF65-recruiting to a 5'ss in humans and the hnRNP H-displacing function from two juxtaposed GGGG codes. PMID:22792276
Highly tissue specific expression of Sphinx supports its male courtship related role in Drosophila melanogaster.

PubMed

Chen, Ying; Dai, Hongzheng; Chen, Sidi; Zhang, Luoying; Long, Manyuan

2011-04-26

Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5' flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes.
Highly Tissue Specific Expression of Sphinx Supports Its Male Courtship Related Role in Drosophila melanogaster

PubMed Central

Chen, Sidi; Zhang, Luoying; Long, Manyuan

2011-01-01

Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5′ flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes. PMID:21541324
Characterization of an Invertebrate-Type Dopamine Receptor of the American Cockroach, Periplaneta americana

PubMed Central

Troppmann, Britta; Balfanz, Sabine; Krach, Christian; Baumann, Arnd; Blenau, Wolfgang

2014-01-01

We have isolated a cDNA coding for a putative invertebrate-type dopamine receptor (Peadop2) from P. americana brain by using a PCR-based strategy. The mRNA is present in samples from brain and salivary glands. We analyzed the distribution of the PeaDOP2 receptor protein with specific affinity-purified polyclonal antibodies. On Western blots, PeaDOP2 was detected in protein samples from brain, subesophageal ganglion, thoracic ganglia, and salivary glands. In immunocytochemical experiments, we detected PeaDOP2 in neurons with their somata being located at the anterior edge of the medulla bilaterally innervating the optic lobes and projecting to the ventro-lateral protocerebrum. In order to determine the functional and pharmacological properties of the cloned receptor, we generated a cell line constitutively expressing PeaDOP2. Activation of PeaDOP2-expressing cells with dopamine induced an increase in intracellular cAMP. In contrast, a C-terminally truncated splice variant of this receptor did not exhibit any functional property by itself. The molecular and pharmacological characterization of the first dopamine receptor from P. americana provides the basis for forthcoming studies focusing on the significance of the dopaminergic system in cockroach behavior and physiology. PMID:24398985
Characterization of an invertebrate-type dopamine receptor of the American cockroach, Periplaneta americana.

PubMed

Troppmann, Britta; Balfanz, Sabine; Krach, Christian; Baumann, Arnd; Blenau, Wolfgang

2014-01-06

We have isolated a cDNA coding for a putative invertebrate-type dopamine receptor (Peadop2) from P. americana brain by using a PCR-based strategy. The mRNA is present in samples from brain and salivary glands. We analyzed the distribution of the PeaDOP2 receptor protein with specific affinity-purified polyclonal antibodies. On Western blots, PeaDOP2 was detected in protein samples from brain, subesophageal ganglion, thoracic ganglia, and salivary glands. In immunocytochemical experiments, we detected PeaDOP2 in neurons with their somata being located at the anterior edge of the medulla bilaterally innervating the optic lobes and projecting to the ventro-lateral protocerebrum. In order to determine the functional and pharmacological properties of the cloned receptor, we generated a cell line constitutively expressing PeaDOP2. Activation of PeaDOP2-expressing cells with dopamine induced an increase in intracellular cAMP. In contrast, a C-terminally truncated splice variant of this receptor did not exhibit any functional property by itself. The molecular and pharmacological characterization of the first dopamine receptor from P. americana provides the basis for forthcoming studies focusing on the significance of the dopaminergic system in cockroach behavior and physiology.

The genome sequence of the plant pathogen Xylella fastidiosa. The Xylella fastidiosa Consortium of the Organization for Nucleotide Sequencing and Analysis.

PubMed

Simpson, A J; Reinach, F C; Arruda, P; Abreu, F A; Acencio, M; Alvarenga, R; Alves, L M; Araya, J E; Baia, G S; Baptista, C S; Barros, M H; Bonaccorsi, E D; Bordin, S; Bové, J M; Briones, M R; Bueno, M R; Camargo, A A; Camargo, L E; Carraro, D M; Carrer, H; Colauto, N B; Colombo, C; Costa, F F; Costa, M C; Costa-Neto, C M; Coutinho, L L; Cristofani, M; Dias-Neto, E; Docena, C; El-Dorry, H; Facincani, A P; Ferreira, A J; Ferreira, V C; Ferro, J A; Fraga, J S; França, S C; Franco, M C; Frohme, M; Furlan, L R; Garnier, M; Goldman, G H; Goldman, M H; Gomes, S L; Gruber, A; Ho, P L; Hoheisel, J D; Junqueira, M L; Kemper, E L; Kitajima, J P; Krieger, J E; Kuramae, E E; Laigret, F; Lambais, M R; Leite, L C; Lemos, E G; Lemos, M V; Lopes, S A; Lopes, C R; Machado, J A; Machado, M A; Madeira, A M; Madeira, H M; Marino, C L; Marques, M V; Martins, E A; Martins, E M; Matsukuma, A Y; Menck, C F; Miracca, E C; Miyaki, C Y; Monteriro-Vitorello, C B; Moon, D H; Nagai, M A; Nascimento, A L; Netto, L E; Nhani, A; Nobrega, F G; Nunes, L R; Oliveira, M A; de Oliveira, M C; de Oliveira, R C; Palmieri, D A; Paris, A; Peixoto, B R; Pereira, G A; Pereira, H A; Pesquero, J B; Quaggio, R B; Roberto, P G; Rodrigues, V; de M Rosa, A J; de Rosa, V E; de Sá, R G; Santelli, R V; Sawasaki, H E; da Silva, A C; da Silva, A M; da Silva, F R; da Silva, W A; da Silveira, J F; Silvestri, M L; Siqueira, W J; de Souza, A A; de Souza, A P; Terenzi, M F; Truffi, D; Tsai, S M; Tsuhako, M H; Vallada, H; Van Sluys, M A; Verjovski-Almeida, S; Vettore, A L; Zago, M A; Zatz, M; Meidanis, J; Setubal, J C

2000-07-13

Xylella fastidiosa is a fastidious, xylem-limited bacterium that causes a range of economically important plant diseases. Here we report the complete genome sequence of X. fastidiosa clone 9a5c, which causes citrus variegated chlorosis--a serious disease of orange trees. The genome comprises a 52.7% GC-rich 2,679,305-base-pair (bp) circular chromosome and two plasmids of 51,158 bp and 1,285 bp. We can assign putative functions to 47% of the 2,904 predicted coding regions. Efficient metabolic functions are predicted, with sugars as the principal energy and carbon source, supporting existence in the nutrient-poor xylem sap. The mechanisms associated with pathogenicity and virulence involve toxins, antibiotics and ion sequestration systems, as well as bacterium-bacterium and bacterium-host interactions mediated by a range of proteins. Orthologues of some of these proteins have only been identified in animal and human pathogens; their presence in X. fastidiosa indicates that the molecular basis for bacterial pathogenicity is both conserved and independent of host. At least 83 genes are bacteriophage-derived and include virulence-associated genes from other bacteria, providing direct evidence of phage-mediated horizontal gene transfer.
Genome Sequence and Analysis of the Soil Cellulolytic ActinomyceteThermobifida fusca

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lykidis, Athanasios; Mavromatis, Konstantinos; Ivanova, Natalia

Thermobifida fusca is a moderately thermophilic soilbacterium that belongs to Actinobacteria. 3 It is a major degrader ofplant cell walls and has been used as a model organism for the study of 4secreted, thermostable cellulases. The complete genome sequence showedthat T. fusca has a 5 single circular chromosome of 3642249 bp predictedto encode 3117 proteins and 65 RNA6 species with a coding densityof 85percent. Genome analysis revealed the existence of 29 putative 7glycoside hydrolases in addition to the previously identified cellulasesand xylanases. The 8 glycosyl hydrolases include enzymes predicted toexhibit mainly dextran/starch and xylan 9 degrading functions. T. fuscapossesses twomore » protein secretion systems: the sec general secretion 10system and the twin-arginine translocation system. Several of thesecreted cellulases have 11 sequence signatures indicating theirsecretion may be mediated by the twin-arginine12 translocation system. T.fusca has extensive transport systems for import of carbohydrates 13coupled to transcriptional regulators controlling the expression of thetransporters and14 glycosylhydrolases. In addition to providing anoverview of the physiology of a soil 15 actinomycete, this study presentsinsights on the transcriptional regulation and secretion of16 cellulaseswhich may facilitate the industrial exploitation of thesesystems.« less
Complete genome sequence analysis of the fish pathogen Flavobacterium columnare provides insights into antibiotic resistance and pathogenicity related genes.

PubMed

Zhang, Yulei; Zhao, Lijuan; Chen, Wenjie; Huang, Yunmao; Yang, Ling; Sarathbabu, V; Wu, Zaohe; Li, Jun; Nie, Pin; Lin, Li

2017-10-01

We analyzed here the complete genome sequences of a highly virulent Flavobacterium columnare Pf1 strain isolated in our laboratory. The complete genome consists of a 3,171,081 bp circular DNA with 2784 predicted protein-coding genes. Among these, 286 genes were predicted as antibiotic resistance genes, including 32 RND-type efflux pump related genes which were associated with the export of aminoglycosides, indicating inducible aminoglycosides resistances in F. columnare. On the other hand, 328 genes were predicted as pathogenicity related genes which could be classified as virulence factors, gliding motility proteins, adhesins, and many putative secreted proteases. These genes were probably involved in the colonization, invasion and destruction of fish tissues during the infection of F. columnare. Apparently, our obtained complete genome sequences provide the basis for the explanation of the interactions between the F. columnare and the infected fish. The predicted antibiotic resistance and pathogenicity related genes will shed a new light on the development of more efficient preventional strategies against the infection of F. columnare, which is a major worldwide fish pathogen. Copyright © 2017 Elsevier Ltd. All rights reserved.
Vanilla mosaic virus isolates from French Polynesia and the Cook Islands are Dasheen mosaic virus strains that exclusively infect vanilla.

PubMed

Farreyrol, K; Pearson, M N; Grisoni, M; Cohen, D; Beck, D

2006-05-01

Sequence was determined for the coat protein (CP) gene and 3' non-translated region (3'NTR) of two vanilla mosaic virus (VanMV) isolates from Vanilla tahitensis, respectively from the Cook Islands (VanMV-CI) and French Polynesia (VanMV-FP). Both viruses displayed distinctive features in the N-terminal region of their CPs; for VanMV-CI, a 16-amino-acid deletion including the aphid transmission-related DAG motif, and for VanMV-FP, a stretch of GTN repeats that putatively belongs to the class of natively unfolded proteins. VanMV-FP CP also has a novel DVG motif in place of the DAG motif, and an uncommon Q//V protease cleavage site. The sequences were compared to a range of Dasheen mosaic virus (DsMV) strains and to potyviruses infecting orchids. Identity was low to DsMV strains across the entire CP coding region and across the 3'NTR, but high across the CP core and the CI-6K2-NIa region. In accordance with current ICTV criteria for species demarcation within the family Potyviridae, VanMV-CI and VanMV-FP are strains of DsMV that exclusively infect vanilla.
Citrus psorosis virus RNA 1 is of negative polarity and potentially encodes in its complementary strand a 24K protein of unknown function and 280K putative RNA dependent RNA polymerase.

PubMed

Naum-Onganía, Gabriela; Gago-Zachert, Selma; Peña, Eduardo; Grau, Oscar; Garcia, Maria Laura

2003-10-01

Citrus psorosis virus (CPsV), the type member of genus Ophiovirus, has three genomic RNAs. Complete sequencing of CPsV RNA 1 revealed a size of 8184 nucleotides and Northern blot hybridization with chain specific probes showed that its non-coding strand is preferentially encapsidated. The complementary strand of RNA 1 contains two open reading frames (ORFs) separated by a 109-nt intergenic region, one located near the 5'-end potentially encoding a 24K protein of unknown function, and another of 280K containing the core polymerase motifs characteristic of viral RNA-dependent RNA polymerases (RdRp). Comparison of the core RdRp motifs of negative-stranded RNA viruses, supports grouping CPsV, Ranunculus white mottle virus (RWMV) and Mirafiori lettuce virus (MiLV) within the same genus (Ophiovirus), constituting a monophyletic group separated from all other negative-stranded RNA viruses. Furthermore, RNAs 1 of MiLV, CPsV and RWMV are similar in size and those of MiLV and CPsV also in genomic organization and sequence.
Intact Protein Analysis at 21 Tesla and X-Ray Crystallography Define Structural Differences in Single Amino Acid Variants of Human Mitochondrial Branched-Chain Amino Acid Aminotransferase 2 (BCAT2)

NASA Astrophysics Data System (ADS)

Anderson, Lissa C.; Håkansson, Maria; Walse, Björn; Nilsson, Carol L.

2017-09-01

Structural technologies are an essential component in the design of precision therapeutics. Precision medicine entails the development of therapeutics directed toward a designated target protein, with the goal to deliver the right drug to the right patient at the right time. In the field of oncology, protein structural variants are often associated with oncogenic potential. In a previous proteogenomic screen of patient-derived glioblastoma (GBM) tumor materials, we identified a sequence variant of human mitochondrial branched-chain amino acid aminotransferase 2 as a putative factor of resistance of GBM to standard-of-care-treatments. The enzyme generates glutamate, which is neurotoxic. To elucidate structural coordinates that may confer altered substrate binding or activity of the variant BCAT2 T186R, a 45 kDa protein, we applied combined ETD and CID top-down mass spectrometry in a LC-FT-ICR MS at 21 T, and X-Ray crystallography in the study of both the variant and non-variant intact proteins. The combined ETD/CID fragmentation pattern allowed for not only extensive sequence coverage but also confident localization of the amino acid variant to its position in the sequence. The crystallographic experiments confirmed the hypothesis generated by in silico structural homology modeling, that the Lys59 side-chain of BCAT2 may repulse the Arg186 in the variant protein (PDB code: 5MPR), leading to destabilization of the protein dimer and altered enzyme kinetics. Taken together, the MS and novel 3D structural data give us reason to further pursue BCAT2 T186R as a precision drug target in GBM. [Figure not available: see fulltext.
Discovery of Salmonella Virulence Factors Translocated via Outer Membrane Vesicles to Murine Macrophages.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yoon, Hyunjin; Ansong, Charles; Adkins, Joshua N.

We have previously shown that the regulators SpvR, FruR, IHF, PhoP/PhoQ, SsrA/SsrB, SlyA, Hnr, RpoE, SmpB, CsrA, RpoS, Crp, OmpR/EnvZ, and Hfq are essential for Salmonella Typhimurium virulence in mice. Here we use quantitative LC-MS-based proteomics profiling of in-frame deletion mutants of these 14 regulators to identify proteins that are coordinately regulated by these virulence regulators and are thus presumably novel factors contributing to Salmonella pathogenesis. Putative candidate proteins from proteomics analysis were determined, which exhibited similar abundance profiles to those of Salmonella pathogenicity island (SPI)-2 type III secretion system (TTSS) proteins. A subset of 5 proteins including STM0082, STM1548,more » PdgL, STM1633, and STM3595 was selected for further analysis. All 5 proteins were expressed inside macrophage cells and STM0082 (SrfN) was secreted into host cytoplasm. Furthermore, deletion of STM0082 attenuated virulence in mice when administered intraperitoneally as determined by competitive index. srfN transcription was positively regulated by SsrAB, however, secretion was independent of SPI-2 TTSS as well as SPI-1 TTSS and flagella. Proteins including PagK and STM2585A, which are positively regulated by PhoP/PhoQ, have sec signal peptides as predicted for SrfN and were secreted into macrophage cytoplasm regardless of SPI-2 TTSS. Isolation of outer membrane vesicles (OMVs) revealed the presence of SrfN, PagK, and STM2585A inside vesicle compartments. This result is the first case showing delivery of virulence effectors via OMVs in S. Typhimurium. Moreover, Hfq regulation of SrfN translation suggests that small non-coding RNAs may be responsible for regulating effector protein expression.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)

Abbasifar, Reza; Griffiths, Mansel W.; Sabour, Parviz M.

Cronobacter sakazakii is a Gram-negative pathogen found in milk-based formulae that causes infant meningitis. Bacteriophages have been proposed to control bacterial pathogens; however, comprehensive knowledge about a phage is required to ensure its safety before clinical application. We have characterized C. sakazakii phage vB{sub C}saM{sub G}AP32 (GAP32), which possesses the second largest sequenced phage genome (358,663 bp). A total of 571 genes including 545 protein coding sequences and 26 tRNAs were identified, thus more genes than in the smallest bacterium, Mycoplasma genitalium G37. BLASTP and HHpred searches, together with proteomic analyses reveal that only 23.9% of the putative proteins havemore » defined functions. Some of the unique features of this phage include: a chromosome condensation protein, two copies of the large subunit terminase, a predicted signal-arrest-release lysin; and an RpoD-like protein, which is possibly involved in the switch from immediate early to delayed early transcription. Its closest relatives are all extremely large myoviruses, namely coliphage PBECO4 and Klebsiella phage vB{sub K}leM-RaK2, with whom it shares approximately 44% homologous proteins. Since the homologs are not evenly distributed, we propose that these three phages belong to a new subfamily. - Highlights: • Cronobacter sakazakii phage vB{sub C}saM{sub G}AP32 has a genome of 358,663 bp. • It encodes 545 proteins which is more than Mycoplasma genitalium G37. • It is a member of the Myoviridae. • It is peripherally related to coliphage PBECO4 and Klebsiella phage vB{sub K}leM-RaK2. • GAP32 encodes a chromosome condensation protein.« less
Systematic analysis of human kinase genes: a large number of genes and alternative splicing events result in functional and structural diversity

PubMed Central

Milanesi, Luciano; Petrillo, Mauro; Sepe, Leandra; Boccia, Angelo; D'Agostino, Nunzio; Passamano, Myriam; Di Nardo, Salvatore; Tasco, Gianluca; Casadio, Rita; Paolella, Giovanni

2005-01-01

Background Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape, apoptosys. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinase only after computational studies. Results A systematic in silico study performed on the human genome, led to the identification of 5 genes, on chromosome 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases from NCBI but are absent in other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicates that a large number of kinase may code for alternately spliced forms or be incorrectly annotated. An InterProScan automated analysis was perfomed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices, and the cystein propensity to participate into a disulfide bridge. Conclusion The predicted human kinome was extended by identifiying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level. Structural analysis of kinase proteins domains as defined in multiple sources together with transmembrane alpha helices and signal peptide prediction provides hints to function assignment. The results of the human kinome analysis are collected in the KinWeb database, available for browsing and searching over the internet, where all results from the comparative analysis and the gene structure annotation are made available, alongside the domain information. Kinases may be searched by domain combinations and the relative genes may be viewed in a graphic browser at various level of magnification up to gene organization on the full chromosome set. PMID:16351747
Computer Simulation of the Virulome of Bacillus anthracis Using Proteomics

DTIC Science & Technology

2006-07-31

hypothetical protein gi|47526566 spermidine /putrescine ABC transporter, spermidine /putrescine-binding protein gi|47526625 oligoendopeptidase F, putative gi...glutamyl-trna(gln) amidotransferase, a subunit x gi|50196927 aspartate aminotransferase x gi|50196970 spermidine synthase x
Protein preparation and preliminary X-ray crystallographic analysis of a putative glucosamine 6-phosphate deaminase from Streptococcus mutants

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hu, Guan-Jing; Li, Lan-Fen; Li, Dan

2007-09-01

A glucosamine 6-phosphate deaminase homologue from S. mutans was expressed, purified and crystallized. Diffraction data have been collected to 2.4 Å resolution. The SMU.636 protein from Streptococcus mutans is a putative glucosamine 6-phosphate deaminase with 233 residues. The smu.636 gene was PCR-amplified from S. mutans genomic DNA and cloned into the expression vector pET-28a(+). The resultant His-tagged fusion protein was expressed in Escherichia coli and purified to homogeneity in two steps. Crystals of the fusion protein were obtained by the hanging-drop vapour-diffusion method. The crystals diffracted to 2.4 Å resolution and belong to space group P2{sub 1}2{sub 1}2{sub 1}, withmore » unit-cell parameters a = 53.83, b = 82.13, c = 134.70 Å.« less
Identification of putative Z-ring-associated proteins, involved in cell division in human pathogenic bacteria Helicobacter pylori.

PubMed

Kamran, Mohammad; Sinha, Swati; Dubey, Priyanka; Lynn, Andrew M; Dhar, Suman K

2016-07-01

Cell division in bacteria is initiated by FtsZ, which forms a Z ring at the middle of the cell, between the nucleoids. The Z ring is stabilized by Z ring-associated proteins (Zaps), which crosslink the FtsZ filaments and provide strength. The deletion of Zaps leads to the elongation phenotype with an abnormal Z ring. The components of cell division in Helicobacter pylori are similar to other gram negative bacteria except for the absence of few components including Zaps. Here, we used HHsearch to identify homologs of the missing cell division proteins and got potential hits for ZapA and ZapB, as well as for few other cell division proteins. We further validated the function of the putative ZapA homolog by genetic complementation, immuno-colocalization and biochemical analysis. © 2016 Federation of European Biochemical Societies.
Chromosomal localization of the mouse Src-like adapter protein (Slap) gene and its putative human homolog SLA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Angrist, M.; Chakravarti, A.; Wells, D.E.

1995-12-10

Molecules containing Src-homology 2 (SH2) and Src-homology 3 (SH3) domains are critical components of signal transduction pathways that serve to relay signals originating from the cell surface to the interior of the cell. Src-like adapter protein (SLAP) is a recently described adapter protein that binds activated the Eck receptor protein-tyrosine kinase. Although SLAP bears a striking homology to the SH3 and SH2 domains of the Src family of nonreceptor tyrosine kinases, it does not contain a tyrosine kinase catalytic domain. In this report, the Slap gene was mapped by linkage analysis to mouse chromosome 15, while its putative human homologmore » (SLA) was identified and mapped to human 8q22.3-qter using a panel of somatic cell hybrids. 10 refs., 2 figs.« less
Unusual varieties and duplication of Rig-I like receptors encoded in the marine mollusk, Crassostrea gigas

NASA Astrophysics Data System (ADS)

Tian, Z. H.; Jiao, C. Z.

2017-07-01

RIG-I like receptors (RLRs) play key roles in sensing non-self nucleic acids in cytoplasm and trigger antiviral innate immune response in vertebrates and human body. Here we carried out in silico analysis to identify and investigate the putative RLRs encoded in the genome of marine mollusk, Crassostrea gigas (cgRLRs), an invertebrate species. We found the unusual duplication and varieties on domain architecture of putative cgRLRs encoded in the genome of C. gigas. Three putative cgRLRs (accessions numbers are EKC24603, EKC31344.1 and EKC38304.1 on GenBank), have the similar domain architecture with that of human RIG-I or MDA5, and one protein (EKC34573.1) with that of human LGP2; The fifth putative cgRLRs (EKC38303.1) is somewhat similar with human RIG-I/MDA5 except that it has only one caspase activation and recruitment domain (CARD) in its N-terminal. Other nine proteins were identified to be partialy similar with RLRs while with the incomplete sequences, which maybe reflect the events of partial duplication of cgRLRs genes occurred in the oyster genome.
Understanding the molecular basis of plant growth promotional effect of Pseudomonas fluorescens on rice through protein profiling

PubMed Central

2009-01-01

Background Plant Growth Promoting Rhizobacteria (PGPR), Pseudomonas fluorescens strain KH-1 was found to exhibit plant growth promotional activity in rice under both in-vitro and in-vivo conditions. But the mechanism underlying such promotional activity of P. fluorescens is not yet understood clearly. In this study, efforts were made to elucidate the molecular responses of rice plants to P. fluorescens treatment through protein profiling. Two-dimensional polyacrylamide gel electrophoresis strategy was adopted to identify the PGPR responsive proteins and the differentially expressed proteins were analyzed by mass spectrometry. Results Priming of P. fluorescens, 23 different proteins found to be differentially expressed in rice leaf sheaths and MS analysis revealed the differential expression of some important proteins namely putative p23 co-chaperone, Thioredoxin h- rice, Ribulose-bisphosphate carboxylase large chain precursor, Nucleotide diPhosphate kinase, Proteosome sub unit protein and putative glutathione S-transferase protein. Conclusion Functional analyses of the differential proteins were reported to be directly or indirectly involved in growth promotion in plants. Thus, this study confirms the primary role of PGPR strain KH-1 in rice plant growth promotion. PMID:20034395
Analysis of salivary transcripts and antigens of the sand fly Phlebotomus arabicus

PubMed Central

Hostomská, Jitka; Volfová, Věra; Mu, Jianbing; Garfield, Mark; Rohoušová, Iva; Volf, Petr; Valenzuela, Jesus G; Jochim, Ryan C

2009-01-01

Background Sand fly saliva plays an important role in blood feeding and Leishmania transmission as it was shown to increase parasite virulence. On the other hand, immunity to salivary components impedes the establishment of infection. Therefore, it is most desirable to gain a deeper insight into the composition of saliva in sand fly species which serve as vectors of various forms of leishmaniases. In the present work, we focused on Phlebotomus (Adlerius) arabicus, which was recently shown to transmit Leishmania tropica, the causative agent of cutaneous leishmaniasis in Israel. Results A cDNA library from salivary glands of P. arabicus females was constructed and transcripts were sequenced and analyzed. The most abundant protein families identified were SP15-like proteins, ParSP25-like proteins, D7-related proteins, yellow-related proteins, PpSP32-like proteins, antigen 5-related proteins, and 34 kDa-like proteins. Sequences coding for apyrases, hyaluronidase and other putative secreted enzymes were also represented, including endonuclease, phospholipase, pyrophosphatase, amylase and trehalase. Mass spectrometry analysis confirmed the presence of 20 proteins predicted to be secreted in the salivary proteome. Humoral response of mice bitten by P. arabicus to salivary antigens was assessed and many salivary proteins were determined to be antigenic. Conclusion This transcriptomic analysis of P. arabicus salivary glands is the first description of salivary proteins of a sand fly in the subgenus Adlerius. Proteomic analysis of P. arabicus salivary glands produced the most comprehensive account in a single sand fly species to date. Detailed information and phylogenetic relationships of the salivary proteins are provided, expanding the knowledge base of molecules that are likely important factors of sand fly-host and sand fly-Leishmania interactions. Enzymatic and immunological investigations further demonstrate the value of functional transcriptomics in advancing biological and epidemiological research that can impact leishmaniasis. PMID:19555500
Evidence for an ergot alkaloid gene cluster in Claviceps purpurea.

PubMed

Tudzynski, P; Hölter, K; Correia, T; Arntz, C; Grammel, N; Keller, U

1999-02-01

A gene (cpd1) coding for the dimethylallyltryptophan synthase (DMATS) that catalyzes the first specific step in the biosynthesis of ergot alkaloids, was cloned from a strain of Claviceps purpurea that produces alkaloids in axenic culture. The derived gene product (CPD1) shows only 70% similarity to the corresponding gene previously isolated from Claviceps strain ATCC 26245, which is likely to be an isolate of C. fusiformis. Therefore, the related cpd1 most probably represents the first C. purpurea gene coding for an enzymatic step of the alkaloid biosynthetic pathway to be cloned. Analysis of the 3'-flanking region of cpd1 revealed a second, closely linked ergot alkaloid biosynthetic gene named cpps1, which codes for a 356-kDa polypeptide showing significant similarity to fungal modular peptide synthetases. The protein contains three amino acid-activating modules, and in the second module a sequence is found which matches that of an internal peptide (17 amino acids in length) obtained from a tryptic digest of lysergyl peptide synthetase 1 (LPS1) of C. purpurea, thus confirming that cpps1 encodes LPS1. LPS1 activates the three amino acids of the peptide portion of ergot peptide alkaloids during D-lysergyl peptide assembly. Chromosome walking revealed the presence of additional genes upstream of cpd1 which are probably also involved in ergot alkaloid biosynthesis: cpox1 probably codes for an FAD-dependent oxidoreductase (which could represent the chanoclavine cyclase), and a second putative oxidoreductase gene, cpox2, is closely linked to it in inverse orientation. RT-PCR experiments confirm that all four genes are expressed under conditions of peptide alkaloid biosynthesis. These results strongly suggest that at least some genes of ergot alkaloid biosynthesis in C. purpurea are clustered, opening the way for a detailed molecular genetic analysis of the pathway.
Identification of a putative protein profile associating with tamoxifen therapy resistance in breast cancer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Umar, Arzu; Kang, Hyuk; Timmermans, A. M.

2009-06-01

Tamoxifen-resistance is a major cause of death in patients with recurrent breast cancer. Current clinical factors can correctly predict therapy response in only half of the treated patients. Identification of proteins that associate with tamoxifen-resistance is a first step towards better response prediction and tailored treatment of patients. In the present study we intended to identify putative protein biomarkers indicative of tamoxifen therapy-resistance in breast cancer, using nanoLC coupled with FTICR MS. Comparative proteome analysis was performed on ~5,500 pooled tumor cells (corresponding to ~550 ng protein lysate/analysis) obtained through laser capture microdissection (LCM) from two independently processed data setsmore » (n=24 and n=27) containing both tamoxifen therapy-sensitive and therapy-resistant tumors. Peptides and proteins were identified by matching mass and elution time of newly acquired LC-MS features to information in previously generated accurate mass and time tag (AMT) reference databases.« less
Replicase activity of purified recombinant protein P2 of double-stranded RNA bacteriophage phi6.

PubMed

Makeyev, E V; Bamford, D H

2000-01-04

In nature, synthesis of both minus- and plus-sense RNA strands of all the known double-stranded RNA viruses occurs in the interior of a large protein assembly referred to as the polymerase complex. In addition to other proteins, the complex contains a putative polymerase possessing characteristic sequence motifs. However, none of the previous studies has shown template-dependent RNA synthesis directly with an isolated putative polymerase protein. In this report, recombinant protein P2 of double-stranded RNA bacteriophage phi6 was purified and demonstrated in an in vitro enzymatic assay to act as the replicase. The enzyme efficiently utilizes phage-specific, positive-sense RNA substrates to produce double-stranded RNA molecules, which are formed by newly synthesized, full-length minus-strands base paired with the plus-strand templates. P2-catalyzed replication is also shown to be very effective with a broad range of heterologous single-stranded RNA templates. The importance and implications of these results are discussed.
Membrane topology of Golgi-localized probable S-adenosylmethionine-dependent methyltransferase in tobacco (Nicotiana tabacum) BY-2 cells.

PubMed

Liu, Jianping; Hayashi, Kyoko; Matsuoka, Ken

2015-01-01

S-adenosylmethionine (SAM)-dependent methyltransferases (MTases) transfer methyl groups to substrates. In this study, a novel putative tobacco SAM-MTase termed Golgi-localized methyl transferase 1 (GLMT1) has been characterized. GLMT1 is comprised of 611 amino acids with short N-terminal region, putative transmembrane region, and C-terminal SAM-MTase domain. Expression of monomeric red fluorescence protein (mRFP)-tagged protein in tobacco BY-2 cell indicated that GLMT1 is a Golgi-localized protein. Analysis of the membrane topology by protease digestion suggested that both C-terminal catalytic region and N-terminal region seem to be located to the cytosolic side of the Golgi apparatus. Therefore, GLMT1 might have a different function than the previously studied SAM-MTases in plants.

Position-dependent and neuron-specific splicing regulation by the CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans

PubMed Central

Kuroyanagi, Hidehito; Watanabe, Yohei; Suzuki, Yutaka; Hagiwara, Masatoshi

2013-01-01

A large fraction of protein-coding genes in metazoans undergo alternative pre-mRNA splicing in tissue- or cell-type-specific manners. Recent genome-wide approaches have identified many putative-binding sites for some of tissue-specific trans-acting splicing regulators. However, the mechanisms of splicing regulation in vivo remain largely unknown. To elucidate the modes of splicing regulation by the neuron-specific CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans, we performed deep sequencing of poly(A)+ RNAs from the unc-75(+)- and unc-75-mutant worms and identified more than 20 cassette and mutually exclusive exons repressed or activated by UNC-75. Motif searches revealed that (G/U)UGUUGUG stretches are enriched in the upstream and downstream introns of the UNC-75-repressed and -activated exons, respectively. Recombinant UNC-75 protein specifically binds to RNA fragments carrying the (G/U)UGUUGUG stretches in vitro. Bi-chromatic fluorescence alternative splicing reporters revealed that the UNC-75-target exons are regulated in tissue-specific and (G/U)UGUUGUG element-dependent manners in vivo. The unc-75 mutation affected the splicing reporter expression specifically in the nervous system. These results indicate that UNC-75 regulates alternative splicing of its target exons in neuron-specific and position-dependent manners through the (G/U)UGUUGUG elements in C. elegans. This study thus reveals the repertoire of target events for the CELF family in the living organism. PMID:23416545
Can the 'neuron theory' be complemented by a universal mechanism for generic neuronal differentiation.

PubMed

Ernsberger, Uwe

2015-01-01

With the establishment of the 'neuron theory' at the turn of the twentieth century, this remarkably powerful term was introduced to name a breathtaking diversity of cells unified by a characteristic structural compartmentalization and unique information processing and propagating features. At the beginning of the twenty-first century, developmental, stem cell and reprogramming studies converged to suggest a common mechanism involved in the generation of possibly all vertebrate, and at least a significant number of invertebrate, neurons. Sox and, in particular, SoxB and SoxC proteins as well as basic helix-loop-helix proteins play major roles, even though their precise contributions to progenitor programming, proliferation and differentiation are not fully resolved. In addition to neuronal development, these transcription factors also regulate sensory receptor and endocrine cell development, thus specifying a range of cells with regulatory and communicative functions. To what extent microRNAs contribute to the diversification of these cell types is an upcoming question. Understanding the transcriptional and post-transcriptional regulation of genes coding for cell type-specific cytoskeletal and motor proteins as well as synaptic and ion channel proteins, which mark differences but also similarities between the three communicator cell types, will provide a key to the comprehension of their diversification and the signature of 'generic neuronal' differentiation. Apart from the general scientific significance of a putative universal core instruction for neuronal development, the impact of this line of research for cell replacement therapy and brain tumor treatment will be of considerable interest.
[Cloning, expression and transcriptional analysis of biotin carboxyl carrier protein gene (accA) from Amycolatopsis mediterranei U32 ].

PubMed

Lu, Jie; Yao, Yufeng; Jiang, Weihong; Jiao, Ruishen

2003-02-01

Acetyl CoA carboxylase (EC 6.4.1.2, ACC) catalyzes the ATP-dependent carboxylation of acetyl CoA to yield malonyl CoA, which is the first committed step in fatty acid synthesis. A pair of degenerate PCR primers were designed according to the conserved amino acid sequence of AccA from M. tuberculosis and S. coelicolor. The product of the PCR amplification, a DNA fragment of 250bp was used as a probe for screening the U32 genomic cosmid library and its gene, accA, coding the biotinylated protein subunit of acetyl CoA carboxylase, was successfully cloned from U32. The accA ORF encodes a 598-amino-acid protein with the calculated molecular mass of 63.7kD, with 70.1% of G + C content. A typical Streptomyces RBS sequence, AGGAGG, was found at the - 6 position upstream of the start codon GTG. Analysis of the deduced amino acid sequence showed the presence of biotin-binding site and putative ATP-bicarbonate interaction region, which suggested the U32 AccA may act as a biotin carboxylase as well as a biotin carrier protein. Gene accA was then cloned into the pET28 (b) vector and expressed solubly in E. coli BL21 (DE3) by 0.1 mmol/L IPTG induction. Western blot confirmed the covalent binding of biotin with AccA. Northern blot analyzed transcriptional regulation of accA by 5 different nitrogen sources.
Comparative Proteomic Analysis of Mature Pollen in Triploid and Diploid Populus deltoides

PubMed Central

Zhang, Xiao-Ling; Zhang, Jin; Guo, Ying-Hua; Sun, Pei; Jia, Hui-Xia; Fan, Wei; Lu, Meng-Zhu; Hu, Jian-Jun

2016-01-01

Ploidy affects plant growth vigor and cell size, but the relative effects of pollen fertility and allergenicity between triploid and diploid have not been systematically examined. Here we performed comparative analyses of fertility, proteome, and abundances of putative allergenic proteins of pollen in triploid poplar ‘ZhongHuai1’ (‘ZH1’, triploid) and ‘ZhongHuai2’ (‘ZH2’, diploid) generated from the same parents. The mature pollen was sterile in triploid poplar ‘ZH1’. By applying two-dimensional gel electrophoresis (2-DE), a total of 72 differentially expressed protein spots (DEPs) were detected in triploid poplar pollen. Among them, 24 upregulated and 43 downregulated proteins were identified in triploid poplar pollen using matrix-assisted laser desorption/ionisation coupled with time of-flight tandem mass spectrometer analysis (MALDI-TOF/TOF MS/MS). The main functions of these DEPs were related with “S-adenosylmethionine metabolism”, “actin cytoskeleton organization”, or “translational elongation”. The infertility of triploid poplar pollen might be related to its abnormal cytoskeletal system. In addition, the abundances of previously identified 28 putative allergenic proteins were compared among three poplar varieties (‘ZH1’, ‘ZH2’, and ‘2KEN8‘). Most putative allergenic proteins were downregulated in triploid poplar pollen. This work provides an insight into understanding the protein regulation mechanism of pollen infertility and low allergenicity in triploid poplar, and gives a clue to improving poplar polyploidy breeding and decreasing the pollen allergenicity. PMID:27598155
RNA Sequencing-Based Genome Reannotation of the Dermatophyte Arthroderma benhamiae and Characterization of Its Secretome and Whole Gene Expression Profile during Infection

PubMed Central

De Coi, Niccolò; Feuermann, Marc; Schmid-Siegert, Emanuel; Băguţ, Elena-Tatiana; Mignon, Bernard; Waridel, Patrice; Peter, Corinne; Pradervand, Sylvain

2016-01-01

ABSTRACT Dermatophytes are the most common agents of superficial mycoses in humans and animals. The aim of the present investigation was to systematically identify the extracellular, possibly secreted, proteins that are putative virulence factors and antigenic molecules of dermatophytes. A complete gene expression profile of Arthroderma benhamiae was obtained during infection of its natural host (guinea pig) using RNA sequencing (RNA-seq) technology. This profile was completed with those of the fungus cultivated in vitro in two media containing either keratin or soy meal protein as the sole source of nitrogen and in Sabouraud medium. More than 60% of transcripts deduced from RNA-seq data differ from those previously deposited for A. benhamiae. Using these RNA-seq data along with an automatic gene annotation procedure, followed by manual curation, we produced a new annotation of the A. benhamiae genome. This annotation comprised 7,405 coding sequences (CDSs), among which only 2,662 were identical to the currently available annotation, 383 were newly identified, and 15 secreted proteins were manually corrected. The expression profile of genes encoding proteins with a signal peptide in infected guinea pigs was found to be very different from that during in vitro growth when using keratin as the substrate. Especially, the sets of the 12 most highly expressed genes encoding proteases with a signal sequence had only the putative vacuolar aspartic protease gene PEP2 in common, during infection and in keratin medium. The most upregulated gene encoding a secreted protease during infection was that encoding subtilisin SUB6, which is a known major allergen in the related dermatophyte Trichophyton rubrum. IMPORTANCE Dermatophytoses (ringworm, jock itch, athlete’s foot, and nail infections) are the most common fungal infections, but their virulence mechanisms are poorly understood. Combining transcriptomic data obtained from growth under various culture conditions with data obtained during infection led to a significantly improved genome annotation. About 65% of the protein-encoding genes predicted with our protocol did not match the existing annotation for A. benhamiae. Comparing gene expression during infection on guinea pigs with keratin degradation in vitro, which is supposed to mimic the host environment, revealed the critical importance of using real in vivo conditions for investigating virulence mechanisms. The analysis of genes expressed in vivo, encoding cell surface and secreted proteins, particularly proteases, led to the identification of new allergen and virulence factor candidates. PMID:27822542
Identification and Characterization of a Novel Issatchenkia orientalis GPI-Anchored Protein, IoGas1, Required for Resistance to Low pH and Salt Stress

PubMed Central

Matsushika, Akinori; Negi, Kanako; Suzuki, Toshihiro; Goshima, Tetsuya; Hoshino, Tamotsu

2016-01-01

The use of yeasts tolerant to acid (low pH) and salt stress is of industrial importance for several bioproduction processes. To identify new candidate genes having potential roles in low-pH tolerance, we screened an expression genomic DNA library of a multiple-stress-tolerant yeast, Issatchenkia orientalis (Pichia kudriavzevii), for clones that allowed Saccharomyces cerevisiae cells to grow under highly acidic conditions (pH 2.0). A genomic DNA clone containing two putative open reading frames was obtained, of which the putative protein-coding gene comprising 1629 bp was retransformed into the host. This transformant grew significantly at pH 2.0, and at pH 2.5 in the presence of 7.5% Na2SO4. The predicted amino acid sequence of this new gene, named I. orientalis GAS1 (IoGAS1), was 60% identical to the S. cerevisiae Gas1 protein, a glycosylphosphatidylinositol-anchored protein essential for maintaining cell wall integrity, and 58–59% identical to Candida albicans Phr1 and Phr2, pH-responsive proteins implicated in cell wall assembly and virulence. Northern hybridization analyses indicated that, as for the C. albicans homologs, IoGAS1 expression was pH-dependent, with expression increasing with decreasing pH (from 4.0 to 2.0) of the medium. These results suggest that IoGAS1 represents a novel pH-regulated system required for the adaptation of I. orientalis to environments of diverse pH. Heterologous expression of IoGAS1 complemented the growth and morphological defects of a S. cerevisiae gas1Δ mutant, demonstrating that IoGAS1 and the corresponding S. cerevisiae gene play similar roles in cell wall biosynthesis. Site-directed mutagenesis experiments revealed that two conserved glutamate residues (E161 and E262) in the IoGas1 protein play a crucial role in yeast morphogenesis and tolerance to low pH and salt stress. Furthermore, overexpression of IoGAS1 in S. cerevisiae remarkably improved the ethanol fermentation ability at pH 2.5, and at pH 2.0 in the presence of salt (5% Na2SO4), compared to that of a reference strain. Our results strongly suggest that constitutive expression of the IoGAS1 gene in S. cerevisiae could be advantageous for several fermentation processes under these stress conditions. PMID:27589271
RNA Sequencing-Based Genome Reannotation of the Dermatophyte Arthroderma benhamiae and Characterization of Its Secretome and Whole Gene Expression Profile during Infection.

PubMed

Tran, Van Du T; De Coi, Niccolò; Feuermann, Marc; Schmid-Siegert, Emanuel; Băguţ, Elena-Tatiana; Mignon, Bernard; Waridel, Patrice; Peter, Corinne; Pradervand, Sylvain; Pagni, Marco; Monod, Michel

2016-01-01

Dermatophytes are the most common agents of superficial mycoses in humans and animals. The aim of the present investigation was to systematically identify the extracellular, possibly secreted, proteins that are putative virulence factors and antigenic molecules of dermatophytes. A complete gene expression profile of Arthroderma benhamiae was obtained during infection of its natural host (guinea pig) using RNA sequencing (RNA-seq) technology. This profile was completed with those of the fungus cultivated in vitro in two media containing either keratin or soy meal protein as the sole source of nitrogen and in Sabouraud medium. More than 60% of transcripts deduced from RNA-seq data differ from those previously deposited for A. benhamiae . Using these RNA-seq data along with an automatic gene annotation procedure, followed by manual curation, we produced a new annotation of the A. benhamiae genome. This annotation comprised 7,405 coding sequences (CDSs), among which only 2,662 were identical to the currently available annotation, 383 were newly identified, and 15 secreted proteins were manually corrected. The expression profile of genes encoding proteins with a signal peptide in infected guinea pigs was found to be very different from that during in vitro growth when using keratin as the substrate. Especially, the sets of the 12 most highly expressed genes encoding proteases with a signal sequence had only the putative vacuolar aspartic protease gene PEP2 in common, during infection and in keratin medium. The most upregulated gene encoding a secreted protease during infection was that encoding subtilisin SUB6, which is a known major allergen in the related dermatophyte Trichophyton rubrum . IMPORTANCE Dermatophytoses (ringworm, jock itch, athlete's foot, and nail infections) are the most common fungal infections, but their virulence mechanisms are poorly understood. Combining transcriptomic data obtained from growth under various culture conditions with data obtained during infection led to a significantly improved genome annotation. About 65% of the protein-encoding genes predicted with our protocol did not match the existing annotation for A. benhamiae . Comparing gene expression during infection on guinea pigs with keratin degradation in vitro , which is supposed to mimic the host environment, revealed the critical importance of using real in vivo conditions for investigating virulence mechanisms. The analysis of genes expressed in vivo , encoding cell surface and secreted proteins, particularly proteases, led to the identification of new allergen and virulence factor candidates.
Comparative Analysis of Predicted Plastid-Targeted Proteomes of Sequenced Higher Plant Genomes

PubMed Central

Schaeffer, Scott; Harper, Artemus; Raja, Rajani; Jaiswal, Pankaj; Dhingra, Amit

2014-01-01

Plastids are actively involved in numerous plant processes critical to growth, development and adaptation. They play a primary role in photosynthesis, pigment and monoterpene synthesis, gravity sensing, starch and fatty acid synthesis, as well as oil, and protein storage. We applied two complementary methods to analyze the recently published apple genome (Malus × domestica) to identify putative plastid-targeted proteins, the first using TargetP and the second using a custom workflow utilizing a set of predictive programs. Apple shares roughly 40% of its 10,492 putative plastid-targeted proteins with that of the Arabidopsis (Arabidopsis thaliana) plastid-targeted proteome as identified by the Chloroplast 2010 project and ∼57% of its entire proteome with Arabidopsis. This suggests that the plastid-targeted proteomes between apple and Arabidopsis are different, and interestingly alludes to the presence of differential targeting of homologs between the two species. Co-expression analysis of 2,224 genes encoding putative plastid-targeted apple proteins suggests that they play a role in plant developmental and intermediary metabolism. Further, an inter-specific comparison of Arabidopsis, Prunus persica (Peach), Malus × domestica (Apple), Populus trichocarpa (Black cottonwood), Fragaria vesca (Woodland Strawberry), Solanum lycopersicum (Tomato) and Vitis vinifera (Grapevine) also identified a large number of novel species-specific plastid-targeted proteins. This analysis also revealed the presence of alternatively targeted homologs across species. Two separate analyses revealed that a small subset of proteins, one representing 289 protein clusters and the other 737 unique protein sequences, are conserved between seven plastid-targeted angiosperm proteomes. Majority of the novel proteins were annotated to play roles in stress response, transport, catabolic processes, and cellular component organization. Our results suggest that the current state of knowledge regarding plastid biology, preferentially based on model systems is deficient. New plant genomes are expected to enable the identification of potentially new plastid-targeted proteins that will aid in studying novel roles of plastids. PMID:25393533
Comparative transcriptome analysis to investigate the potential role of miRNAs in milk protein/fat quality.

PubMed

Wang, Xuehui; Zhang, Li; Jin, Jing; Xia, Anting; Wang, Chunmei; Cui, Yingjun; Qu, Bo; Li, Qingzhang; Sheng, Chunyan

2018-04-19

miRNAs play an important role in the processes of cell differentiation, biological development, and physiology. Here we investigated the molecular mechanisms regulating milk secretion and quality in dairy cows via transcriptome analyses of mammary gland tissues from dairy cows during the high-protein/high-fat, low-protein/low-fat or dry periods. To characterize the important roles of miRNAs and mRNAs in milk quality and to elucidate their regulatory networks in relation to milk secretion and quality, an integrated analysis was performed. A total of 25 core miRNAs were found to be differentially expressed (DE) during lactation compared to non-lactation, and these miRNAs were involved in epithelial cell terminal differentiation and mammary gland development. In addition, comprehensive analysis of mRNA and miRNA expression between high-protein/high-fat group and low-protein/low-fat groups indicated that, 38 miRNAs and 944 mRNAs were differentially expressed between them. Furthermore, 38 DE miRNAs putatively negatively regulated 253 DE mRNAs. The putative genes (253 DE mRNAs) were enriched in lipid biosynthetic process and amino acid transmembrane transporter activity. Moreover, putative DE genes were significantly enriched in fatty acid (FA) metabolism, biosynthesis of amino acids, synthesis and degradation of ketone bodies and biosynthesis of unsaturated FAs. Our results suggest that DE miRNAs might play roles as regulators of milk quality and milk secretion during mammary gland differentiation.
Characterization of Plasmids in a Human Clinical Strain of Lactococcus garvieae

PubMed Central

Blanco, M. Mar; López-Campos, Guillermo H.; Cutuli, M. Teresa; Fernández-Garayzábal, José F.

2012-01-01

The present work describes the molecular characterization of five circular plasmids found in the human clinical strain Lactococcus garvieae 21881. The plasmids were designated pGL1-pGL5, with molecular sizes of 4,536 bp, 4,572 bp, 12,948 bp, 14,006 bp and 68,798 bp, respectively. Based on detailed sequence analysis, some of these plasmids appear to be mosaics composed of DNA obtained by modular exchange between different species of lactic acid bacteria. Based on sequence data and the derived presence of certain genes and proteins, the plasmid pGL2 appears to replicate via a rolling-circle mechanism, while the other four plasmids appear to belong to the group of lactococcal theta-type replicons. The plasmids pGL1, pGL2 and pGL5 encode putative proteins related with bacteriocin synthesis and bacteriocin secretion and immunity. The plasmid pGL5 harbors genes (txn, orf5 and orf25) encoding proteins that could be considered putative virulence factors. The gene txn encodes a protein with an enzymatic domain corresponding to the family actin-ADP-ribosyltransferases toxins, which are known to play a key role in pathogenesis of a variety of bacterial pathogens. The genes orf5 and orf25 encode two putative surface proteins containing the cell wall-sorting motif LPXTG, with mucin-binding and collagen-binding protein domains, respectively. These proteins could be involved in the adherence of L. garvieae to mucus from the intestine, facilitating further interaction with intestinal epithelial cells and to collagenous tissues such as the collagen-rich heart valves. To our knowledge, this is the first report on the characterization of plasmids in a human clinical strain of this pathogen. PMID:22768237
Covisualization in living onion cells of putative integrin, putative spectrin, actin, putative intermediate filaments, and other proteins at the cell membrane and in an endomembrane sheath

NASA Technical Reports Server (NTRS)

Reuzeau, C.; Doolittle, K. W.; McNally, J. G.; Pickard, B. G.; Evans, M. L. (Principal Investigator)

1997-01-01

Covisualizations with wide-field computational optical-sectioning microscopy of living epidermal cells of the onion bulb scale have evidenced two major new cellular features. First, a sheath of cytoskeletal elements clads the endomembrane system. Similar elements clad the inner faces of punctate plasmalemmal sites interpreted as plasmalemmal control centers. One component of the endomembrane sheath and plasmalemmal control center cladding is anti-genicity-recognized by two injected antibodies against animal spectrin. Immunoblots of separated epidermal protein also showed bands recognized by these antibodies. Injected phalloidin identified F-actin with the same cellular distribution pattern, as did antibodies against intermediate-filament protein and other cytoskeletal elements known from animal cells. Injection of general protein stains demonstrated the abundance of endomembrane sheath protein. Second, the endomembrane system, like the plasmalemmal puncta, contains antigen recognized by an anti-beta 1 integrin injected into the cytoplasm. Previously, immunoblots of separated epidermal protein were shown to have a major band recognized both by this antibody prepared against a peptide representing the cytosolic region of beta 1 integrin and an antibody against the matrix region of beta 1 integrin. The latter antiboby also identified puncta at the external face of protoplasts. It is proposed that integrin and associated transmembrane proteins secure the endomembrane sheath and transmit signals between it and the lumen or matrix of the endoplasmic reticulum and organellar matrices. This function is comparable to that proposed for such transmembrane linkers in the plasmalemmal control centers, which also appear to bind cytoskeleton and a host of related molecules and transmit signals between them and the wall matrix. It is at the plasmalemmal control centers that the endoplasmic reticulum, a major component of the endomembrane system, attaches to the plasma membrane.
Identification of a putative triacylglycerol lipase from papaya latex by functional proteomics.

PubMed

Dhouib, R; Laroche-Traineau, J; Shaha, R; Lapaillerie, D; Solier, E; Rualès, J; Pina, M; Villeneuve, P; Carrière, F; Bonneu, M; Arondel, V

2011-01-01

Latex from Caricaceae has been known since 1925 to contain strong lipase activity. However, attempts to purify and identify the enzyme were not successful, mainly because of the lack of solubility of the enzyme. Here, we describe the characterization of lipase activity of the latex of Vasconcellea heilbornii and the identification of a putative homologous lipase from Carica papaya. Triacylglycerol lipase activity was enriched 74-fold from crude latex of Vasconcellea heilbornii to a specific activity (SA) of 57 μmol·min(-1)·mg(-1) on long-chain triacylglycerol (olive oil). The extract was also active on trioctanoin (SA = 655 μmol·min(-1)·mg(-1) ), tributyrin (SA = 1107 μmol·min(-1)·mg(-1) ) and phosphatidylcholine (SA = 923 μmol·min(-1)·mg(-1) ). The optimum pH ranged from 8.0 to 9.0. The protein content of the insoluble fraction of latex was analyzed by electrophoresis followed by mass spectrometry, and 28 different proteins were identified. The protein fraction was incubated with the lipase inhibitor [(14) C]tetrahydrolipstatin, and a 45 kDa protein radiolabeled by the inhibitor was identified as being a putative lipase. A C. papaya cDNA encoding a 55 kDa protein was further cloned, and its deduced sequence had 83.7% similarity with peptides from the 45 kDa protein, with a coverage of 25.6%. The protein encoded by this cDNA had 35% sequence identity and 51% similarity to castor bean acid lipase, suggesting that it is the lipase responsible for the important lipolytic activities detected in papaya latex. © 2010 The Authors Journal compilation © 2010 FEBS.
A single active trehalose-6-P synthase (TPS) and a family of putative regulatory TPS-like proteins in Arabidopsis.

PubMed

Vandesteene, Lies; Ramon, Matthew; Le Roy, Katrien; Van Dijck, Patrick; Rolland, Filip

2010-03-01

Higher plants typically do not produce trehalose in large amounts, but their genome sequences reveal large families of putative trehalose metabolism enzymes. An important regulatory role in plant growth and development is also emerging for the metabolic intermediate trehalose-6-P (T6P). Here, we present an update on Arabidopsis trehalose metabolism and a resource for further detailed analyses. In addition, we provide evidence that Arabidopsis encodes a single trehalose-6-P synthase (TPS) next to a family of catalytically inactive TPS-like proteins that might fulfill specific regulatory functions in actively growing tissues.
Construction of a full-length cDNA Library from Chinese oak silkworm pupa and identification of a KK-42-binding protein gene in relation to pupa-diapause termination.

PubMed

Li, Yu-Ping; Xia, Run-Xi; Wang, Huan; Li, Xi-Sheng; Liu, Yan-Qun; Wei, Zhao-Jun; Lu, Cheng; Xiang, Zhong-Huai

2009-06-24

In this study we successfully constructed a full-length cDNA library from Chinese oak silkworm, Antheraea pernyi, the most well-known wild silkworm used for silk production and insect food. Total RNA was extracted from a single fresh female pupa at the diapause stage. The titer of the library was 5 x 10(5) cfu/ml and the proportion of recombinant clones was approximately 95%. Expressed sequence tag (EST) analysis was used to characterize the library. A total of 175 clustered ESTs consisting of 24 contigs and 151 singlets were generated from 250 effective sequences. Of the 175 unigenes, 97 (55.4%) were known genes but only five from A. pernyi, 37 (21.2%) were known ESTs without function annotation, and 41 (23.4%) were novel ESTs. By EST sequencing, a gene coding KK-42-binding protein in A. pernyi (named as ApKK42-BP; GenBank accession no. FJ744151) was identified and characterized. Protein sequence analysis showed that ApKK42-BP was not a membrane protein but an extracellular protein with a signal peptide at position 1-18, and contained two putative conserved domains, abhydro_lipase and abhydrolase_1, suggesting it may be a member of lipase superfamily. Expression analysis based on number of ESTs showed that ApKK42-BP was an abundant gene in the period of diapause stage, suggesting it may also be involved in pupa-diapause termination.
Construction of a full-length cDNA Library from Chinese oak silkworm pupa and identification of a KK-42-binding protein gene in relation to pupa-diapause termination

PubMed Central

Li, Yu-Ping; Xia, Run-Xi; Wang, Huan; Li, Xi-Sheng; Liu, Yan-Qun; Wei, Zhao-Jun; Lu, Cheng; Xiang, Zhong-Huai

2009-01-01

In this study we successfully constructed a full-length cDNA library from Chinese oak silkworm, Antheraea pernyi, the most well-known wild silkworm used for silk production and insect food. Total RNA was extracted from a single fresh female pupa at the diapause stage. The titer of the library was 5 × 105 cfu/ml and the proportion of recombinant clones was approximately 95%. Expressed sequence tag (EST) analysis was used to characterize the library. A total of 175 clustered ESTs consisting of 24 contigs and 151 singlets were generated from 250 effective sequences. Of the 175 unigenes, 97 (55.4%) were known genes but only five from A. pernyi, 37 (21.2%) were known ESTs without function annotation, and 41 (23.4%) were novel ESTs. By EST sequencing, a gene coding KK-42-binding protein in A. pernyi (named as ApKK42-BP; GenBank accession no. FJ744151) was identified and characterized. Protein sequence analysis showed that ApKK42-BP was not a membrane protein but an extracellular protein with a signal peptide at position 1-18, and contained two putative conserved domains, abhydro_lipase and abhydrolase_1, suggesting it may be a member of lipase superfamily. Expression analysis based on number of ESTs showed that ApKK42-BP was an abundant gene in the period of diapause stage, suggesting it may also be involved in pupa-diapause termination. PMID:19564928
Prokaryote-derived protein inhibitors of peptidases: a sketchy occurrence and mostly unknown function

PubMed Central

Kantyka, Tomasz; Rawlings, Neil D.; Potempa, Jan

2010-01-01

In metazoan organisms protein inhibitors of peptidases are important factors essential for regulation of proteolytic activity. In vertebrates genes encoding peptidase inhibitors constitute up to 1% of genes reflecting a need for tight and specific control of proteolysis especially in extracellular body fluids. In stark contrast unicellular organisms, both prokaryotic and eukaryotic consistently contain only few, if any, genes coding for putative peptidase inhibitors. This may seem perplexing in the light of the fact that these organisms produce large numbers of proteases of different catalytic classes with the genes constituting up to 6% of the total gene count with the average being about 3%. Apparently, however, a unicellular life-style is fully compatible with other mechanisms of regulation of proteolysis and does not require protein inhibitors to control their intracellular and extracellular proteolytic activity. So in prokaryotes occurrence of genes encoding different types of peptidase inhibitors is infrequent and often scattered among phylogenetically distinct orders or even phyla of microbiota. Genes encoding proteins homologous to alpha-2-macroglobulin (family I39), serine carboxypeptidase Y inhibitor (family I51), alpha-1-peptidase inhibitor (family I4) and ecotin (family I11) are the most frequently represented in Bacteria. Although several of these gene products were shown to possess inhibitory activity, with an exception of ecotin and staphostatins, the biological function of microbial inhibitors is unclear. In this review we present distribution of protein inhibitors from different families among prokaryotes, describe their mode of action and hypothesize on their role in microbial physiology and interactions with hosts and environment. PMID:20558234
Transcriptional regulation of the operon encoding stress-responsive ECF sigma factor SigH and its anti-sigma factor RshA, and control of its regulatory network in Corynebacterium glutamicum

PubMed Central

2012-01-01

Background The expression of genes in Corynebacterium glutamicum, a Gram-positive non-pathogenic bacterium used mainly for the industrial production of amino acids, is regulated by seven different sigma factors of RNA polymerase, including the stress-responsive ECF-sigma factor SigH. The sigH gene is located in a gene cluster together with the rshA gene, putatively encoding an anti-sigma factor. The aim of this study was to analyze the transcriptional regulation of the sigH and rshA gene cluster and the effects of RshA on the SigH regulon, in order to refine the model describing the role of SigH and RshA during stress response. Results Transcription analyses revealed that the sigH gene and rshA gene are cotranscribed from four sigH housekeeping promoters in C. glutamicum. In addition, a SigH-controlled rshA promoter was found to only drive the transcription of the rshA gene. To test the role of the putative anti-sigma factor gene rshA under normal growth conditions, a C. glutamicum rshA deletion strain was constructed and used for genome-wide transcription profiling with DNA microarrays. In total, 83 genes organized in 61 putative transcriptional units, including those previously detected using sigH mutant strains, exhibited increased transcript levels in the rshA deletion mutant compared to its parental strain. The genes encoding proteins related to disulphide stress response, heat stress proteins, components of the SOS-response to DNA damage and proteasome components were the most markedly upregulated gene groups. Altogether six SigH-dependent promoters upstream of the identified genes were determined by primer extension and a refined consensus promoter consisting of 45 original promoter sequences was constructed. Conclusions The rshA gene codes for an anti-sigma factor controlling the function of the stress-responsive sigma factor SigH in C. glutamicum. Transcription of rshA from a SigH-dependent promoter may serve to quickly shutdown the SigH-dependent stress response after the cells have overcome the stress condition. Here we propose a model of the regulation of oxidative and heat stress response including redox homeostasis by SigH, RshA and the thioredoxin system. PMID:22943411
Transcriptional regulation of the operon encoding stress-responsive ECF sigma factor SigH and its anti-sigma factor RshA, and control of its regulatory network in Corynebacterium glutamicum.

PubMed

Busche, Tobias; Silar, Radoslav; Pičmanová, Martina; Pátek, Miroslav; Kalinowski, Jörn

2012-09-03

The expression of genes in Corynebacterium glutamicum, a Gram-positive non-pathogenic bacterium used mainly for the industrial production of amino acids, is regulated by seven different sigma factors of RNA polymerase, including the stress-responsive ECF-sigma factor SigH. The sigH gene is located in a gene cluster together with the rshA gene, putatively encoding an anti-sigma factor. The aim of this study was to analyze the transcriptional regulation of the sigH and rshA gene cluster and the effects of RshA on the SigH regulon, in order to refine the model describing the role of SigH and RshA during stress response. Transcription analyses revealed that the sigH gene and rshA gene are cotranscribed from four sigH housekeeping promoters in C. glutamicum. In addition, a SigH-controlled rshA promoter was found to only drive the transcription of the rshA gene. To test the role of the putative anti-sigma factor gene rshA under normal growth conditions, a C. glutamicum rshA deletion strain was constructed and used for genome-wide transcription profiling with DNA microarrays. In total, 83 genes organized in 61 putative transcriptional units, including those previously detected using sigH mutant strains, exhibited increased transcript levels in the rshA deletion mutant compared to its parental strain. The genes encoding proteins related to disulphide stress response, heat stress proteins, components of the SOS-response to DNA damage and proteasome components were the most markedly upregulated gene groups. Altogether six SigH-dependent promoters upstream of the identified genes were determined by primer extension and a refined consensus promoter consisting of 45 original promoter sequences was constructed. The rshA gene codes for an anti-sigma factor controlling the function of the stress-responsive sigma factor SigH in C. glutamicum. Transcription of rshA from a SigH-dependent promoter may serve to quickly shutdown the SigH-dependent stress response after the cells have overcome the stress condition. Here we propose a model of the regulation of oxidative and heat stress response including redox homeostasis by SigH, RshA and the thioredoxin system.
Chitinase Expression in Listeria monocytogenes Is Influenced by lmo0327, Which Encodes an Internalin-Like Protein.

PubMed

Paspaliari, Dafni Katerina; Kastbjerg, Vicky Gaedt; Ingmer, Hanne; Popowska, Magdalena; Larsen, Marianne Halberg

2017-11-15

The chitinolytic system of Listeria monocytogenes thus far comprises two chitinases, ChiA and ChiB, and a lytic polysaccharide monooxygenase, Lmo2467. The role of the system in the bacterium appears to be pleiotropic, as besides mediating the hydrolysis of chitin, the second most ubiquitous carbohydrate in nature, the chitinases have been deemed important for the colonization of unicellular molds, as well as mammalian hosts. To identify additional components of the chitinolytic system, we screened a transposon mutant library for mutants exhibiting impaired chitin hydrolysis. The screening yielded a mutant with a transposon insertion in a locus corresponding to lmo0327 of the EGD-e strain. lmo0327 encodes a large (1,349 amino acids [aa]) cell wall-associated protein that has been proposed to possess murein hydrolase activity. The single inactivation of lmo0327 , as well as of lmo0325 that codes for a putative transcriptional regulator functionally related to lmo0327 , led to an almost complete abolishment of chitinolytic activity. The effect could be traced at the transcriptional level, as both chiA and chiB transcripts were dramatically decreased in the lmo0327 mutant. In accordance with that, we could barely detect ChiA and ChiB in the culture supernatants of the mutant strain. Our results provide new information regarding the function of the lmo0325-lmo0327 locus in L. monocytogenes and link it to the expression of chitinolytic activity. IMPORTANCE Many bacteria from terrestrial and marine environments express chitinase activities enabling them to utilize chitin as the sole source of carbon and nitrogen. Interestingly, several bacterial chitinases may also be involved in host pathogenesis. For example, in the important foodborne pathogen Listeria monocytogenes , the chitinases ChiA and ChiB and the lytic polysaccharide monooxygenase Lmo2467 are implicated in chitin assimilation but also act as virulence factors during the infection of mammalian hosts. Therefore, it is important to identify their regulators and induction cues to understand how the different roles of the chitinolytic system are controlled and mediated. Here, we provide evidence for the importance of lmo0327 and lmo0325 , encoding a putative internalin/autolysin and a putative transcriptional activator, respectively, in the efficient expression of chitinase activity in L. monocytogenes and thereby provide new information regarding the function of the lmo0325-lmo0327 locus. Copyright © 2017 Paspaliari et al.
Genome sequence and comparative analysis of a putative entomopathogenic Serratia isolated from Caenorhabditis briggsae.

PubMed

Abebe-Akele, Feseha; Tisa, Louis S; Cooper, Vaughn S; Hatcher, Philip J; Abebe, Eyualem; Thomas, W Kelley

2015-07-18

Entomopathogenic associations between nematodes in the genera Steinernema and Heterorhabdus with their cognate bacteria from the bacterial genera Xenorhabdus and Photorhabdus, respectively, are extensively studied for their potential as biological control agents against invasive insect species. These two highly coevolved associations were results of convergent evolution. Given the natural abundance of bacteria, nematodes and insects, it is surprising that only these two associations with no intermediate forms are widely studied in the entomopathogenic context. Discovering analogous systems involving novel bacterial and nematode species would shed light on the evolutionary processes involved in the transition from free living organisms to obligatory partners in entomopathogenicity. We report the complete genome sequence of a new member of the enterobacterial genus Serratia that forms a putative entomopathogenic complex with Caenorhabditis briggsae. Analysis of the 5.04 MB chromosomal genome predicts 4599 protein coding genes, seven sets of ribosomal RNA genes, 84 tRNA genes and a 64.8 KB plasmid encoding 74 genes. Comparative genomic analysis with three of the previously sequenced Serratia species, S. marcescens DB11 and S. proteamaculans 568, and Serratia sp. AS12, revealed that these four representatives of the genus share a core set of ~3100 genes and extensive structural conservation. The newly identified species shares a more recent common ancestor with S. marcescens with 99% sequence identity in rDNA sequence and orthology across 85.6% of predicted genes. Of the 39 genes/operons implicated in the virulence, symbiosis, recolonization, immune evasion and bioconversion, 21 (53.8%) were present in Serratia while 33 (84.6%) and 35 (89%) were present in Xenorhabdus and Photorhabdus EPN bacteria respectively. The majority of unique sequences in Serratia sp. SCBI (South African Caenorhabditis briggsae Isolate) are found in ~29 genomic islands of 5 to 65 genes and are enriched in putative functions that are biologically relevant to an entomopathogenic lifestyle, including non-ribosomal peptide synthetases, bacteriocins, fimbrial biogenesis, ushering proteins, toxins, secondary metabolite secretion and multiple drug resistance/efflux systems. By revealing the early stages of adaptation to this lifestyle, the Serratia sp. SCBI genome underscores the fact that in EPN formation the composite end result - killing, bioconversion, cadaver protection and recolonization- can be achieved by dissimilar mechanisms. This genome sequence will enable further study of the evolution of entomopathogenic nematode-bacteria complexes.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.